Preface


The subject matter, facts, principles, 
and theories of child development are 
inherently interesting to many people 
and strongly influence the work of 
many specialists, both researchers and 
practitioners, in the biological and be- 
havioral sciences. In spite of this, re- 
search in this field since the mid-twen- 
ties has not generally improved in qual- 
ity or increased in quantity as much as 
research in many other scientific dis- 
ciplines. Recognizing this state of af- 
fairs, members of the Committee on 
Child Development of the National 
Academy of Sciences—National Re- 
search Council (Alfred Baldwin, 
Chairman, Roger Barker, Boyd Mc- 
Candless, and George G. Thompson) 
proposed publishing this handbook 
which, the Committee hoped, might 
help to accelerate the rate and raise the 
caliber of empirical investigations of 
growth and development. The National 
Academy of Sciences—National Re- 
search Council agreed to sponsor it. 

The specific purpose of the handbook 
is to make available in a single source 
concrete descriptions and evaluations of 
the most widely used research tech- 
niques in many aspects of the study of 
child growth and behavior. In addition,
it purports to communicate,
wherever possible, some of the methodological
sophistication and “wisdom”
that investigators gradually acquire but 
rarely make explicit in their publica- 
tions. For example, problems of ethics, 
feasibility, and public relations may af- 
fect the selection of subject popula- 
tions, the development of instruments, 
and the manipulation of environmental 
variables. Experienced investigators may 
be able to offer good advice on these 
issues, which often make research pro- 
grams involving children extremely 
complicated. 

The exclusive emphasis of the hand- 
book is research methods; substantive 
content is included only for illustrative 
purposes. In a sense, the volume is ad- 
dressed directly to present and potential 
investigators who can use it as a basic 
reference and a guide to research mat- 
ters outside their own research special- 
ties. It should also be valuable to ad- 
vanced scholars and graduate students 
interested in a balanced picture of the 
techniques applied in child study. 

Although the objective was a com- 
prehensive presentation of methods, it 
is obviously impossible to encompass 
all techniques and all areas of research. 
Therefore, early in the planning of the 
book the Committee and the Editor 
met to formulate principles for selec- 
tion and organization of topics. It was 
decided to include only methods that 
have actually been used with children 
and have not been thoroughly described 
in other volumes, as intelligence tests, 
for example, have been. Discussions
would be limited to the research, as opposed
to diagnostic or therapeutic,
uses of these methods. Furthermore,
primary emphasis should be given to 
methods that require special modifications
when used with children; thus
statistical procedures would be omitted. 
These principles guided the writing and 
editing of this work. 

The Committee and the Editor de- 
vised a necessarily arbitrary organiza- 
tional plan and table of contents. To 
assure comprehensive coverage, several 
chapters had to be devoted to problems 
of general design and broad research 
orientations (e.g., experimental and nat- 
uralistic approaches); others dealt with 
specific techniques appropriate to a 
wide range of research activities (e.g., 
interview, projective techniques); still 
others focused on the most fruitful 
methods for studying particular con- 
tent areas (e.g., physical growth, per- 
sonality). 

Although such an arrangement in- 
evitably leads to some degree of overlap 
among chapters, it is often desirable. 
There are numerous techniques, useful 
over a broad range of investigatory 
problems, that can be appreciated most 
fully if they are considered from several 
points of view. For example, descrip- 
tions of specialized uses of projective 
and interview techniques are presented 
in the chapters on the assessment of 
personality characteristics and on at- 
titudes, even though comprehensive 
discussions of these techniques appear in 
other chapters. 

A volume of this sort has several 
obvious limitations. For one thing, re- 
search skills cannot be learned from 
books; they can be developed only 
through practice. At most, a handbook 
of this kind can aid the investigator 
only by informing him of available
techniques, their advantages and limita- 
tions, and the rules to be followed in 
applying them. Second, the book necessarily
reflects the present status of
research methods in the field. With 
progress in this field, as in any other, 
available methods will be improved and 
new and better methods will be invented.
As this occurs, this volume will
become outdated and less useful. Thus, 
in a sense, the venture contains the 
germ of its own destruction: its goal is 
to stimulate advances that may make it 
obsolete. 

The actual organization and develop- 
ment of this book required the coopera- 
tion, advice, and hard work of many. In 
the early stages of planning, letters ex- 
plaining the project and tentative out- 
lines were sent to a number of distin- 
guished scholars who were asked to 
comment and criticize. Their enthusias- 
tic and gratifying responses brought 
many suggestions for revision and ex- 
pansion. After incorporating most of 
these suggestions into a revised outline, 
the Committee and the Editor invited a 
number of outstanding specialists to 
prepare the chapters of the volume. We 
were fortunate, indeed, to have so many 
experts accept invitations to undertake 
extremely arduous scholarly and writing 
responsibilities. The Committee and the 
Editor are deeply grateful to these con- 
tributors for their indispensable efforts 
throughout the project. 

The assistance of many others must 
also be acknowledged. In a real sense 
members of the Committee on Child 
Development of the National Academy 
of Sciences—National Research Coun- 
cil (Alfred Baldwin, Boyd McCandless, 
Glen Finch, Irving Hallowell, and 
Richard Solomon) were collaborating 
editors. They gave their time and 
energy generously, working with me
on the formulation of the original out- 
lines, the selection of the contributing 
authors, and the critical evaluation of 
the “working” outlines submitted by 
the authors before they began writing 
their chapters. I also sought the advice
of colleagues at the University of California
and at other institutions, who
read one or more of the working outlines
or chapters and made valuable suggestions
for changes, deletions, or additions.
For this kind of assistance, I am
indebted to Nancy Bayley, Merl E. 
Bonney, Orville Brim, Alfred Castaneda,
Kenneth B. Clark, Susan Ervin, Mary 
Ford, Stanley Garn, Dale Harris, Edna 
Heidbreder, Alan Holmberg, Harry 
Levin, Eleanor E. Maccoby, John Mc- 
Kee, Norman Munn, Ethel Mussen, 
Norman Polansky, Philburn Ratoosh, 
Milton Rokeach, Earl Schaefer, Pauline 
Sears, Robert Sears, Melford Spiro, 
Celia B. Stendler, Mildred Templin, 
Leona Tyler, Sam Witryol, and John 
Zubek. Patricia Jones and Ethel Mussen 
prepared the index. 
Paul H. Mussen, Editor
Berkeley, California
July 1960


The Study
of Child Behavior 
and Development 


Alfred L. Baldwin 
Cornell University


It is proverbial that any actor foolish 
enough to appear on the same stage 
with a child should resign himself to 
having the scene stolen away from him. 
Children are one of the most interesting 
phenomena in the world; they are news- 
worthy, they are good copy for adver- 
tisements, their cute sayings appear in 
hundreds of magazines and newspapers 
every day. 


VALUES OF CHILD STUDY 


Although the inherent fascination 
children have for adults has not been 
an unmixed blessing for the progress of 
the science of child psychology, it is 
novel and pleasant to find business and 
pleasure so conveniently combined as 
they are in the study of child develop- 
ment. For the study of children is an 
important business. On the one hand, 
a better knowledge of children and
their development contributes in a very
practical way to human betterment. On 
the other hand, a better understanding 
of children has many implications for 
a theory of human behavior. 

Undoubtedly the greatest potential 
contribution of the study of children 
to human welfare stems from the spe- 
cial importance of childhood to the 
achievement of mental health. The gen- 
erally deleterious effects of institution- 
alization, broken homes, parental rejec- 
tion, or inadequate parental supervision 
during childhood can hardly be denied. 
But the identification of the “toxic 
agents” in these experiences has hardly 
begun, and our knowledge of the psy- 
chological mechanisms that mediate 
their influence is even scantier. Because 
of the absence of sound knowledge of 
these mechanisms we cannot justify 
or perhaps even have faith in the effi- 
cacy of current procedures for coun- 
teracting and nullifying these detri- 
mental effects. Although the potential 
benefits are tremendous, we must admit 
that the contributions of child study to 
the improvement of mental health are 
still largely potential. These promises 
can be realized only by laborious and 
painstaking research.

If mental health is the most important 
beneficiary of child-development re- 
search, education runs a close second. 
The current state of education, with its 
strongly antagonistic positions vying 
for acceptance, its rapid shifts from one 
procedure to another, its vulnerability 
to sloganized programs initiated by un- 
informed pressure groups, and its genu- 
ine failure to advance more rapidly, 
clearly points to a basic lack of solid 
empirical data on fundamental proc- 
esses. A great quantity of research in 
problems of learning, language development,
and thinking, among others, must
be carried out, confirmed, and then ap- 
plied to the practical problems of edu- 
cation. 

Numerous other practical questions 
also demand research in child develop- 
ment. Social and political attitudes are 
developed and strengthened during 
childhood. Family stability in future 
generations partly depends upon the 
interpersonal patterns of adjustment and 
adaptation being developed in today’s 
children. Precision of motor behavior
as well as accurate thinking are re- 
quired in a technological age. There is 
no end to the potential contributions 
of research in child development to human
welfare.

How can the possible benefits be 
realized? One research strategy, of 
course, is the direct frontal attack. 
There have been many attempts to 
change attitudes, to improve teaching 
techniques, or to build up resistance to 
the deleterious effects of anxiety. There 
should be many others. This book con- 
tains advice and suggestions for meet- 
ing the many intricate problems faced 
by such research efforts. 

If the examples of our sister sciences 
are valid for behavioral science, the 
most important contributions to prac- 
tical problems will not come from an 
immediate frontal attack but instead 
from the development of a sound 
theory of behavior. Progress toward 
this difficult goal does not turn upon 
the contributions of any single field. It 
requires the maintenance of a broad 
research front involving studies of hu- 
mans and animals, adults and children, 
normals and abnormals. It is well to
recognize, however, some of the special
contributions that the study of
children may make to the development
and testing of theoretical hypotheses.

First, children pose a theoretical 
problem. A behavior theory must encompass
the behavior of children as well
as adults, and its parameters must describe
the differences between individuals
of different ages.

Second, children are strategically 
valuable subjects for those whose aim 
is to add to general psychological 
theory. Especially for those whose concern
lies in the elucidation of symbolic
processes, children might be expected to
provide unusually valuable experimental
material. The relatively great impulsivity
of children might make them
useful subjects for the investigation of 
affective responses and the structure of 
defense mechanisms. By being relatively 
complicated, yet relatively inexperi- 
enced, children offer many attractive 
research possibilities to psychologists 
interested in learning problems. 

Third, child development is the ulti- 
mate problem of developmental psy- 
chology. Other species develop, of 
course, but most people’s interest in 
animal development lies in the potential 
contribution of this kind of study to 
the understanding of child development. 

There is no need to expound further 
on the potential values of child study 
for both practical and theoretical pur- 
poses. It is more important to assess the 
current state of the field, and this ap- 
praisal will be more intelligible if we 
look first at the historical development 
of child psychology. A review of the 
succession of problems that have cap- 
tured the attention of developmental 
psychologists will show how the purpose
of this book is conceived.


HISTORICAL SURVEY 
OF CHILD PSYCHOLOGY 


At one time child development was 
considered of such central importance 
that nearly every major theorist wrote 
on the subject. G. S. Hall, J. M. Bald- 
win, William McDougall, E. Claparede, 
Wilhelm Stern, Kurt Koffka, and J. B. 
Watson all made major attempts to en- 
compass the facts of child behavior and 
development into their general theories. 
Unfortunately, however, there was no 
great abundance of empirical data to be 
embraced by the theories. The empirical 
studies of the early period were largely 
descriptive accounts of the development 
of individual children, although Hall 
made an interesting survey of the “con- 
tent of children’s minds” (Hall, 1891). 

Certainly a major shift of interest and 
an advance in the quality of empirical 
data began with the work of Binet 
(1903). The mental-testing movement 
for a time comprised almost the entire 
field of developmental psychology, and 
mental testing is still a major practical 
service of child psychologists. Although 
psychometrics began with the study of 
intelligence in children, it rapidly ex- 
panded both in its coverage of other 
areas of personality and ability and in 
its attention to other age levels. 

Mental tests of all kinds were of immense
practical value. They also provided
important data on children’s behavior,
although the search for diagnostic
items seriously restricted its coverage.
As test items, the ball and field
test, absurdities, analogies, similarities,
and differences are all valuable, but the
data are too spotty to be as useful as 
they might be for those interested in 
the development of thinking. The use 
of the slope of the “per cent passing” 
curve as the major criterion for item 
selection in intelligence tests means that 
data are collected primarily at the age 
levels at which abilities change rapidly 
rather than in periods in which growth 
is more gradual. 

Although they do not describe all 
aspects of development—or any single 
one for that matter—mental tests do 
provide scores that are designed to show 
a continual gradual growth with age. 
This feature of the scores stimulated a 
somewhat unwarranted optimism that 
they described the psychological growth 
of the child in the same way that height 
and weight described his physical 
growth. It was not so apparent then as 
it is now that these curves are far from 
raw, theoretically neutral data and are 
dependent upon particular statistical 
manipulation. They are composite 
scores, very well suited for predicting 
such composite criteria as school 
achievement but not so useful for build- 
ing developmental theory. 

For various reasons, perhaps includ- 
ing these, the decade 1920-1930 marked 
the initiation of a number of “Growth 
Studies” at Harvard, the Brush Founda- 
tion, the University of California, the 
Child Research Council at Denver, Fels 
Institute, Merrill-Palmer, and other 
places. These long-range, longitudinal 
research programs were not all alike by 
any means, but, as originally conceived, 
they proposed to chart the physical, 
physiological, and psychological growth 
of individual children over a period of 
years. For ten to fifteen years these
programs and the scientists involved in 
them dominated the field of child de- 
velopment. The Committee on Child 
Development of the National Academy 
of Sciences—National Research Council,
the group that is sponsoring this
book, began in 1920. One of its actions
was the formation of the Society for 
Research in Child Development. 

This period of the 1920's was also 
marked by a certain fragmentation in 
the field. Psychometrics went its own 
way. Statistical sophistication became 
increasingly important in psychometric 
research, and psychometrics kept its 
close tie to applied problems. 

Child development, as represented in 
the longitudinal studies, became more 
closely linked to biological science than 
it had been formerly. Mental and psy- 
chological development viewed as one 
aspect of biological growth led research 
in child development in a somewhat 
different direction from the main 
stream of psychological research. This 
same organismic viewpoint did not en- 
courage the belief that the fate of de- 
velopmental psychology was closely 
tied to the growth of a general theory 
of behavior. At the same time, the main 
stream of psychological research was 
gradually turning toward the study of 
learning as the core problem, this em- 
phasis becoming more prominent after 
1930. Child development and behavior 
theory became separated from each 
other, and the field of developmental 
psychology was no longer a part of 
the main stream of psychological re- 
search. 

This does not mean that there were 
not conspicuous theoretical issues that
marked the whole field of developmental
psychology in the 1920’s and early
1930's. The controversy between he- 
redity and environment, maturation and 
learning, raged throughout the period 
although the hereditary position was 
considerably more popular. The controversy
appeared in the many studies
of constancy of the IQ and its modifiability
by various sorts of environmental
treatments and special educational
procedures (e.g., Terman, 1925;
Goodenough, 1928). It appeared in the
studies of the development of instinctive
responses, such as pecking in chicks
(Bird, 1933), and in the studies showing
how pecking can be modified by conditions
in the egg (Kuo, 1932). It appeared
in the flurry of studies on the
susceptibility of stair climbing, cutting 
with scissors, buttoning buttons, etc., to 
special training (Gesell and Thompson, 
1929; Hilgard, 1933). Many of these 
studies were excellent, and we now 
need more of them, but they were tied 
to a theoretical controversy, and as the 
heat died away the studies disappeared 
with it. Another hot issue was whether 
development proceeded from the gen- 
eral to the specific or the specific to 
the general (Carmichael, 1934; Kuo, 
1939). 

If mental tests marked one turning 
point in child development, the growth 
of interest in the effects of childhood 
experience upon adult personality 
marked a second. Somewhere between 
1935 and 1945 the entire atmosphere of 
child-development research shifted. 
Without ever repudiating openly the 
predictability of psychological
development that was implicit in the 
original conception of the growth 
studies, the field turned to studies of the
effects of weaning, toilet training, birth 
injuries, broken homes, institutionaliza- 
tion, maternal absence, and maternal re- 
jection upon the psychological develop- 
ment of the child. Without question, 
this turn of affairs stems from the in- 
fluence of Freud. It is no accident that 
the early research was designed to test 
specific Freudian hypotheses—generally 
so oversimplified that no Freudian 
would accept them as valid tests. Grad- 
ually the interest spread from specific 
factors, such as weaning and toilet train- 
ing, to more general characteristics of 
childhood experience. Also, the research 
became less tied to any specific theo- 
retical hypothesis and became a more 
general study of the influence of child- 
hood experience upon personality and 
its growth. This period marks the con- 
vergence of learning theory and re- 
search in child development by way of 
the translation of Freudian concepts 
into the terminology of general be- 
havior theory by such theorists as Dol- 
lard and Miller (1950), Whiting and 
Child (1953), and Mowrer (1950). 

This history is oversimplified and 
points out only the most general trends. 
It neglects entirely, for example, the 
research stemming from the work of 
Piaget on thought, morals, etc. (Piaget, 
1926, 1932, 1950). It also fails to indi- 
cate the interrelations between cultural 
anthropology and child development 
that have played an important part in 
the history of research in family rela- 
tions and family sociology paralleling 
child-development research. The in- 
terrelations between these two fields 
have not been outlined.

The fact that child-development research
has only recently begun to be
reintegrated into the general field of 
psychology has implications for re- 
search in both areas. Developmental 
psychologists have devised techniques 
and problems that are not common 
knowledge in the field of general psy- 
chology, and they have acquired a 
wisdom that will be useful to those who 
plan to conduct research with chil- 
dren. Conversely, there are techniques, 
experiments, and problems in the gen- 
eral psychological literature that may 
not be familiar to those research work- 
ers who have kept close to the main 
stream of child-development research. 

It seems appropriate, therefore, to 
try to collect in one volume an account 
of the research methods that have been 
used in the study of children. Some
are methods that were developed for use 
with adults and require some modifica- 
tion before they can be made suitable 
for children. Others are buried in the 
literature and deserve to be brought to 
light. Partly, then, this book is a collec- 
tion of specific methods. It also has 
another function: to describe some of 
the general problems involved in working
with children. There are practical
and ethical problems involved in obtaining
subjects. There are methodological
problems in the use of interview
methods, questionnaires, attitude scales,
and observational procedures
with children that do not arise,
or are less important, in the
study of adults.

We have attempted, therefore, to 
limit the contents of this book in two 
ways. First, it reports methods that 
have been used; only in a few cases 
have we tried to include interesting 
possibilities of extending present methods
of research among adults or animals
to the study of children. Second, it is 
limited, more or less, to methods that 
require some modification when they 
are used with children. There is almost 
nothing on statistical procedures, for
example, since there are no special 
children’s statistics. 

In organization, the book moves from 
the general to the specific. The remainder
of this chapter is concerned
with the most general problems of research
in child development: scientific
method and strategy, the classification
of the problems of the field, the ethical
problems involved in studying children,
and, finally, the practical problems
of procurement of subjects. The
next few chapters deal with general 
methods that are suitable for many 
different problems, such as experimenta- 
tion and observation. Then we move 
into specific areas of research and dis- 
cuss specific methods for investigating 
particular problems. Each of the chap- 
ters has been written by an active in- 
vestigator who has consented to survey 
the field and to describe the particular 
procedures that have been found 
suitable for the study of children.


The Logic of Science and 
Scientific Strategy 


Behavioral scientists are rather self- 
conscious these days about the nature 
of scientific theory, and it is only proper 
that they should be. It is easy to confuse 
definitions with assumptions and ter- 
minology with findings in the difficult 
area of psychological research. At the 
same time, however, there is some
danger that a preoccupation with the
canons of theory construction may
hamper research rather than aid it and 
may interfere with theory building 
rather than improve it. 

Consequently, the present discussion 
is divided into two portions. The first is
concerned with the nature of scientific 
theory and the criteria a conceptual 
scheme must meet before it can be 
called a genuinely scientific theory.
These criteria are somewhat like ideals 
in that they mark the end of the road 
for theory building. We strive to pro- 
duce a theory that satisfies our ideals, 
but if we ever achieved it our job 
would be finished. All good theories in 
this sense are dead theories. 

Therefore, the second portion of the 
discussion is concerned with theory 
building, and it deals especially with 
the early stages of a science when non- 
sense questions and tacit assumptions 
lurk behind every hypothesis. A science 
in the making is an untidy affair, and 
wise answers to everyday problems 
are not obtained by knowing clearly 
what a mature science looks like. It 
is easier to know what we are striving 
for than how to attain it. 


NATURE OF SCIENTIFIC THEORY 


Turning then to the criteria of a 
scientific theory, we find a consensus on 
most of the elementary issues despite 
serious controversy on the basic prin- 
ciples and a lamentable lack of agree- 
ment on terminology. The present dis- 
cussion is couched largely in the lan- 
guage of Braithwaite (1955). 

A scientific theory is a deductive 
system. It contains a set of hypotheses, 
some of which may be logically deduced
from the others in the set. These
hypotheses can be arranged in a sort of 
hierarchical order. 


1. There will be a set of highest-level 
hypotheses, generally called assumptions 
and definitions, from which all the rest 
of the set can be deduced. Which ones 
are called assumptions and which ones 
are deduced from them is sometimes an 
arbitrary matter; there may be more 
than one way that the same set of hy- 
potheses can be arranged into a deduc- 
tive system. 

2. There will be middle-level hy- 
potheses that are deducible from the 
fundamental assumptions and are also 
the premises from which lower-level 
hypotheses are deduced—similar to the 
early theorems in a geometry course.

3. There will be lowest-level hy- 
potheses that are deducible from the 
rest of the system but are not premises 
for further deductions. These are em- 
pirically testable hypotheses or predic- 
tions if the system as a whole is testable. 


We can illustrate these properties of 
a deductive system with the following 
set of hypotheses. It is a sort of carica- 
ture of the theory of rational man. 


1. Any person who chooses between 
several actions will choose the act whose 
consequences have the highest value for 
him. 

2. A financial gain has a higher value 
than a financial loss. 

3. Any person will choose a financial 
gain over a financial loss. 

4. If I give John Jones a choice between
gaining a one dollar bill and losing
one dollar in my laboratory on
Friday, April 10, at 2:00 P.M., he will
choose to gain the dollar. 


In this system hypotheses 1 and 2 are 
“highest-level” hypotheses, i.e., assumptions.
Hypothesis 3 is a middle-level hypothesis.
It is deduced from 1 and 2 and
is in turn the premise for the deduction 
of hypothesis 4. Hypothesis 4 is an 
empirically testable hypothesis deduced 
from 3. It is a specific description of 
one example of the class of events 
described by hypothesis 3. Actually, 
to make the deduction completely rig- 
orous, we would have to specify that 
John Jones is a person, that gaining the 
dollar bill is a financial gain, etc. But 
it is characteristic of testable hypotheses
that they are couched in a terminology 
that permits the events described in 
them to be easily recognized. This prob- 
lem is discussed later, along with other 
problems of operational definitions. 
Suffice it to say here that for the 
theoretical system to be testable it must 
contain these lowest-level empirical hy- 
potheses that predict specific empirical 
events. 

Let us consider now the problem of 
confirmation and disconfirmation of the 
system. Any particular empirical hypothesis
is necessarily derived from
some of the highest-level hypotheses of
the system, but it need not depend on
all of them. If the empirical hypothesis
is confirmed, the higher-level hypotheses
from which it is derived are supported
but not proved. On the other
hand, the failure of the empirical hy- 
pothesis is proof that one or more of 
the hypotheses at the highest level from 
which it was derived are wrong. Usu- 
ally, however, it is impossible to assign 
the error precisely to one specific highest-level
hypothesis. There is an error
“somewhere,” but it is not clear where, 
and any one of several changes may 
remedy the problem. 
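
The bookkeeping behind a disconfirmation can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the labels H1 to H4 and the table `derived_from` are introduced here to encode the four hypotheses of the rational-man example, and the function names are hypothetical. The sketch traces a failed prediction back to the highest-level hypotheses it rests on; it can list the suspects, but, as noted above, it cannot say which one is at fault.

```python
# Hypothetical encoding of the four-hypothesis example. Each hypothesis
# maps to the premises it is deduced from; assumptions have no premises.
derived_from = {
    "H1": [],            # a person chooses the act with the highest value
    "H2": [],            # a financial gain has a higher value than a loss
    "H3": ["H1", "H2"],  # middle level: any person chooses gain over loss
    "H4": ["H3"],        # lowest level: the prediction about John Jones
}


def assumptions_behind(hypothesis):
    """Return the set of highest-level hypotheses this one rests on."""
    premises = derived_from[hypothesis]
    if not premises:
        # A hypothesis with no premises is itself an assumption.
        return {hypothesis}
    found = set()
    for premise in premises:
        found |= assumptions_behind(premise)
    return found


def suspects_after_failure(prediction):
    """If the prediction fails, at least one of these must be wrong."""
    return sorted(assumptions_behind(prediction))


print(suspects_after_failure("H4"))
```

Run on the example, the failed prediction H4 implicates both H1 and H2, with nothing in the deductive structure to choose between them.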

How precisely the faulty hypothesis 
may be located depends upon how 
many are involved in the derivation. 
A disconfirmation of a lowest-level hy- 
pothesis almost always disproves the 
hypothesis at the next level up because 
the lowest-level hypothesis is almost 
always merely a specific case of a gen- 
eral statement and is derived totally 
from one second-level hypothesis. 
Where the error lies above that point 
is generally problematical. 

Turning again to the consequences of 
a confirmation, we notice that a con- 
firmation lends support to the higher- 
level hypotheses from which it is de- 
rived, and it also provides indirect con- 
firmation to other collateral hypotheses 
that represent derivations from the same 
set of highest-level hypotheses. In other 
words, any support for a general hy- 
pothesis lends support to all its implica- 
tions, not merely to those in the direct 
line of derivation. 

Suppose, for example, we inserted a 
pair of hypotheses 1a and 1b just after 
hypothesis 1 in the example. Hypothesis 
1a restates hypothesis 1 for men, i.e., 
“any man who chooses between. . .” 
and hypothesis 1b states the same hy- 
pothesis for women. Now, the empirical 
hypothesis 4 is deduced from 1a, not 1b,
since John Jones is a man. Thus, if the 
empirical hypothesis is confirmed, it 
supports hypothesis 1a and, through it, 
hypothesis 1. Now, if hypothesis 1 is 
true, hypothesis 1b is necessarily true; 
therefore, whatever support hypothesis
1 receives from the test is also support
for hypothesis 1b, even though hypoth- 
esis 1b is not in the direct line of deduc- 
tion. 
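
The distinction between the direct line of derivation and collateral hypotheses can also be sketched in Python. Again this is an illustration only: the labels and the `derived_from` table are assumptions introduced here, with H1a and H1b standing for the versions of hypothesis 1 restricted to men and to women. The sketch lists which hypotheses a confirmed prediction supports directly and which it supports only indirectly; it makes no attempt to say how much support each receives.

```python
# Hypothetical encoding of the example with hypotheses 1a and 1b inserted.
derived_from = {
    "H1": [],              # general choice hypothesis
    "H2": [],              # gain has a higher value than loss
    "H1a": ["H1"],         # hypothesis 1 restated for men
    "H1b": ["H1"],         # hypothesis 1 restated for women
    "H3": ["H1a", "H2"],   # middle-level hypothesis used in the deduction
    "H4": ["H3"],          # prediction about John Jones, who is a man
}


def direct_line(hypothesis):
    """All hypotheses the given one is deduced from, at any level."""
    found = set()
    for premise in derived_from[hypothesis]:
        found.add(premise)
        found |= direct_line(premise)
    return found


def collateral(hypothesis):
    """Hypotheses outside the direct line that share a premise with it."""
    direct = direct_line(hypothesis)
    shared = set()
    for other in derived_from:
        if other == hypothesis or other in direct:
            continue
        if direct_line(other) & direct:
            shared.add(other)
    return shared


print(sorted(direct_line("H4")))
print(sorted(collateral("H4")))
```

In this encoding a confirmation of H4 supports H3, H1a, H2, and H1 directly, and H1b only as a collateral consequence of H1.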

Intuitively, it seems that the test 
provides less support for 1 and 1b than 
for 1a because the hypothesis might 
hold for men and not for women. There 
have been some attempts to formulate 
these intuitive notions into a system 
that would quantify the support a par- 
ticular finding provided for the various 
higher-level hypotheses that were in- 
volved in the deductive system, but the 
problem is extremely difficult, and the 
attempts have not been very successful 
so far. 

This view of scientific theories as de- 
ductive systems is straightforward 
enough in its general features, but the 
exposition hides the multitude of dif- 
ficulties that arise from the fact that a 
scientific theory is a language as well 
as a deductive system. For example, in 
the little theory described earlier the 
term “value” is used in hypothesis 1. 
Depending upon the way that term is 
defined, the entire impact of the theory 
is changed. In the example it was de- 
fined in part by hypothesis 2. Financial 
gain is said by definition to have a 
higher value than financial loss. If 
hypothesis 2 is taken as a definition of 
value, then a failure of the prediction on 
John Jones would invalidate hypothesis 
1 because a definition cannot be invalid. 

On the other hand, we could define 
value in terms of preference. “The value 
of A is higher than the value of B if a 
person chooses A rather than B.” Now, 
hypothesis 1 is essentially a definition, 
and hypothesis 2 becomes a testable 
hypothesis of the values of financial
gains and losses. A failure of the prediction
on John Jones would in this case
invalidate hypothesis 2 rather than hy- 
pothesis 1. 

We can see from this discussion how 
complicated and at times arbitrary is the 
problem of distinguishing definitions 
from assumptions. Basically the prob- 
lem is that of deciding which hypotheses 
are “contingent” and which are “non- 
contingent.” A contingent hypothesis is 
one that makes some testable deduction 
that may be right or wrong. It can in
theory be disproved. A noncontingent 
hypothesis is really a statement about 
the language in the theoretical system or 
about the rules of deduction and not 
about the subject matter. Noncontin- 
gent hypotheses cannot be tested or 
proved wrong. The scientist may de- 
cide, however, that a definition is not 
useful and redefine his conceptual 
terms in an attempt to improve a theory. 

Noncontingent hypotheses form a 
part of every theory because they per- 
mit the contingent hypotheses to be 
tested. The terms in the statement “all 
men are mortal” must be defined so that 
the scientist can recognize a “man” 
when he sees one and determine 
whether or not something is “mortal.” 
In other words, the terms in a directly 
testable hypothesis must be operation- 
ally or empirically defined—otherwise 
the hypothesis is not testable. 

Thus definitions are essential for test- 
ing a theory, but they may also reduce 
apparently testable hypotheses to logical 
tautologies. It is all too easy in an ex- 
periment to define the terms in a hy- 
pothesis so that the definitions and the 
derivations from the hypothesis coincide
completely. In this case the hypothesis
is completely deducible from
the definition and is, therefore, not 
testable, since it cannot be disconfirmed. 
For example, the hypothesis that aggres- 
sive children will seize opportunities to 
hurt children who cannot retaliate may 
be derived from some high-level hy- 
potheses about aggression, but if the 
definition of aggressive is “a tendency 
to injure others when there are no nega- 
tive consequences” then the hypothesis 
becomes directly derivable from the 
definition. Consequently, it cannot be 
disconfirmed, and it lends no support 
to the higher-level hypothesis from 
which it was presumably derived. 
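
The point can be made concrete with a small sketch. It is an illustration under our own assumptions: the two observations recorded for each child and the "teacher rating" used as an independent definition are hypothetical and are not taken from any study.

```python
from itertools import product


def prediction_holds(aggressive, hurts_when_safe):
    """Empirical hypothesis: aggressive children hurt others who cannot retaliate."""
    return (not aggressive) or hurts_when_safe


def circular_definition(hurts_when_safe, teacher_rating):
    """'Aggressive' is defined by the very behavior the hypothesis predicts."""
    return hurts_when_safe


def independent_definition(hurts_when_safe, teacher_rating):
    """'Aggressive' is defined by a separate (hypothetical) teacher rating."""
    return teacher_rating


def can_be_disconfirmed(classify):
    """Search every conceivable observation for one that contradicts the prediction."""
    for hurts_when_safe, teacher_rating in product([False, True], repeat=2):
        aggressive = classify(hurts_when_safe, teacher_rating)
        if not prediction_holds(aggressive, hurts_when_safe):
            return True
    return False


print("circular definition testable:", can_be_disconfirmed(circular_definition))
print("independent definition testable:", can_be_disconfirmed(independent_definition))
```

With the circular definition no conceivable observation contradicts the prediction; with the independent definition a child rated aggressive who does not hurt a defenseless child would do so.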

In summary, then, hypotheses may 
be untestable for a variety of reasons. 
They may not lead to empirical hy- 
potheses either because empirical defini- 
tions of the terms are lacking or be- 
cause no lower-level hypotheses can be 
derived from them. On the other hand, 
the terms may be so defined that the 
empirical hypothesis is derivable from 
the definition rather than from the 
hypothesis to be tested. 

Let us turn now to the discussion of 
operational or empirical definitions. The 
necessity for operational definitions of 
the terms in a scientific theory has been 
strongly emphasized in methodological 
discussions of psychological research 
during the last twenty years. The em- 
phasis has been valuable and has cer- 
tainly improved the quality of 
psychological research. But the demand
for operational definitions has some- 
times been interpreted so rigidly and 
dogmatically that fruitful research 
activity has been curtailed. 

There is no question but that a good 
scientific theory must be testable, i.e., its
lowest-level hypotheses must be empirical
hypotheses that can be checked
against the facts and confirmed or not. 
These tests cannot be made unless the 
terms in the lower-level hypotheses have 
some sort of operational definition, but 
it is important not to put unnecessarily 
strict requirements upon the operation- 
alization of a theory. 

We propose, therefore, to state the 
requirements for operationalization in 
a somewhat weaker form than that in 
which they are usually put. Without 
losing the essential value of operational 
definitions, we can make the following 
four statements in regard to those situa- 
tions in which unique operational defini- 
tions of every term are not necessary. 


1. The same term may have different 
operational definitions in different hy- 
potheses. 

2. There is no necessity for a term 
to have an operational definition that is 
applicable in situations in which there 
are no hypotheses to test. 

3. The lack of explicit operational 
definitions of some of the terms in the 
higher-level hypotheses does not imply 
that the lower-level hypotheses are un- 
testable. 

4. It is not necessary for every in- 
dividual term to have an individual 
operational definition. Sets of terms may 
correspond to sets of empirical events 
without each term corresponding to an 
individual event. The third and fourth 
statements may be equivalent ways of 
saying the same thing. 

Equivalence of Different Definitions. 
It is obvious in other sciences that the 
same term is not always operationally 
defined in the same way. The measurement
of the temperature of the sun is
made in a completely different way 
from the measurement of the tempera- 
ture of a room. This does not mean that 
the selection of the measurement device 
is arbitrary or that the two measures 
of temperature are not equivalent. In 
physics, a complicated scientific system 
containing many terms, it often turns 
out that the same term temperature 
turns up all alone on one side of a 
formula, with some set of terms on the 
other side that does not include tem- 
perature. It may be related to the 
length of a solid bar in one formula, the 
color of molten metal in another, and 
the pressure of a gas in a third. Every 
such formula is a potential definition of 
the term temperature. As long as all of 
these different formulas are members of 
the same set of hypotheses forming a 
deductive system, then the different 
measures of temperature will be equi- 
valent. There may be some situations 
in which one measurement is tech- 
nically or theoretically inapplicable. 
The equivalence lies in the coherent 
structure of the whole system, not in the 
detailed matching of one instrument 
against the other. 
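
As a rough sketch of this coherence, consider two of the formulas mentioned: the linear expansion of a solid bar and the pressure of a gas held at fixed volume. The numerical values below (reference temperature, expansion coefficient, reference pressure) are illustrative assumptions, and the two "readings" are generated from the same assumed laws, so the agreement shows only that the definitions cohere within one system, not that any measurement was made.

```python
def temperature_from_bar(length, ref_length, ref_temp, alpha):
    """Invert linear expansion: length = ref_length * (1 + alpha * (T - ref_temp))."""
    return ref_temp + (length / ref_length - 1.0) / alpha


def temperature_from_gas(pressure, ref_pressure, ref_temp):
    """Invert the fixed-volume gas law: pressure / T = ref_pressure / ref_temp (kelvin)."""
    return ref_temp * pressure / ref_pressure


# Illustrative constants, assumed for this sketch only.
REF_TEMP = 293.15        # kelvin
REF_LENGTH = 1.0         # meters
ALPHA = 1.2e-5           # per kelvin, roughly the order of magnitude for steel
REF_PRESSURE = 101325.0  # pascals

# Generate two different readings from one underlying temperature.
true_temp = 350.0
bar_length = REF_LENGTH * (1.0 + ALPHA * (true_temp - REF_TEMP))
gas_pressure = REF_PRESSURE * true_temp / REF_TEMP

t_bar = temperature_from_bar(bar_length, REF_LENGTH, REF_TEMP, ALPHA)
t_gas = temperature_from_gas(gas_pressure, REF_PRESSURE, REF_TEMP)
print("from bar length:", round(t_bar, 6))
print("from gas pressure:", round(t_gas, 6))
```

Neither function refers to the other instrument; the two agree only because both formulas belong to the same set of hypotheses.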

In behavioral science the same term 
frequently appears in different con- 
texts, but seldom do the different usages 
fit into a coherent logical system. Ag- 
gression may be measured by fights per 
hour in nursery school, by aggressive 
stories on the Thematic Apperception 
Test (TAT) or by violations of the 
rules of a game. There is no doubt that 
aggression has different operational defi- 
nitions in different situations, but the 
trouble lies in the fact that these three 
definitions are not all parts of a coherent
connected set of hypotheses. There is no
assurance that the three measures are 
equivalent or compatible. 

It is common practice to correlate 
two measures to test for equivalence, 
and a high correlation is surely com- 
forting. On the other hand, two opera- 
tional definitions may be equivalent 
yet uncorrelatable. The length of the 
solid bar and the color of the molten 
metal are uncorrelatable yet compatible 
definitions of temperature. 
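
For readers who want the usual correlational check spelled out, a minimal sketch follows. The scores are invented for illustration; they are hypothetical values for six children on two of the measures of aggression mentioned above.

```python
from math import sqrt


def pearson(xs, ys):
    """Product-moment correlation between two equally long lists of scores."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two lists of equal length with at least two scores")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    covariation = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = sum((x - mean_x) ** 2 for x in xs)
    spread_y = sum((y - mean_y) ** 2 for y in ys)
    if spread_x == 0 or spread_y == 0:
        raise ValueError("correlation is undefined when one measure does not vary")
    return covariation / sqrt(spread_x * spread_y)


# Hypothetical scores for six children, invented for this sketch.
fights_per_hour = [0.5, 1.0, 2.5, 0.0, 3.0, 1.5]
tat_aggressive_stories = [1, 2, 4, 0, 5, 2]

r = pearson(fights_per_hour, tat_aggressive_stories)
print("correlation between the two measures:", round(r, 3))
```

The check requires both measures to be taken on the same cases. That requirement is exactly what the bar and the molten metal cannot meet, which is why their equivalence has to rest on the theory instead.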

Inapplicability of Definitions. The 
statement that there is no need for 
operational definitions when there are 
no hypotheses to test seems obvious. If 
a theory makes no predictions about 
children, its terms need not be defined 
for children. This fact has not always 
been fully recognized, however. In some 
writings about operational definitions 
of motives, for example (Estes et al., 
1954), it has been tacitly assumed that a 
proper operational definition of a 
motive is one that permits us to watch 
an animal or a person in a free situation 
and to infer his motives from the ob- 
servation of his ongoing behavior. This 
is equivalent to asking for an operational 
definition of gravity or friction that 
could be applied by an observer of an 
avalanche. Although a generally ap- 
plicable operational definition of a 
motive would be a real achievement, 
it is not a necessity for scientific prog- 
ress. We can be well satisfied at first 
if a theory can make testable predictions 
even in very special purified situations 
—analogous to the fall of a body in a 
vacuum—and leave for later the more 
complicated problem of measuring 
gravity from the path of a projectile in 
the air. 


Fortunately, psychologists most often 
set these strict requirements when they 
are talking about methodology, not
when they are doing research. Hull, for
example, defined habit strength in terms 
of the number of reinforcements of 
the response (Hull, 1943). This is a 
clear operational definition, but it is 
possible to use it only if the history of 
the organism is known. Given a strange 
animal, this definition does not provide 
any way of diagnosing the habits it has 
and how strong they are. Although this 
might be considered a sign of the 
theory’s incompleteness, the theory can- 
not be criticized on methodological 
grounds for failing to provide an adequate
operational definition.

Necessity of Operational Definitions. 
We have seen, thus far, that the same 
operational definition cannot be em- 
ployed in every situation in which a 
theory is relevant and that there may be 
others in which a theory provides no 
operational definition. Let us turn now 
to a more debatable point, that some of 
the terms in a theory may not have 
operational definitions at all. Such 
terms, if they exist, cannot occur in the 
two lowest levels of the system of 
hypotheses because these are the levels 
at which hypotheses must be directly 
testable. At the higher levels such terms 
may exist. Braithwaite (1955) gives a 
very simple example. Suppose A, B, and 
C are three observable characteristics. 
In the theoretical system the following 
three hypotheses might occur: 


Everything that is A is also L and M. 
Everything that is B is also M and N. 
Everything that is C is also N and L. 


This system has a number of testable
consequences, e.g., everything that is A
and B is also C. However, the important 
feature of this discussion is that these 
testable hypotheses stem from the three 
listed above without requiring any 
operational definition of L, M, and N. 
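
The derivation can be checked mechanically by running through every combination of the six properties. The sketch below is ours, not Braithwaite's, and it rests on an assumption that should be stated loudly: it tries two readings of the three hypotheses, a one-way reading (whatever is A is L and M) and a stronger reading (something is A exactly when it is L and M), and reports for each whether the consequence is forced.

```python
from itertools import product


def one_way(a, b, c, l, m, n):
    """Each hypothesis read as a one-way implication, e.g. A implies (L and M)."""
    return (
        ((not a) or (l and m))
        and ((not b) or (m and n))
        and ((not c) or (n and l))
    )


def two_way(a, b, c, l, m, n):
    """Stronger reading assumed for this sketch: A holds exactly when L and M hold."""
    return a == (l and m) and b == (m and n) and c == (n and l)


def consequence_is_forced(hypotheses_hold):
    """True if no case satisfying the hypotheses is A and B without also being C."""
    for a, b, c, l, m, n in product([False, True], repeat=6):
        if hypotheses_hold(a, b, c, l, m, n) and a and b and not c:
            return False
    return True


print("one-way reading forces the consequence:", consequence_is_forced(one_way))
print("two-way reading forces the consequence:", consequence_is_forced(two_way))
```

In neither reading does the check ever consult an observation of L, M, or N; they appear only as placeholders that link the observable properties. The reading chosen does matter, however, once the system is written out formally.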

The reader may feel that to invent 
three undefined terms to account for 
three observable terms is no great 
achievement. It is hardly parsimonious 
in the present case. If the model is ex- 
tended, however, the number of factors 
will be considerably less than the num- 
ber of observables. Thus six observables 
will correspond to four factors, fifteen 
observables to six factors, etc. The
resemblance to factor analysis is more 
than superficial. 
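
The economy of the extended model can be counted directly. The sketch assumes, as in the three-term example, that each observable corresponds to one distinct pair of unobserved factors; that reading of "extended" is our assumption.

```python
from math import comb


def observables_for(factors):
    """Number of observables when each one corresponds to a distinct pair of factors."""
    return comb(factors, 2)


# Tabulate the count for systems of three to six factors.
for factors in range(3, 7):
    print(factors, "factors ->", observables_for(factors), "observables")
```

Under this assumption the number of observables grows as k(k - 1)/2 for k factors, so the factors are soon outnumbered.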

In a brilliant discussion Braithwaite 
shows that there is no way to “solve” 
the three formulas for L, M, and N in 
terms of A, B, and C without making 
some added assumptions. If, for ex- 
ample, we assume that A is equivalent 
to L and M, rather than merely imply- 
ing L and M, and make similar assump- 
tions for the other formulas, then we 
can come to definitions of L, M, and N. 
But, as Braithwaite then shows, these 
assumptions so rigidify the theoretical 
system that it cannot easily be expanded 
to four factors to accommodate addi- 
tional observations. 

Regardless of whether we agree that 
there are actual disadvantages in de- 
manding operational definitions of all 
the higher-level terms in a theory, this 
illustration does demonstrate that the 
lack of such definitions does not prevent
the system from having testable hy- 
potheses. 

A similar sort of example could be 
used to illustrate the fact that in some
situations a whole set of theoretical
terms is operationally defined by a 
whole set of performance variables. In 
factor analysis, for example, in the 
absence of pure tests factor scores are 
determined by weighting a battery of 
test scores in a particular way. The 
same set of test scores is weighted 
differently to measure a different factor. 
Thus no single test score is related in a 
one-to-one fashion to a factor, but the 
whole battery serves to define a set of
factor scores. 
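
A minimal sketch of this arrangement follows. The test names, standard scores, and weights are all hypothetical; in practice the weights would come from a factor analysis of the battery, which is not carried out here.

```python
# Hypothetical standard scores of one child on a three-test battery.
TEST_SCORES = {"vocabulary": 1.2, "arithmetic": -0.4, "block_design": 0.7}

# Hypothetical weights: every factor draws on every test, with different weights.
FACTOR_WEIGHTS = {
    "verbal": {"vocabulary": 0.8, "arithmetic": 0.3, "block_design": 0.1},
    "spatial": {"vocabulary": 0.1, "arithmetic": 0.2, "block_design": 0.9},
}


def factor_score(scores, weights):
    """Weighted sum of the whole battery for one factor."""
    return sum(weights[test] * score for test, score in scores.items())


factor_scores = {
    factor: factor_score(TEST_SCORES, weights)
    for factor, weights in FACTOR_WEIGHTS.items()
}
for factor, value in factor_scores.items():
    print(factor, round(value, 2))
```

No single entry in the battery stands for a factor; each factor score exists only as a function of all the tests together.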

We see that even in a well-developed 
theoretical system operational defini- 
tions are not simple universal behavioral 
indicators of every theoretical term. 
The absolutely essential feature of a 
theory is its testability, and the demand 
for operational definitions has been in- 
tended to achieve that objective. We 
do well, however, to focus our atten- 
tion on the adequate testability of the 
theory rather than an insistence upon 
operational definitions for every in- 
dividual term. 


SCIENTIFIC STRATEGY 


The discussion thus far has been 
limited to those characteristics of a 
theory that make it adequate or inade- 
quate to explain a body of data. As 
scientists, we should have such a theory 
as our objective, but in a real sense 
these criteria have to do with theory 
testing rather than theory building. It is 
not obvious that the greatest contribu- 
tions to a science come from theories 
that meet these criteria of logical de- 
ducibility of subordinate from super- 
ordinate hypotheses or of maximum 
testability. In fact, there have been extremely
stimulating conceptual systems
whose fertility as sources of research 
activities cannot be doubted, despite the 
fact that they were poor theories from 
the point of view of logic and testability.
Freud’s and Lewin’s writings
come to mind immediately. On the 
other hand, Hull’s theory has also been 
very fruitful as a stimulus to research, 
and it was deliberately constructed to 
be a model of logic and testability. 

Just what makes one point of view 
fruitful and another sterile is not likely 
to be describable in a few simple prin- 
ciples. It is almost certainly not a func- 
tion of the theory alone; the man behind 
the theory can be the source of its 
popularity or its lack of acceptance. 
The state of the field is also important; 
there are numerous cases in the history 
of science in which precocious theoreti- 
cal suggestions were ignored until the 
field caught up. The present discussion, 
therefore, will not attempt a set of 
criteria for “good” errors or fruitful 
speculation in scientific thinking. On 
the other hand, it does seem worthwhile 
to consider the variety of contributions 
to the psychological literature that are 
called theoretical discussions and make 
some distinctions among them. 

Empirical Speculation. One type of 
discussion is called theoretical because 
it does not deal with established data, 
but in fact it is not properly theoretical 
at all. It is empirical speculation without 
empirical evidence. Freud’s hypothesis
that in the early history of society 
people were organized into “primal 
hordes” is an example. Arguments about 
the extent to which young infants per- 
ceive the mother’s anxiety and/or dis- 
likes are essentially empirical arguments. 


Such hypotheses are frequently not in- 
tegrated into any system; they are 
merely stated. Sometimes they are 
stated as facts, sometimes as empirical 
hypotheses; but, in either case, the con- 
troversies they arouse usually concern 
the truth of the statement. In many 
cases the statements are testable enough 
in principle but are untestable in actual- 
ity because of technical difficulties. 
There is nothing essentially unobserv- 
able about a primal horde; we are 
hampered by technical difficulties, i.e., 
no time machine to take us back into 
the past. 

Such empirical hypotheses are fre- 
quently fruitful because they stimulate 
research to confirm or disconfirm them. 
When they are firmly deduced from 
theoretical systems, they can be ex- 
ceedingly fruitful as predictions whose 
confirmation will lend strong support 
to the theory. Even when they are not 
genuinely based upon a theoretical 
system, they may still lead to important 
investigations. 

There are two ways, however, that 
they may actually interfere with the 
progress of a science. First, they may 
lead to a merely verbal controversy that 
seems to assume that logic can decide 
the issue without any apparent realiza- 
tion that the question is an empirical 
one. Second, the empirical hypothesis,
if it is confirmed, may be interpreted 
as supporting other statements made by 
the same person even though they are 
not logically related to the confirmed 
hypothesis. One of the consequences of 
the current insistence upon testing hy- 
potheses is that the investigator may 
pull his hypotheses out of thin air or 
out of his intuitive wisdom. The confirmation
of such hypotheses may establish
the fact in question, but beyond
that it constitutes evidence for the in- 
vestigator’s perspicacity or luck rather 
than evidence of his other theoretical 
statements. 

Terminology and Definitions. A 
second type of theoretical contribution 
might be described as the proposal of a 
terminology or a definition. Lists of 
needs, classifications of types of social 
groups, definitional distinctions be- 
tween guilt and shame or between hope 
for success and fear of failure are ex- 
amples of such contributions. Defini- 
tions and terminology are important 
parts of a theoretical system, and many 
of the major scientific advances can be 
traced to a novel definition of the 
problem or a happy classification of the 
phenomena to be explained. Frequently 
such attempts stem from the effort to 
refine our ordinary language descrip- 
tions to avoid confusions that have 
plagued previous attempts to structure 
the field. 

The fruitfulness of a definition is 
judged ultimately on the success of the 
theory that utilizes it. Definitions are 
untestable in isolation but they contain 
the implicit assumption that those events 
included in a single definitional category 
belong together. By being defined in 
the same term, they lose some individ- 
uality and must behave alike in many 
situations in which the term is involved 
in an empirical prediction. A defect in 
a theory may certainly be the result of 
poor definition, but still a definition is 
untestable except in connection with a 
theory. A proposed definition unac- 
companied by a theory that permits the 
whole structure to be tested may, therefore,
appear to some people as merely
the spinning of verbal webs. Arguments 
about definitions can be fruitless, and 
one mark of an advanced science is less 
controversy over definitions. There 
certainly seems to be little value in try- 
ing to determine whether or not one 
person’s definition contains the “real” 
meaning of a term; for example, 
whether or not “motives are really just 
behavior sequences.” On the other hand, 
there may be a strong tendency in 
modern psychology to dismiss such
discussions by saying that “it is just a 
matter of definition,” as if such matters 
were unimportant. 

Proto-Theories. A third type of theo- 
retical contribution is the proposal of 
a genuine but incompletely specified 
theory, containing contingent state- 
ments as higher-level hypotheses and 
leading to predictions of one sort or 
another. These have been called “proto- 
theories.” 

Most psychological theories, es- 
pecially those of the cognitive variety, 
such as Lewin’s and Tolman’s, and 
psychoanalytic theories are proto- 
theories in this sense. The more rigorous 
behavior theories, such as Hull’s, are 
usually well specified in regard to the 
particular data of animal behavior they 
were designed to explain, but when 
they are applied to complex problems 
of human behavior or child develop- 
ment, as in Dollard and Miller (1950) or 
Sears, Maccoby, and Levin (1957), they 
must still be considered largely proto- 
theory. 

When proto-theories of human be- 
havior are carefully examined, their 
defects as scientific systems are frequently
of two sorts. First, they are not
sufficiently testable: the terms have inadequate
operational definitions. Second,
the so-called derivations from the
theory are not strictly logical; they skip 
steps, they depend on tacit assumptions, 
and they substitute intuitive reasonable- 
ness for rigorous logical deduction. 

To illustrate the first type of dif- 
ficulty, the cognitive theories of Lewin 
and Tolman have been quite justly 
described as postperceptual and pre- 
behavioral. The actual stimulus condi- 
tions that result in a given valence, or 
cognitive structure, are not specified. 
Similarly, the conditions under which 
a force leads to locomotion are left 
unclear. All these are defects in opera- 
tional definitions. Similar problems arise 
in Sears’ definition of aggression as be- 
havior intended to injure. 

The trouble can often be traced to 
the fact that our “naïve” theory of 
human behavior interferes with our 
“scientific” theory. In everyday life 
psychologists, like other people, operate 
on the assumption that others have an 
awareness of their surroundings, includ- 
ing such future events as the conse- 
quences of an act. They assume people 
are purposive, intentional, and rational, 
i.e., they choose what they do in terms 
of its consequences and behave as they 
do in order to attain good consequences 
and avoid bad. These assumptions of 
naïve theory have been described in 
more detail by Heider (1958). 

In everyday life we recognize other 
people’s intentions and judge pretty well
what they are aware of. We judge, 
for example, whether we have been 
intentionally ignored or actually not 
seen. Although these judgments are apparently
reasonably accurate (at least
we get by in everyday life), we have
very little clear knowledge of the cues. 
Psychologists, therefore, who try to 
build this naïve theory into a scientific 
one have great trouble in providing 
clear operational definitions. We are so 
sure that goal-directed behavior exists 
that the extreme difficulty of defining it 
in operational terms is never quite 
recognized. 

Despite the lack of operational defini- 
tions, hypotheses couched in ordinary 
language may lead to research. The in- 
vestigator who is willing to step out- 
side his behavioristic role can under- 
stand everyday language as well as 
anyone else. If he is sympathetic with 
cognitive theories of behavior, he devises 
experiments based on what the theory 
means as well as what it says. The 
danger is that he will not recognize the 
absence of the genuine operational 
definition. 

The second common defect in proto- 
theories of human behavior is the lack 
of strictly logical derivations of conclu- 
sions from premises. “His mother con- 
stantly compared him unfavorably with 
his brother; therefore he developed a 
strong resentment of the brother” il- 
lustrates this point. The statement is an 
unsatisfactory derivation. It follows 
common sense but does not fill in the 
logical gaps that are necessary to make 
it a true derivation. Similarly, to say 
that a person moves toward a goal 
object because he sees where it is may 
be excellent sense, but it is not a logical 
derivation unless many of the tacit 
assumptions are made explicit. 

The intuitive reasonableness of a con- 
sequence following an antecedent is 
affected by experience. In psychoanalytic
circles such statements as “his
conformity was so rigid that it must 
conceal a real impulse to rebel” have 
become customary and now appear in- 
tuitively sensible. Yet such a phrase con- 
ceals much ambiguity. The definition 
of “rigid” is not clear; other criteria 
than rigidity are necessary before a 
behavior can be realistically attributed 
to reaction formation. The terms “con- 
ceal” and “real impulse” carry many 
connotations that are difficult to specify 
and may even distort the intended 
meaning of the phrase. 

From the point of view of scientific 
strategy, what is the value of proto- 
theories with their poor operational 
definitions and their loose and shaky 
derivations? One opinion is that such 
theories are actual blocks to progress. 
The only way that we can progress is 
to state predictions in such a form that 
everybody knows just what they mean 
and everybody will agree when they are 
confirmed or invalidated. Then we can 
change the theory in the light of the 
revealed weaknesses and move on to 
another attempt. From this point of 
view, pseudo-deductions such as the 
ones illustrated merely hide problems, 
and any satisfactions obtained from 
their confirmation are illusory. 

A second point of view is that such 
theoretical statements, inadequate 
though they may be, are steps toward an 
eventual adequate theory. Those who 
hold this position frequently maintain 
that the insistence upon highly objective 
testable predictions and rigorously logi- 
cal derivations tends to focus the sci- 
entist’s attention upon relatively insig- 
nificant problems and results in very 
narrow theories. They argue that if we
must wait for a completely adequate
theory of perception and motor behavior
before we begin to investigate attitudes,
motives, feelings, and conflicts, we
will never get to the important problems.
Such people view scientific progress as 
the gradual refinement and differentia- 
tion of a broad theoretical position 
whose scope is adequate to include all 
varieties of behavior. Their opponents 
argue for establishing miniature theories 
of limited scope and then for gradually 
increasing the scope. 

A second issue also separates these 
two camps. How close will our eventual 
behavior theory be to “naive” theory
(see p. 17)? If the tacit theories of the
man on the street are distillations of 
generations of human interaction, then 
common sense is a good base for theory 
building. Psychology should first try 
to catch up with common sense by 
making naive theories explicit before 
it tries to forge ahead into new ter- 
ritory. If, on the other hand, common- 
sense psychology is merely a projective 
system as far from the truth as primitive 
theories of disease and astronomy, then 
we must cut loose immediately and try 
to build theories from the ground up. 

This controversy is especially rele- 
vant for the study of children because 
the man in the street makes little distinc- 
tion between adults and children. 
Mothers frequently say that the 3- or 4- 
month-old child begins to act human. 
By this a mother probably means that she
now attributes consciousness, inten- 
tions, and goal-directed behavior to him. 
The child is seen as an inadequate, 
poorly controlled, irresponsible adult 
for several more years but not as qualitatively
different. This view is probably
what is meant by the “miniature-adult”
theory of children that begins so many 
authors’ surveys of theories of child 
behavior. The validity of the naive 
theory of children is then considerably 
more questionable than that of the naive theory
of adult behavior. 

Still a third point of view on scientific 
strategy would argue that science, at 
least in its early stages, should not try 
theory building at all but should con- 
centrate upon establishing a solid body 
of theoretically neutral factual data. 
Gradually we can organize these data in 
terms of lower-level hypotheses and 
build from the data up to an eventual 
theoretical system. Extremists maintain 
that the facts will eventually build the 
theory for us. If an anatomist wants to 
know about the course of the carotid 
artery, he dissects and finds out where 
it goes without waiting until he has a 
theory that predicts where it goes. He 
does not even need a hypothesis to 
test. Many examples of theoretically 
blind yet useful data collection can be
found in anatomy, taxonomy, geology, 
and other sciences. 

Those who maintain the importance 
of theory as a guide to scientific devel- 
opment argue that in behavioral science 
no data are theoretically neutral and
that an antitheoretical position merely 
allows a number of tacit theoretical 
assumptions to have full sway under 
the guise of theoretical neutrality. They 
can point to the implicit theory in any 
attempt to categorize behavior. Does the 
boy move his arm or slam the door? 
The first phrase is not more theoreti- 
cally neutral than the other; it merely 
denies goal-directed behavior tacitly 
rather than explicitly. 


We have described three scientific 
strategies and tried to present the arguments
for and against each. The first
strategy is to make every theory for- 
mally sound with clear operational 
definitions and rigorous derivations. The 
second is to begin with global theories, 
necessarily vague, and refine them by 
research. The third strategy argues for
fact finding with no theory building
at all.

Perhaps it is important to recognize 
that all of these positions are proposing 
empirical hypotheses about the develop- 
ment of a science without any real 
evidence. Theory building is a variety 
of human behavior and is within the 
realm of behavioral science. But it is a 
realm in which we have very little 
data. Although it seems quite likely that
each of the three approaches is a sound 
strategy under some conditions and that 
behavioral science will profit from the 
existence of all these points of view, we 
really have little evidence to support 
any position. The soundest statement is 
the negative one that until better evi- 
dence emerges it would seem rash to 
maintain dogmatically the worthlessness 
of any of these strategies of scientific 
advance. 


Specifications for a Theory 


The purpose of this section is to 
provide a table of contents of a sort for 
a theory of child behavior and develop- 
ment. It is in no way a statement of the 
content of a theory but rather of the 
specifications for a theory. It is an 
outline of the problem areas that an 
adequate theoretical system must some
they were before it. These considerations
make it convenient to distinguish
between the contemporaneous causes of 
behavior and the temporal changes in 
the individual’s properties caused by 
factors operating over a period of time. 

The factors related to change may be 
conveniently classified into those de- 
scriptive of the individual before the 
change took place and the environ- 
mental impacts occurring between that 
time and the time the change occurred. 
If there are developmental changes that 
are internally controlled and unin- 
fluenced by intervening environmental 
events, they represent a limiting case. 

Just as in the study of behavior, the 
research task consists of properly de- 
fining the characteristics of the person 
and the characteristics of the interven- 
ing events so that they may be related 
by a system of theoretical hypotheses. 
Since the characteristics of the person 
enter into both a theory of behavior and 
a theory of change, it would be very 
convenient for the investigator if the 
same set of personality variables ap- 
peared in the two theories, but such a 
convenient simplification may not be 
valid. There is no a priori reason that 
the categories of personal traits that 
cohere meaningfully in the analysis of 
the contemporaneous situation must 
be the same categories that are homo- 
geneous with respect to the process of 
change. 

For example, one possible conse- 
quence of being deprived of social re- 
lations for a period of time may be to 
lose the desire for human companion- 
ship. One sort of personality charac- 
teristic might be this tendency to adapt 
readily to social isolation. Whether this 
personality variable would be detectable
by any purely behavioral measure 
restricted to a study of the contem- 
porary situation is a debatable but em- 
pirical question. It might be as difficult 
to measure as the trait of “early 
menarche” in the 4-year-old. 

On the side of the environment, it 
seems almost certain that the descrip- 
tion of the stimulus properties of the 
situation will not be an adequate de- 
scription of the total impact of an en- 
vironment in changing the person. In 
terms of present notions, the latter in- 
cludes all of the reinforcements that an 
environment provides as well as the 
stimuli that elicit responses. Also, the 
environment includes more than a single 
situation. It might well be thought of as 
containing distributions, sequences, and 
sets of situations. Its description might 
be couched, therefore, in such terms as 
the probability of various situations 
within the environmental setting and 
the contingent probability of sequences 
of situations following upon various 
actions of the organism. The uncer- 
tainty about the basic description of a 
psychological environment shows how 
little attention this fundamental problem 
has received in behavioral science. 
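
As an illustration only, the description proposed above might be put into operation by tabulating an observational record. The sketch below is in Python. The record format (situation, child's action, situation that followed), the function name, and the sample entries are all hypothetical and are not drawn from any study. It estimates the relative frequency of each situation and the contingent probability of each following situation, given the child's action.

```python
from collections import Counter, defaultdict

def describe_environment(records):
    """Summarize a hypothetical log of (situation, action, next_situation) tuples.

    Returns the relative frequency of each situation and, for each action,
    the relative frequency of the situations that followed it.
    """
    situation_counts = Counter(situation for situation, _, _ in records)
    total = sum(situation_counts.values())
    situation_prob = {s: n / total for s, n in situation_counts.items()}

    followers = defaultdict(Counter)
    for _, action, next_situation in records:
        followers[action][next_situation] += 1
    contingent_prob = {
        action: {nxt: n / sum(counts.values()) for nxt, n in counts.items()}
        for action, counts in followers.items()
    }
    return situation_prob, contingent_prob

# Invented entries for illustration; not data from any investigation.
log = [
    ("play", "cries", "comforted"),
    ("play", "cries", "ignored"),
    ("meal", "cries", "comforted"),
    ("play", "shares", "praised"),
]
situation_prob, contingent_prob = describe_environment(log)
```

A description of this kind treats the environment as a set of frequencies rather than as a single stimulus situation, which is the shift in viewpoint the paragraph above suggests.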

The role of child and developmental 
psychology in achieving a theory of 
change is quite obvious. The central 
problem of developmental psychology 
is to explain those changes that occur as 
the child grows up to become mature 
and socialized. 


Special Research Problems 


This grand overview of the fields 
of behavioral science and the role of
child psychology is exhilarating in its
breadth, but it contains more than a 
little Utopian thinking. It is well, there- 
fore, to come back down out of the 
clouds to some of the more mundane 
problems of research in child develop- 
ment. This whole volume is founded 
upon the supposition that research in 
child development offers some special 
problems, some unique challenges, and 
some unusual opportunities. What are 
these problems? 


PROBLEMS ARISING FROM 
CHILDREN’S CHARACTERISTICS 


One set of special problems arises 
from the fact that the subjects of child- 
development research are children. 
Quite aside from the fact that the job 
of child psychology is to understand 
children, we must understand them 
somewhat even to conduct research 
with them. In terms of traditional re- 
search operations, children are some- 
where between animals and adults. In 
research on adults we are accustomed to 
structuring the situation with language, 
to instructing the subjects with lan- 
guage, and, in many experiments, to 
collecting verbal responses that are 
treated as communications by the 
subject about some subject matter. Re- 
search with animals is conducted on 
quite a different basis. The situation is 
structured by such things as walls, elec- 
tric grills, and stimulus patterns, and 
the behavior of the animals is taken as a 
reaction, not a communication. Chil- 
dren fall in between. For very young 
children, the techniques used with ani- 
mals are necessary, but as the child 
grows older we can begin to utilize
language as a research tool. The trouble
is that children do not understand in-
structions very well, and verbal struc-
turing of the situation does not always
produce the same results that it does
with adults. We cannot count upon
the effectiveness of these verbal pro- 
cedures with children. We can, of 
course, treat children as if they were 
animals and study them nonverbally, 
but that is to sacrifice the very thing 
about children that makes them es- 
pecially valuable as research subjects. 
This very lack of docility of children 
to verbal procedures is a major field 
for investigation; we ought to study it, 
not circumvent it. 

How do these characteristics of chil- 
dren affect research? One of the prob- 
lems is to establish a prescribed psy- 
chological situation for the child. Sup- 
pose, for example, that we want to offer 
the child a choice between two alterna- 
tives to determine preference. For an 
adult, we can be reasonably sure that 
when we give him a choice he is 
actually choosing. A child may pick the 
first alternative he sees without ever 
looking at the other one. He may vacil- 
late so much from one alternative to the 
other that we suspect his final behavior 
is more accidental than deliberate, or 
he may break down emotionally because 
he cannot have both alternatives. 

In many experimental conflict situa- 
tions, for example, we want to be sure 
the subject knows what he can do, 
what the consequences are of each pos- 
sible course of action, and what courses 
of action are not permitted in the ex-
periment. With adults these objectives
can usually be achieved by making the 
apparatus clear and the instructions 
specific, but children may pay no at-
tention to the directions or may be
attracted to one feature of the apparatus 
at the expense of another. They may in- 
sist on the experimenter’s help when the 
adult would conform to the proper 
mores of the experiment. These be- 
haviors might be made the subject of 
investigation, but when we want to 
study some other aspect of child be- 
havior we do not want to be sidetracked 
into an investigation of all these dif- 
ficulties. Consequently, the experi- 
menter must use considerable skill to be 
sure that the psychological impact of 
the situation is what the experimenter 
intends. In later chapters specific meth-
ods are discussed.

If one of the problems of working 
with children is on the stimulus side, 
there are equivalent problems in inter- 
preting the meaning of the child’s re- 
sponse. In most research with adults we 
operate on the assumption that the be- 
havior is either a communication to the 
experimenter or that it is a response to 
the situation. In the latter case, its com- 
municative aspects are interferences. 
When we want to determine the sub- 
ject’s response to some harmful stimulus 
imposed upon him by another person, 
we recognize that one factor that may 
inhibit his hostile behavior is his un- 
willingness to appear to be hostile in 
the eyes of the experimenter. Here 
the communicative significance of the 
behavior interferes with the measure- 
ment of the subject’s reaction to instiga- 
tion. 

With adults, we count upon our good 
intuitive knowledge of other adults to 
keep us out of trouble. It is part of the
art of experimentation to design situa-
tions that do not inhibit the behavior we
wish to study. In the case of children 
our intuitions are more likely to play 
us false. Sometimes it seems as if the 
child’s response is uninfluenced by any 
recognition of its meaning to the ex- 
perimenter, and in fact one attraction 
about working with children is their 
relative freedom from convention. But
again the child cannot be counted on to 
be consistent in such matters—adults 
are not either, but relatively more so. 
So, in interpreting the child’s response, 
whether it is a reaction or whether it is 
a story or self-report that is to be 
treated as a communication, there are 
special skills that experienced investiga- 
tors have acquired. These questions are 
discussed in much more detail in the 
chapters that follow. 

Another illustration points to a 
slightly different problem. When a child 
is asked to repeat digits, he may sud- 
denly start to count in the middle 
of the repetition, “4, 1, 3, 7, 8, 9, 10.” 
If an adult fails on repeating digits, we 
can reasonably assume that he forgot 
and attribute the error to memory. 
The same thing may be true of the 
child, but another possibility is that he 
lost his task orientation and wandered 
off into a habitual response. As far as 
being a failure on the intelligence test 
is concerned, there is no serious prob- 
lem; but in interpreting the failure the 
particular characteristics of children 
need to be kept in mind. 

This difference between children and 
adults is, of course, only an extreme ex- 
ample of the difference between chil- 
dren of different ages. Such differences 
make it quite difficult to plot develop-
ment from one age level to another
because we do not know the funda- 
mental dimensions of behavior in terms 
of which children of different ages can 
be compared. The difficulties in de- 
scribing the growth of intelligence il- 
lustrate the point. When individual 
growth curves on mental age are 
plotted, it is quite common to find con- 
siderable variability from one time to 
another. These variations are frequently attributed
to the effects of environment, but such 
an interpretation is not the only pos- 
sibility. Bayley (1933), for example, 
suggests that such variation may well be 
due to the fact that intelligence tests 
measure different abilities at different 
age levels. In view of the way intelli- 
gence test items are selected, this in- 
terpretation seems reasonable. 

If Bayley’s interpretation is true, it 
means that intelligence is not a com- 
parable measure at all age levels. What 
then is a dimension on which child de- 
velopment can be plotted with the as- 
surance that the same thing is being 
measured at each age level? A complete 
answer to this question would require 
a really adequate theory of develop- 
ment, but until we do answer it we can 
expect difficulty in knowing whether or 
not the variables we choose for the 
comparison of age levels are the proper 
ones. Until a more adequate theory is
available, we shall probably do well to 
study direct behavioral indices, which 
are as close to the raw data as possible. 
Height is such a physical measurement. 
A change in height does not always 
mean the same thing; it may mean 
growth in the length of the legs in some
cases and change in sitting height in
others. Even so, it has real advantages 
over such a highly derived index as 
mental age. 


PROBLEMS ARISING 
FROM THE STUDY OF CHANGE 


These general research problems 
plague and challenge research workers 
in the interpretation of the behavior 
of a child of any particular age. There 
are other problems that are inherent in 
the study of changes, especially long- 
term developmental changes. 

The most serious technical problem 
is the real inability to control the en- 
vironment of a child over any rea- 
sonably long period. In longitudinal 
studies we must be content with very 
inadequate controls over the events in 
the child’s life. 

Longitudinal studies, as they were 
originally conceived, seemed to have 
gambled on the existence of clear de- 
velopmental trends that would shine 
through the welter of influences of un- 
controlled events. In physical growth 
the gamble paid off quite well. Many 
measures of physical growth are rel- 
atively uninfluenced by the disturb- 
ances found in the normal life of the 
child. Even though illnesses slow down 
the growth of long bones and delay the 
appearance of ossification centers, the 
disturbance is relatively minor. The 
general shape of the growth curve is 
quite apparent, and the extrinsic dis- 
turbances do not affect the individual’s 
growth pattern enough to destroy the 
predictability of early or late matura- 
tion, adolescent growth rate, adult 
height, etc. The organism’s own growth
pattern can be identified despite lack 
of control over many environmental 
factors. 

On the face of it, major psychological 
changes, which are part of growth, ap- 
pear equally prominent and obvious. 
The increase in ability, self-control, and 
perceptual sensitivity seems almost as 
obvious and as universal as the in- 
crease in height and weight. But diligent 
and competent attempts to identify 
meaningful individual patterns in psy- 
chological development analogous to 
the various physical growth patterns 
have not yet been very successful. 

When the interest of child-develop- 
ment research shifted from the study of 
the maturational process to the study of 
the effect of early childhood experi- 
ences, the famous longitudinal studies 
were caught in mid-stream. They were 
committed to a research design that 
involved the careful periodic description 
of a relatively unselected group of chil- 
dren from birth onward. The design 
was a reasonable one for the purposes 
that guided its inception, but for the 
investigation of the effects of early 
childhood experiences it was not so well 
selected. Yet in a way the longitudinal 
method as a whole has been evaluated 
on the basis of these studies, which were 
originally designed for another purpose. 

What are the problems of a longi- 
tudinal investigation of the effects of 
early childhood experiences? The first 
is the determination of the early child- 
hood experiences and effects to study. 
This is no light task because the intro- 
duction of any variable into a longitudi- 
nal program that will last for several 
years is expensive in time, money, and
personnel. The same care and pretesting
should go into the selection of the 
proper variables that go into the selec- 
tion of measuring instruments for an 
artificial satellite. 

The problem is especially serious on 
the side of the independent variable be- 
cause this determines the selection of 
cases. To expect that enough of the 
right kind of cases will turn up in an 
unselected longitudinal sample is fool- 
hardy indeed. Yet to set up a new 
longitudinal sample for every new ex- 
perimental and control variable is out 
of the question. So a major problem is to 
pick out likely variables whose effects 
will be so strong that they will show up 
even when the error term contains all 
of the variations introduced by uncon- 
trolled events in the child’s life. 
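
A rough calculation suggests why only strong effects are worth the gamble. The sketch below, in Python, assumes a simple comparison of two group means, a two-sided test at the .05 level, power of .80, and the usual normal approximation. None of these choices comes from the text, and the function name is hypothetical.

```python
from math import ceil
from statistics import NormalDist

def cases_per_group(effect, error_sd, alpha=0.05, power=0.80):
    """Approximate number of cases per group needed to detect a mean
    difference `effect` when the error standard deviation is `error_sd`
    (normal approximation; an illustrative assumption)."""
    z = NormalDist()
    z_alpha = z.inv_cdf(1 - alpha / 2)   # critical value for the two-sided test
    z_power = z.inv_cdf(power)           # quantile matching the desired power
    return ceil(2 * ((z_alpha + z_power) * error_sd / effect) ** 2)

# Same true effect; the second case lets uncontrolled events double the error SD.
well_controlled = cases_per_group(effect=1.0, error_sd=1.0)
uncontrolled = cases_per_group(effect=1.0, error_sd=2.0)
```

Under these assumptions, doubling the error standard deviation roughly quadruples the number of cases required, which is why a variable with a weak effect is a poor candidate for a longitudinal sample.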

A second problem is the selection of 
control variables. These controls are 
maintained in the usual research design 
because they probably influence the 
dependent variable. If they can be kept 
constant, the residual error variation is 
reduced. In a longitudinal study the 
control may also be used to select cases 
whose lives are less likely to be sub- 
jected to violent upheavals because 
those upheavals increase the error vari- 
ance. Therefore, geographical stability 
is highly desirable—for practical as well 
as theoretical reasons. Perhaps the 
limitation of the study to cooperative 
middle-class families is to be desired 
rather than deplored because of the 
relative placidity and stability of mid- 
dle-class family life. To be sure, gen- 
erality is sacrificed, but there are strong 
arguments for maximizing the chances 
of finding something before worrying 
about the generality of what is found. 
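
The reduction in error variance obtained by selecting stable cases can be illustrated with a small simulation in Python. Every number in it is invented: it assumes that 30 per cent of families experience a major upheaval, that an upheaval shifts the outcome score by three units, and that the remaining error is normally distributed with unit variance. The function name is likewise hypothetical.

```python
import random
import statistics

def simulate_outcomes(n, restrict_to_stable, seed=0):
    """Draw n outcome scores; optionally admit only families without upheaval."""
    rng = random.Random(seed)
    outcomes = []
    while len(outcomes) < n:
        upheaval = rng.random() < 0.3      # assumed rate of major upheavals
        if restrict_to_stable and upheaval:
            continue                       # case excluded by the selection control
        noise = rng.gauss(0.0, 1.0)        # residual error, unit variance
        outcomes.append(noise + (3.0 if upheaval else 0.0))
    return outcomes

unselected_var = statistics.pvariance(simulate_outcomes(2000, restrict_to_stable=False))
stable_var = statistics.pvariance(simulate_outcomes(2000, restrict_to_stable=True))
```

In this sketch the variance of the restricted sample is smaller than that of the unselected sample, because the upheavals no longer contribute to the error term. The price, as noted above, is that nothing is learned about families outside the selected group.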




A third problem is the selection of the 
dependent variables and measuring in- 
struments. There is an almost ir- 
resistible urge in a longitudinal study 
to cover all aspects of personality, since 
the cost of just one more test or one 
more observation seems so small by 
comparison with what is already in- 
vested in the program. In a sense, the 
argument is justified; it probably is 
better to err on the side of too much 
rather than too little coverage. But the 
additional dependent variable is some- 
times just a shot in the dark and has no 
theoretical reason for being influenced 
by the variables under investigation. 
In that case it would be well to consider 
its cost, not in relation to the tre- 
mendous investment of the total pro- 
gram, but in terms of what the same 
time and money could accomplish in a 
completely separate small study to 
determine what the variable is related 
to. 

It should be apparent that a longi- 
tudinal study of the effects of one, two, 
or three variables of childhood experi- 
ence upon later personality is a big in- 
vestment for relatively small return— 
quantitatively speaking. It behooves us, 
therefore, to precede such an undertak- 
ing with careful pretesting, study of 
cross-sectional differences, and cruder 
retrospective studies to establish the 
likelihood of major effects. A longi-
tudinal study is the last, not the first,
step in a research program. It is an ab- 
solutely essential research method if we 
are to get firm knowledge of psycho- 
logical change, but paradoxically it is 
to be avoided whenever possible. 

There is another type of longitudinal 
study, the analysis of the individual
cases based upon records as complete
as possible. This type of research is 
often defended as an exploratory study 
to suggest hypotheses. Although it has 
been useful at times for that purpose, 
it seems a very laborious and inefficient 
procedure for generating hypotheses. 

In the author's opinion the elaborate 
longitudinal case study belongs at ex- 
actly the opposite end of the con- 
tinuum. Human behavior is so complex 
that theories must eventually outgrow 
the restrictions of the three- or four- 
variable research design. Once behavior 
theories become genuinely multivariate, 
it is impossible to find any sample of 
cases that is alike on enough variables 
to belong in the same cell. Then we 
must move to the case study. We shall 
appraise individual cases on the ten or 
twenty variables contained in the theory 
and let the theory make testable indi- 
vidual predictions about each case. The 
predictions will be different for every 
case, but the accuracy of the predic- 
tions on a sequence of cases will permit 
the theory to be tested just as rigorously 
as the more orthodox design. 
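
Two pieces of arithmetic lie behind this argument and can be sketched briefly in Python. The first counts the cells of a factorial design; the second asks how probable a given record of correct individual predictions would be if each prediction succeeded only by chance. The binomial model, the assumption that the predictions are independent, the chance rate of one half, and the function names are assumptions introduced for illustration; they are not part of the proposal in the text.

```python
from math import comb

def cells_required(n_variables, levels_per_variable):
    """Number of cells in a full factorial classification of the cases."""
    return levels_per_variable ** n_variables

def chance_probability(n_cases, n_correct, p_chance):
    """Probability of at least n_correct successful predictions in n_cases
    if each prediction succeeded independently with probability p_chance."""
    return sum(
        comb(n_cases, k) * p_chance ** k * (1 - p_chance) ** (n_cases - k)
        for k in range(n_correct, n_cases + 1)
    )

cells = cells_required(n_variables=10, levels_per_variable=3)
tail = chance_probability(n_cases=20, n_correct=15, p_chance=0.5)
```

With ten variables at only three levels each, the classification already has 59,049 cells, so that matched groups are out of the question. A record of 15 correct predictions in 20 cases, on the other hand, would arise by chance about twice in a hundred such series under the stated assumptions.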

Whether there are theories ready for 
this type of test is an open question. 
John Benjamin (1949), who has written
an excellent analysis of this problem, 
thinks that psychoanalytic theory can 
make some predictions on this basis. He 
believes, for example, that the theory 
can predict that certain children cannot 
under any circumstances develop a par- 
ticular mental disease. If they become 
mentally ill, the illness will take some 
other form. This type of prediction, if 
it can be successfully upheld, neatly 
circumvents all the problems of uncon- 
trolled environmental events that occur 
between the prediction and its test.

In this section we have discussed the 
field of child-development research, 
have tried to identify the problems it 
must solve, and have pointed to some 
of the difficulties that must be over- 
come. The problems of investigating 
personality changes over long periods 
have come in for extensive discussion 
because they are so particularly central 
for the field of child development. Later 
chapters in this volume treat of many 
of these problems in much more detail. 


Ethical Problems of Research 


We saw how some of the research 
problems relating to children stem from 
their being half human, half animal. In 
an interesting way, this same marginal 
position is at the root of the special 
ethical problems of research on chil-
dren. They are both precious and vul-
nerable. Unlike animals, they cannot be
sacrificed for research. They must be 
treated with all the safeguards that our 
civilization demands for the study of 
human beings. 

At the same time, they are vulnerable. 
There is a widespread assumption that 
children are less resistant and resilient 
than adults; that is, that stress, frustra-
tion, emotional experiences, and pres-
sures that would cause only temporary
discomfort in adults may produce long-
range consequences in children. Ac-
tually, the evidence for this belief is
almost nonexistent, but neither is there
clear contradictory evidence. Since no-
body wants to take any chances, be-
havioral research among children is re-
stricted by narrower limits than com-
parable research among adults.

Not only are children perhaps more 
vulnerable, but they are also not free 
agents, able to make their own decisions
and to take the consequences of them.
They are below the age of consent, so
to speak, and the investigator who un-
dertakes the research must obtain con-
sent from the people responsible for the
child and must act in loco parentis 
while the child is in his care. 

It is necessary on some occasions that
the adult subject be unaware of the
purpose or the procedure of an experi-
ment. This circumstance is sometimes
used as an excuse to put adult subjects
into experiments that they would never
enter voluntarily, even when the knowl-
edge would not affect their behavior in
the experiment as much as it would lead 
them to refuse to participate. The gen- 
eral practice of deceiving the subjects 
of psychological experiments is roundly 
condemned by a large section of the 
psychological profession, perhaps the 
majority. They argue that in most cases 
the experiment can be done equally well 
without deception, that if necessary the 
subject can be told frankly that it is 
necessary for him to be ignorant of the 
purpose of the experiment, and that the
few experiments in which deception is 
absolutely required are better left un- 
done. These psychologists point to the 
growing public image of the psycholo- 
gist as underhanded. The widespread 
use of deception in experiments and in 
disguised forms of testing, such as pro- 
jective methods, does justify this mis- 
trust. Such procedures are used to make 
the subjects reveal information with-
out the subject’s knowledge or consent.
These suspicions of the intent of psy- 
chologists can have serious conse- 
quences, indeed. One of the ironic fea- 
tures of the situation is that in most 
cases the deception is unnecessary; con- 
sent could be readily obtained by a 
frank discussion of the psychologist’s 
interests.

When we study children instead of 
adults, the issue is put more squarely. 
Clearly there is no reason to deceive the 
child’s parent or teacher about the pur- 
pose of the experiment, except the fear 
that that person would be unwilling to 
allow the child to participate or that 
he would by coaching or instruction try 
to ensure that the child’s behavior 
would not reflect unfavorably on him 
or his parents. The ways in which the 
parent or teacher might interfere with 
the experiment may seem unfortunate 
to the psychologist, but in a real sense 
they are the prerogatives of the person 
in whose care the child is placed. If he 
is not convinced that the psychologist 
will do the child no harm or fears the 
implications the experimenter might 
draw from the child’s behavior, the psy- 
chologist has either done a poor job of 
explaining what he is going to do or has 
persuaded the parent against his will to 
let the child participate. Although the 
experimenter may be convinced that the 
parent’s anxieties are groundless and 
that his fears are defensive, it is still the 
prerogative of the parent to have them 
and to behave accordingly. The psy- 
chologist who takes it upon himself to 
decide when another free agent’s fears 
are groundless without having been 
delegated that power is treading upon
dangerous ground. All the arguments
against deceiving adult subjects hold 
even more strongly when we consider 
the problem of deceiving not the child 
subject but his caretaker. 

The experimenter is well advised, 
therefore, to explain quite frankly to 
the person responsible for the child 
what he is interested in studying and 
how he intends to proceed and to an- 
swer openly any questions about the 
effects of the experiment on the child.
If he feels sure that he cannot explain 
everything to the parent, then he should 
say that he cannot and ask the parent 
to trust him. In the vast majority of 
cases he can obtain consent; the experi-
menter’s fears of being refused permis-
sion are almost as groundless as the
groundless anxieties he attributes to the 
subject. There will be refusals, but we 
have to accept them. Their frequency
will be increased only if we adopt the 
short-sighted strategy of obtaining con- 
sent by deception. 

Once the caretaker’s consent has been 
obtained, then the experimenter faces a 
similar set of problems as far as the 
child is concerned. Here there may be 
real necessity to conceal some of the 
purposes of the experiment from the 
child, and in the case of younger chil- 
dren it may be quite impossible to make 
the child understand the purpose of the 
experimenter. The quite sensible gen- 
eral practice is to treat the child more 
and more like an adult subject the older 
he is. With preschool children, the re- 
quest to “play some games with the 
experimenter” or the explanation that 
“we want to see how children like you 
play with these games” is probably 
about as good as can be done.

After the experiment is over, par- 
ticularly if some deception of the child 
is necessary, the research worker can 
well take the time to give an explana- 
tion that is geared to the level of the 
child. School-age children can under- 
stand the purposes of research and the 
need for experimental and control 
groups. Such an explanation has many 
good effects. It makes the child feel 
that he is a participant in scientific re- 
search—science is a highly esteemed 
activity to which most people are de- 
lighted to contribute. It makes the chil- 
dren feel that the experimenter is an 
honest person whose intentions in his 
experiment were not malevolent. It al- 
lays some anxiety and provides an op- 
portunity for the experimenter and the 
caretaker to recognize individual chil- 
dren who should be given some special 
explanation and reassurance. Finally, it 
is an excellent concrete teaching de- 
vice for the discussion of the scientific 
method and allows the experimenter to 
communicate some of the basic scien- 
tific values that we feel are important. 


Procurement of Subjects 


Ethical problems merge impercepti- 
bly into the practical problems of pro- 
curing and working with child subjects. 
Many of the arguments for frankness 
and honesty are as strongly supported 
by the need for maintaining continuing 
access to schools, homes, and institu- 
tions as by strictly ethical arguments. 
There are, however, some practical is- 
sues in the procurement of subjects in 
which ethical issues are not central. 


THE APPROACH 
TO INDIVIDUAL HOMES 


The ultimate source of child subjects 
is, of course, the home. Although it is 
often more convenient to try to obtain 
subjects at school, it is often necessary 
to approach the home directly. Natu- 
rally, the contact to be made is with 
one or both parents of the child. Par- 
ents are responsible for their children, 
and legally they are the only ones who 
can agree to the use of the child as a
subject. With young children, it is 
natural to obtain consent from the par- 
ent, but with school-age children and 
adolescents it is often practical to ask 
for volunteers. This means that the child 
is acting independently—something he 
has no right to do. In every case it is 
important to approach the people in 
charge. Within the school, of course, 
volunteer subjects are in the care of 
the school authorities, who have the 
right to consent to their being subjects 
during school time. If, however, the 
child is to be tested outside the school, 
when the school is not responsible for 
him, then the parent is the responsible 
agent, and the parent's cooperation 
should be sought before the child is 
actually tested.

Frankness in approaching the parent 
is not only the most honest but also 
the soundest policy. Investigators who 
work with children a great deal are 
almost universal in expressing amaze- 
ment at the cooperativeness of parents, 
particularly at the middle and upper 
socioeconomic levels. Most people feel 
pleased to participate in scientific re- 
search and will often go to great lengths 
to be helpful.

There are various things the investi-
gator may be able to do to make the
parent continue to feel cooperative 
after the experiment is over. Conserva- 
tion practices are just as important in 
the maintenance of research subjects as 
in the treatment of other natural re- 
sources. One good rule is not to push 
the subject to the limit of his coopera- 
tiveness. By providing, when needed, 
such things as transportation, baby sit- 
ters, and comfortable waiting space and 
by a generally friendly sociability, the 
experimenter can leave the parent with 
the feeling that psychologists are hu- 
man beings and not bad ones at that. 
The offer to report back on the results 
of the research, either to the parents 
individually or in a group or by way 
of local mothers’ clubs or PTA meet- 
ings, is highly desirable. If the experi- 
ment has been described frankly, the 
parents will usually be interested in the 
findings. Naturally, they are most in- 
terested in what the experiment can tell 
them about their own child, but in 
many experiments individual scores are 
highly unreliable. It is important in 
these cases to make some point of the 
fact to avoid the overinterpretation of 
individual scores. Often the discussion 
of individual scores can be avoided. 

We have, however, passed over the 
most difficult problem—how to gain 
access to the home to begin with. There 
are relatively easy ways to approach 
parents through groups like the PTA, 
but if the sampling problem is an im- 
portant one we must recognize that 
these groups are highly selected. One 
general procedure, commonly used, is 
to begin by writing a letter to each 
parent, introducing the project and
naming the individual who
will later telephone for an appointment. 
This letter on official stationery estab- 
lishes the bona fide character of the 
research; it offers the parents the op- 
portunity to call the writer of the let- 
ter for reassurance that the research 
project has the endorsement of the in- 
stitution under whose auspices the ex- 
periment is being conducted. In uni- 
versity towns, particularly, there are 
some unauthorized research activities 
carried on independently by students. 
These unsupervised studies are some- 
times not so carefully thought through 
as they should be, and they may involve 
such ticklish public relations issues as 
sex practices or racial attitudes. Most 
such studies can be done, but there is 
real value in thinking them through 
carefully and preparing ahead of time 
for possible difficulties. The general 
plan of beginning with a letter from 
the institution endorsing the project and 
introducing the investigator is one way 
to avoid unnecessary trouble. The ex- 
perimenter can then make the telephone 
call within a few days; he can answer 
further questions and, depending upon 
the circumstances, follow up with a 
personal visit to explain the project 
more fully. 

Another procedure may begin with 
an actual visit to the home. If the sam- 
pling is geographical, the investigator 
may not know the names of the people 
he wishes to study and cannot write 
them a letter. He then introduces the 
project at the door, offering the alter- 
native of doing the interview or experi- 
ment then and there or making an ap- 
pointment to come back at some other 
time. The details, of course, depend 
upon who must be seen, where the ex- 
periment must be carried out, etc. This 
procedure probably makes it harder for 
the prospective subjects to say “no,” but 
it also makes research compete with 
door-to-door salesmen and may be 
viewed by the subjects as a high-pres- 
sure technique. When the investigation 
involves an interview or some easily 
portable experiment, so that the experi- 
menter can reasonably offer the oppor- 
tunity to carry it out immediately, this 
direct procedure may well save time 
and energy. In fact, it saves so much 
time and has been so successful that 
door-to-door salesmen sometimes gain 
access to the home by posing as inter- 
viewers for a survey. 


WORKING WITH SCHOOLS 
AND INSTITUTIONS 


The procurement of subjects in 
large groups in a school or other in- 
stitution is a different problem entirely. 
There are several very important things 
to keep in mind in connection with this 
problem. The first one is that there are 
fewer schools than homes, so that a bad 
experience in a school system can shut 
off a large group of subjects for a long 
time. In view of the amount of child- 
development and educational research 
that must be done in the next twenty 
years, conservation practices are most 
important indeed. 

A second principle stems from the 
fact that the basic responsibility for the 
children in an institution lies with the 
senior administrator, the school super- 
intendent, the principal, or the director. 
The place to begin is at the top because 
that is where the responsibility lies. In 


a city school system the superintendent 
is nearly always the first person to in- 
terview; in rural districts the principal 
may be autonomous and be able to 
make the decision himself. The princi- 
ple of starting at the top applies on 
down the line. Within a school, see the 
principal first, hopeful that the super- 
intendent has already passed down his 
endorsement. In most schools every 
visitor should check in at the office on 
every visit. The research worker may 
be a good friend of a teacher who is 
more than glad to cooperate in a small 
study, but the teacher does not have 
the right to consent to a research study 
or even to give an outsider access to her 
records. 

The injunction to start at the top 
does not mean that the project should 
be kept a secret from everybody who 
would be most actively involved. Pre- 
liminary discussions to design the study 
so that it will be feasible to do it in a 
school situation might involve a teacher, 
on her own time, the school psychol- 
ogist, or somebody else. When, how- 
ever, the investigator actually wants to 
obtain consent to do the study, the top 
man is the one to see. 

A third principle of working with 
school systems is to recognize that the 
schools have other jobs to do besides 
releasing children for research projects. 
Very few, if any, school administrators 
are against research activities. Their 
questions about a research project are 
likely to be the following: (1) Will it 
hurt the children? (2) Will it interfere 
too much with school activities? (3) 
Will it make parents or school-board 
members protest? 

The experimenter should prepare his 
request with these three questions in 
mind. To answer the first, he should 
describe exactly what he proposes to do 
with each child subject. In case there 
are debatable points, he may wish to 
justify an unpleasant treatment of the 
child as temporary, as no worse than 
the child experiences every day in 
school or on the playground, and as an 
absolutely necessary feature of the ex- 
periment that could not be avoided 
without losing the whole value of the 
project. He may also describe how he 
proposes to justify the treatment to the 
children afterward. Beyond that, he 
must let the superintendent or principal 
decide. 

In anticipation of the second ques- 
tion, the experimenter will do well 
to know exactly how many children he 
needs, in what age groups, how many 
sessions with each, how long each ses- 
sion will be, and how many children 
will be needed in the experiment at one 
time. These are the facts the adminis- 
trator needs. In designing the study, it 
may be possible to make it more ac- 
ceptable. In general, schools would 
rather turn over a whole class to an ex- 
perimenter for an hour than to have 
the experimenter remove every child 
individually for fifteen minutes. This is 
partly a function of what the rest of 
the class will be doing while each child 
is out and is not a hard and fast rule. 

The problem of interference of re- 
search with other school activities is 
especially serious in the vicinity of a 
university because so many of the re- 
search plans of students involve the co- 
operation of the school system. Any 
school system must draw a line some- 
where on the number of different re- 


search programs it can accommodate. 
There are real advantages, therefore, in 
trying to find a school system that is 
not used by many other research work- 
ers. Although superintendents, princi- 
pals, and school boards in such towns 
may be less accustomed to the idea of 
cooperating with research programs and 
may require more explanation, they fre- 
quently welcome the opportunity to 
cooperate. 

The last problem of the school ad- 
ministrator, protests from the public, is 
much more difficult to anticipate. There 
is a large amount of unpredictability in 
public reaction. Probably there are rela- 
tively few research projects, if they 
were explained fully to the public, that 
would not be generally acceptable, but 
very few of them would be completely 
immune to criticism. Somebody would 
almost surely think they were a waste 
of time at best and a positive evil at 
worst. These are the people who will 
refuse to cooperate as individuals in a 
research program, but they can create 
real difficulty in a school system if they 
actively protest. It takes only a few 
protests to discourage school adminis- 
trators from cooperating. 

There seem to be a few subjects that 
are particularly dangerous. One, of 
course, is sex. Questionnaires on sexual 
attitudes and sexual activities are more 
likely than most to antagonize some 
people in the community. Discretion is 
probably the better part of wisdom, and 
the experimenter should not try to ob- 
tain such information through the pub- 
lic schools. 

A second ticklish issue is the invasion 
of the privacy of the home through 
questionnaires administered to children. 
Parents who would not object to an- 
swering questions about their own 
child-rearing practices may well object 
to having these practices reported by 
the child. It appears somewhat like the 
child’s being asked to spy on his parents. 

Any local issue on which there are 
strong feelings may be a very ticklish 
one for the research worker, particu- 
larly if his manner of investigation leads 
to the perception that his sympathies 
are on one side rather than another. 

The sensitivity of public administra- 
tors to public relations may appear to 
the investigator as a sort of cowardice. 
Perhaps it is in a way, but the research 
worker must remember that his project 
is much more important to him than to 
anybody else. There is little benefit to 
the school system from most individual 
research projects except the sense of 
virtue in having cooperated and the 
cumulative benefits of the entire field 
of research activities. These are impor- 
tant, and most administrators recognize 
it, but there is not a lot to make them 
feel a risky individual study is worth- 
while. Finally, most administrators re- 
alize that social-science research is still 
on precarious ground as far as public 
acceptance is concerned. Probably, if 
the question of allowing 2 or 3 per cent 
of the child’s time in school to be spent 
in serving as a research subject were 
brought to a vote, it would be rejected 
by the public as less important than the 
same time spent in academic work. 
From the point of view of practical 
politics, therefore, it is wise to be very 
cautious about public reaction. 

There are no hard and fast rules re- 
lating to any aspect of human relations. 
These suggestions may be helpful, but 




the most important principle is to ex- 
ercise the same tact and consideration 
with research subjects that is appropri- 
ate in dealing with colleagues, class- 
mates, or professors. 


Summary 


This chapter has the double purpose 
of serving as a preface to a volume and 
as a general introduction to research in 
child development. 

This book has been compiled in the 
belief that child-development research 
has much to gain from and to contrib- 
ute to other branches of behavioral sci- 
ence. It has been written in the hope of 
stimulating and improving the quality 
of research with children. To this end, 
it is intended to be a handbook of re- 
search methods for special use in the 
study of children and to contain some 
of the practical wisdom of experienced 
investigators. 

As an introduction, this chapter has 
been concerned with the most general 
issues in research in child development, 
running the gamut from the philosophy 
of science to the procurement of sub- 
jects. The topics that have been dis- 
cussed are (1) the logic of science and 
scientific strategy, (2) specifications for 
theories of child behavior and develop- 
ment, (3) the special problems met in 
research in these fields, (4) ethical prob- 
lems of working with children, and (5) 
problems of procuring subjects. 


Chapter 2 


Research Design in the Study of Developmental Problems 


William Kessen 


The study of child development derives 
from the primitive observation that hu- 
man beings change as they grow older. 
Whether the psychologist turns his at- 
tention to physical growth, or to lan- 
guage, or to motility, or to personality, 
he can readily confirm the common- 
sense principle that assigns to age an 
important place in the description of 
human behavior. From the days of the 
early baby biographers, students of 
child development have attempted to 
make systematic and reliable statements 
about this fundamental relationship. 
Classically, these statements have con- 
tained two kinds of terms or variables: 
on one side, a description of behavior 
(e.g., test performance, skill, use of 
words) or of state (e.g., height, endo- 
crine level, organ size); on the other 
side, some index of chronological age 
or developmental status (e.g., degree of 
ossification, mental age, sexual ma- 
turity). Much of the work of develop- 
mental psychologists can be cast in the 
summary form R = f(A), if it is rec- 
ognized that R (response) may be as 
various as color naming and length of 
tibia and that A (age) represents not 
only time since birth but includes as 
well such variants as physiological age 
and mental age. With these reservations 
entered, the following formulation may 
be proposed: a characteristic is said to 
be developmental if it can be related to 
age in an orderly or lawful way. This 
chapter has as its central theme a dis- 
cussion of research designs and pro- 
cedures which permit the effective 
investigation of developmental phe- 
nomena. 
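
As a minimal sketch of what an age-functional statement R = f(A) amounts to in practice, the following Python fragment fits a straight line to a handful of invented (age, response) pairs by ordinary least squares. The data, the linear form, and the name `fit_age_function` are illustrative assumptions, not findings or procedures taken from the text; a real age function need not be linear.

```python
def fit_age_function(ages, responses):
    """Least-squares line R = intercept + slope * A."""
    n = len(ages)
    mean_a = sum(ages) / n
    mean_r = sum(responses) / n
    sxy = sum((a - mean_a) * (r - mean_r) for a, r in zip(ages, responses))
    sxx = sum((a - mean_a) ** 2 for a in ages)
    slope = sxy / sxx
    intercept = mean_r - slope * mean_a
    return intercept, slope


# Invented scores, for illustration only
# (age in years, response in arbitrary units).
ages = [2, 3, 4, 5, 6]
responses = [10, 14, 19, 23, 29]

intercept, slope = fit_age_function(ages, responses)
predicted_at_7 = intercept + slope * 7
```

A fit of this kind describes a regularity between response and age; by itself it says nothing about why the response changes.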


The Analysis of Developmental 
Change 


It should be noted at the outset that 
although the foregoing specification of 
“developmental” may represent fairly 
the common usage of the term, the es- 
tablishment of a lawful relationship be- 
tween behavior and age is by no means 
the only, or even the most important, 
task of the child psychologist. The for- 
mulation is inadequate as a general state- 
ment of the procedures of child study 
in at least three ways. In the first place, 
there are important aspects of child be- 
havior that show an orderly relationship 
to age, yet they demand an analysis in 
terms of other variables. For example, 
the size of a child’s vocabulary can be 
set in a relatively simple relationship 
with chronological age, but most psy- 
chologists would insist that this devel- 
opment is dependent at least in part on 
the child’s history, or more generally, 
on variables other than time since birth. 

A second and closely related reserva- 
tion which must be stated in consider- 
ing the R = f(A) formula has to do 
with failure of control. One of the chief 
goals of systematic study is the segre- 
gation of determinants of events, that 
is, of those antecedent conditions or 
events which can be manipulated to 
produce a change in the phenomena 
under study. No matter how neat the 
function relating behavior to age may 
be, there is almost always some dis- 
satisfaction for the researcher unless he 
can go beyond the functional statement 
to say something about causes of or 
explanations for the effect. Even with- 
out an extended treatment of scientific 
explanation, the issue raised here can be 
understood by reflecting on the inade- 
quacy of a statement such as the fol- 
lowing: 

“Four-year-old children weigh 45 
pounds because they are four years 
old.” 


Child psychologists, whatever their the- 
oretical color, have with scarcely an 
exception postulated the action of some 
“more fundamental” or “less superficial” 
variable than age to account for devel- 
opmental change. In general, the ex- 
planatory notions advanced have em- 
phasized either physiological and mor- 
phological control (e.g., maturation, 
neural development) or environmental 


control (e.g., learning, trauma), but 
advocates of both positions seem equally 
uneasy about the use of age as an ex- 
planation of behavioral change. This is 
not mere prejudice; one of the goals of 
scientific explanation is to provide di- 
rections for the control of phenomena. 
The simple age-functions do not tell us 
how to change behavior and are there- 
fore frequently given a restatement in 
terms of manipulable antecedents. 

A third limitation of R = f(A) state- 
ments is their failure to take into ac- 
count individual variation. Because age 
is a parameter for the entire population 
under study, the age-functional gener- 
alizations do not provide an expression 
of the fact that children not only 
change with age but also show differ- 
ences one from another at the same age. 
Child psychologists seem, from the evi- 
dence of their research reports, to be 
increasingly concerned with the general 
question of why one group of children 
is different from another group of the 
same age rather than with the simple 
statement of age-to-age variation. For 
this purpose of describing variation, the 
age-functional statements are quite 
limited in usefulness. 

Although these reservations must 
be taken into account in a treatment 
of age-functional generalizations, the 
R = f(A) formula will continue to ex- 
press a good part of what is known about 
children. In a large number of cases, the 
general statement must suffice because 
we are ignorant of “true” or controlling 
antecedents. In plotting problem-solv- 
ing ability against chronological age, the 
researcher may be convinced that he is 
studying some complex resultant of 
neuronal maturation, history of rein- 




forcement, and so on, but until he has 
more precise information about how 
such variables affect child behavior he 
must be content with the less detailed 
expression of variation with age. 

There are other aspects of child be- 
havior in which the psychologist may 
be strongly persuaded that an environ- 
mental variation is relevant to a change 
in behavior but in which appropriate 
tests are impossible on ethical grounds. 
Thus the occurrence of smiling in the 
infant, although apparently universal, 
may be linked to the regularity of so- 
cial contact in the first weeks. Clearly, 
the soundest test of this possibility is 
the complete isolation of a group of 
babies and the comparison of their 
smiling behavior with that of normally 
reared infants. This procedure is mani- 
festly out of the question, so that the 
researcher is once more thrown back on 
an age-functional expression for the oc- 
currence and development of smiling. 
There are many developmental phe- 
nomena of this order that will remain 
resistant to further analysis because of 
the dangers of irreversible damage in- 
volved in manipulative study. 

Finally, the age-functional type of 
statement may remain part of the cor- 
pus of child psychology simply because, 
as a matter of empirical fact, some 
changes in behavior can be described 
most fruitfully as functions of age. 
Much of the research on physical 
growth, for example, and the bulk of 
our knowledge about developmental 
changes in intelligence-test performance 
have their basic statement in age-func- 
tional relationships. It is never possible 
to be certain that no change will take 
place in our treatment of these phe- 


nomena, but room should be left for 
the possibility that R = f(A) statements 
will remain as unreduced principles in 
a future child psychology. 

These arguments suggest that the 
study of development is a two-part en- 
terprise—the search for regular rela- 
tionships between behavior and age and 
the analysis of such relationships to pro- 
vide precise and effective ways of pre- 
dicting child behavior. These goals of 
the developmental psychologist are by 
no means independent; the regularities 
we observe as children grow will in- 
fluence the kind of systematic statement 
we make, and the preconceptions or 
theories that we bring to our research 
will influence the selection of our ob- 
servations. Theories of child develop- 
ment are, in a sense, systems of pro- 
posals about the processes that lie be- 
hind changes in behavior with age, but 
we are so far from an acceptable gen- 
eral theory of development that we 
must continue for some time the slow 
accumulation of evidence that is ex- 
pressed in age-functional generaliza- 
tions. 

In any case, whether the psycholo- 
gist’s primary interest is in general 
developmental formulas or in a more 
complex analysis of the changes in chil- 
dren’s behavior, the research design 
chosen will depend in large measure on 
the kinds of statements he wants to 
make. An investigator concerned with 
a description of the development of 
form-perception will choose a design 
different from that used to study va- 
riation of behavior as a function of 
early feeding patterns; a researcher in- 
terested in the details of learning will 
set up his study in a way quite different 
from that of the researcher interested in 
personality variation; and so on. There 
is no “standard” design for the study 
of child development; rather it is nec- 
essary to know the goal of the research 
before a sensible decision can be made 
about the appropriate research design. 
In the pages to follow a number of 
prototypical statements about the be- 
havior of children are put under ex- 
amination. The choice of research de- 
sign in the study of development is 
discussed in the light of what these 
statements demand. This is followed by 
a discussion of some of the ways in 
which preconception or theory deter- 
mines the course of developmental re- 
search. Finally, a treatment of practical 
problems in the study of development 
brings us back to current issues in the 
investigation of children’s behavior. 


Statements About Developmental Change 


RESPONSE VARIATION 
AS A SIMPLE FUNCTION 
OF AGE: R = f(A) 


The most general statement of devel- 
opmental change sets in relationship 
some behavior of the child and some 
index of his age or level of maturity. 
From the observations of Tiedemann 
down to the present, statements of this 
sort have made up a great part of our 
knowledge of child development. 

The classical work of Scammon 
(1930) on physical growth and of 
Gesell (1945) on changes in motor be- 
havior may stand as exemplary illustra- 
tions of the basic age-functional gen- 


eralization. Although research in sup- 
port of such general formulations is 
apparently becoming less popular, child 
psychologists continue to talk about 
“Twos” and “Tens,” expressing in this 
way the regularities observed in chil- 
dren’s behavior as they grow older. 

Choice of Research Design. In study- 
ing generalizations of the R = f(A) 
order, the chief task of the researcher 
is to reduce the variation in behavior 
contributed by factors other than age. 
This is obviously an impossible assign- 
ment if our goal is “pure age” gener- 
alizations; older children are always 
children of more extensive and complex 
history, bigger, more intelligent, better 
educated, and so on. To put the point 
in the language of method, there is no 
control group for studies of the basic 
developmental generalization, no group 
in which age and associated variables do 
not change. However, once this irre- 
mediable difficulty is stipulated, there re- 
mains wide variation in the degree of 
possible contamination from variables 
other than age. Most researchers would 
be unhappy with a study of preference 
for play materials, say, in which the 
1-year-old group were girls and the 
2-year-old group, boys, or a study of 
adolescent disturbance in which differ- 
ent age groups were drawn from widely 
different socioeconomic levels. The 
problem can be stated in a general 
form: to what degree do groups differ- 
ing in age differ on other dimensions as 
well? 
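
One way to make this question concrete is to tabulate how each age group is composed on some other dimension before the groups are compared. The sketch below does this for sex, using invented records that reproduce the flawed play-materials design described above; the record layout and the name `composition_by_age` are assumptions made for illustration.

```python
from collections import Counter, defaultdict


def composition_by_age(records, key):
    """Proportion of each value of `key` within every age group."""
    counts = defaultdict(Counter)
    for rec in records:
        counts[rec["age"]][rec[key]] += 1
    return {
        age: {value: n / sum(c.values()) for value, n in c.items()}
        for age, c in counts.items()
    }


# Invented sample: every 1-year-old is a girl and every 2-year-old is a boy.
records = (
    [{"age": 1, "sex": "girl"} for _ in range(10)]
    + [{"age": 2, "sex": "boy"} for _ in range(10)]
)

balance = composition_by_age(records, "sex")
# Age and sex are completely confounded when the groups share no value.
confounded = not (set(balance[1]) & set(balance[2]))
```

The same tabulation could be run on any recorded characteristic, such as socioeconomic level, to see how far the age groups differ on it.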

There are two ways of reducing 
unwanted variation among groups— 
choice of sampling procedures and the 
method of data collection. 

Sampling. The issues raised in a 
treatment of sampling procedures are 
important to all psychological research, 
but, for reasons which will become clear 
shortly, the student of children seems 
peculiarly likely to stub his toe on this 
problem. The statisticians’ rule is de- 
ceptively simple; for justified generali- 
zation of findings to a specified popu- 
lation, each member of the population 
should have an equal chance of being 
included in the research sample. The 
implication of this mandate is that the 
researcher, having decided to study a 
certain population (e.g., American chil- 
dren), should arrange his enlistment 
procedures to sample at random from 
this group. For better or worse, this 
instruction cannot be obeyed in present- 
day studies of development; the cost in 
time and money of random sampling is 
prohibitive. Nor can the child psy- 
chologist usually take the way out 
available to the pollster, that is, by the 
use of a stratified sample from the 
population under study. Aside from 
practical problems almost as great as 
those of random sampling, the kind of 
information that would indicate the 
variables to be used for appropriate 
stratification is frequently not available. 
The result of these constraints is the 
almost universal use of “unsampled” 
subjects in developmental research; the 
researcher uses the children who are 
available when he needs them, often 
without convincing evidence of their 
representativeness in relation to some 
larger population. This necessary com- 
promise with ideal standards sets two 
demands on child development re- 
search: first, that replication with more 
than one sample take place in order to 
provide empirical support for the gen- 


eralization of findings from unsampled 
groups and, second, that we seek out 
information that will tell us what vari- 
ables can be disregarded in the selection 
of subjects and what variables must be 
controlled by more sophisticated sam- 
pling techniques. It may be a long time 
before the child psychologist can have, 
in his selection of subjects, the confi- 
dence that the chemist has in sampling 
distilled water in New York and dis- 
tilled water in Ann Arbor, but the re- 
sults of developmental study will carry 
greater conviction as we are better able 
to state dimensions of variation in the 
populations studied. It should be noted 
that the animal psychologist also nor- 
mally uses “unsampled” groups in much 
of his research; the advantages he can 
claim over the student of children are 
better knowledge and control of indi- 
vidual history and perhaps better infor- 
mation on what variables require special 
treatment. 

In short, the child psychologist can- 
not ordinarily reduce complicating 
variation between groups by means of 
sophisticated sampling procedures; he 
must depend on replication and on his 
knowledge of the variables that tend to 
vary with age in order to gain confi- 
dence in his age-functional generaliza- 
tions. 
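
The contrast between the ideal sampling rule and common practice can be sketched in a few lines of Python. The population of 1,000 numbered children, the region labels, and the function names are hypothetical; the point is only that a simple random sample gives every member the same chance of inclusion, a proportional stratified sample does so within strata that are known in advance, and an "unsampled" group takes whoever is at hand.

```python
import random


def simple_random_sample(population, n, seed=0):
    """Every member has the same chance of inclusion."""
    return random.Random(seed).sample(population, n)


def stratified_sample(population, stratum_of, fraction, seed=0):
    """Draw the same fraction at random from each stratum."""
    rng = random.Random(seed)
    strata = {}
    for member in population:
        strata.setdefault(stratum_of(member), []).append(member)
    sample = []
    for members in strata.values():
        sample.extend(rng.sample(members, round(len(members) * fraction)))
    return sample


def available_subjects(population, n):
    """'Unsampled' subjects: simply the first n children at hand."""
    return population[:n]


def region(child):
    """Hypothetical stratum: IDs below 700 are 'urban', the rest 'rural'."""
    return "urban" if child < 700 else "rural"


population = list(range(1000))  # hypothetical child IDs

srs = simple_random_sample(population, 50)
strat = stratified_sample(population, region, 0.05)
handy = available_subjects(population, 50)
```

In this invented population the available group happens to be entirely urban, which is the kind of unrepresentativeness that replication with further samples is meant to expose.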

Methods of Data Collection. A sec- 
ond approach to the problem of reduc- 
ing variation in behavior from sources 
other than age lies in the choice of a 
technique for data collection. Stripped 
of subtleties that will require attention 
later, a distinction can be made between 
two methods of gathering observations 
on the development of children. In one 
of these the researcher makes repeated 
observations on the same group of sub- 
jects—the longitudinal technique; in the 
other he makes a limited number of ob- 
servations on different groups of sub- 
jects—the cross-sectional technique. In 
many problems in the study of develop- 
ment either of the two methods may be 
used. For example, the growth of vo- 
cabulary may be observed by following 
20 children from preverbal age to ado- 
lescence, with annual examination of 
their language skills, or, alternatively, 
groups of 20 children of different ages 
may be studied at more or less the same 
time, with no overlap in the member- 
ship of groups.¹ If we take this example 
as prototypical, several points of com- 
parison are apparent at once. The re- 
peated-measurements or longitudinal 
program will extend over a longer pe- 
riod of time than the cross-sectional (in 
the hypothetical case of vocabulary de- 
velopment, a dozen years as compared 
with as many weeks), even though the 
amount of time spent gathering data is 
precisely the same in both cases. It is 
largely for this reason that the longi- 
tudinal technique often appears to be 
inefficient or cumbersome, and there is 
no doubt that the literature contains 
relatively few reports of repeated meas- 
urements over long periods of time.² 


¹Bell (1953, 1954) has suggested a com- 
promise between cross-sectional and longi- 
tudinal study based on the assumption of 
equivalence in the behavior of different chil- 
dren. This promising proposal may serve to 
reduce some of the difficulties in develop- 
mental research that are discussed later. 

²Stone and Onqué (1959) present a bib- 
liography of longitudinal studies, from Pesta- 
lozzi to 1954. More recent longitudinal re- 
search has been noted in several volumes of 
the Annual Review of Psychology. 


Nonetheless, there are advantages to be 
claimed for the longitudinal method 
that warrant closer examination of its 
properties. 
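
The bookkeeping behind the comparison of the two designs in the vocabulary example can be sketched as follows. The age range of 1 to 13 years (a span of a dozen years) and the function names are assumptions chosen to match that example. Both plans yield the same number of observations, but they differ in the number of children enlisted and in the calendar time required.

```python
def longitudinal_plan(n_children, ages):
    """The same children are observed once at every age."""
    return {
        "observations": n_children * len(ages),
        "children": n_children,
        "years_elapsed": max(ages) - min(ages),
    }


def cross_sectional_plan(n_per_group, ages):
    """A separate group at each age, all observed at about the same time."""
    return {
        "observations": n_per_group * len(ages),
        "children": n_per_group * len(ages),
        "years_elapsed": 0,
    }


ages = range(1, 14)  # assumed annual examinations from age 1 through age 13
longitudinal = longitudinal_plan(20, ages)
cross_sectional = cross_sectional_plan(20, ages)
```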

The longitudinal study in child- 
development research is a variation of 
the “matched group” research design 
used in many studies of behavior, and 
it is this characteristic that supplies 
whatever formal advantage it has over 
cross-sectional studies of R = f(A) 
generalizations. Instead of comparing 
independently derived groups of scores 
and being limited thereby to statements 
of the differences between group means, 
the researcher is able to put his repeated 
measures in parallel, comparing each 
observation of a subject’s behavior with 
some earlier or later observation. An 
examination of the fictitious data in 
Table 2-1 will illustrate this difference 
between the methods and one of its 
implications. If there are small but re- 
liable changes in the behavior of sub- 
jects over age (Table 2-1 A), then lon- 
gitudinal study provides, because of the 
matching characteristic, more sensitive 
estimates of such changes. When the 
differences between groups are large 
(Table 2-1 B), the advantage of the 
longitudinal technique is less impressive. 
Making the point more generally, we 
may say that the longitudinal design 
is to be favored in the study of those 
age-functional relationships in which 
relatively small but individually stable 
changes over age are expected. 
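
The comparison can be checked numerically on the fictitious scores of Table 2-1. The sketch below, in Python with only the standard library, computes a t statistic that treats the two columns as independent groups and, separately, summarizes the within-subject differences that a longitudinal design makes available. The function names are assumptions, and the tabled scores are artificial: every subject gains exactly the same amount, so the differences have zero spread and a paired t ratio would be undefined, which is why only the mean and standard deviation of the differences are reported.

```python
import math
import statistics

# Fictitious scores from Table 2-1 (nine subjects, two ages).
TABLE_A = ([4, 5, 6, 7, 8, 9, 10, 11, 12], [5, 6, 7, 8, 9, 10, 11, 12, 13])
TABLE_B = ([4, 5, 6, 7, 8, 9, 10, 11, 12], [13, 14, 15, 16, 17, 18, 19, 20, 21])


def independent_t(first, second):
    """t statistic treating the two columns as unrelated groups."""
    se = math.sqrt(
        statistics.variance(first) / len(first)
        + statistics.variance(second) / len(second)
    )
    return (statistics.mean(second) - statistics.mean(first)) / se


def paired_summary(first, second):
    """Mean and standard deviation of the within-subject differences."""
    diffs = [b - a for a, b in zip(first, second)]
    return statistics.mean(diffs), statistics.stdev(diffs)


t_a = independent_t(*TABLE_A)      # about 0.77: difference lost in group spread
t_b = independent_t(*TABLE_B)      # about 6.97: difference evident either way
paired_a = paired_summary(*TABLE_A)
paired_b = paired_summary(*TABLE_B)
```

In part A the independent-groups statistic is small although every subject gains one point, so only the matched comparison brings out the change; in part B the group difference is already large, and matching adds little.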

TABLE 2-1. FICTITIOUS DATA WHICH ILLUSTRATE THE “MATCHED GROUP” 
CHARACTERISTIC OF LONGITUDINAL RESEARCH DESIGN 

                        A                           B 
              Scores at   Scores at       Scores at   Scores at 
              Age a       Age a + t       Age a       Age a + t 
                  4           5               4          13 
                  5           6               5          14 
                  6           7               6          15 
                  7           8               7          16 
                  8           9               8          17 
                  9          10               9          18 
                 10          11              10          19 
                 11          12              11          20 
                 12          13              12          21 
Mean score        8           9               8          17 

Following the general prescription to 
reduce variation from factors other 
than age, we can segregate another class 
of age-functional generalizations that 
profit from longitudinal study, namely, 
those in which age differences are cor- 
related with cultural shifts. For exam- 
ple, a cross-sectional study of night 
fears or response to sudden awakening 
among London children might well 
show striking changes at ages 15 to 18, 
not necessarily because of some charac- 
teristic of late adolescence, but because 
these children had been through the Bat- 
tle of Britain at susceptible ages. Stu- 
dents of child-rearing practices in pri- 
mitive groups (Whiting and Child, 
1953) may defend convincingly the 
presence of a stable cultural frame 
which permits cross-sectional assess- 
ment, but the speed and variety of 
change in the technology of modern 
communities make it increasingly diffi- 
cult for the child psychologist to as- 
sume with confidence that his 8-year- 
olds had much the same kind of early 
experience that his 3-year-olds had.³ It 
should be clear, however, that the ap- 
plication of longitudinal methods will 
not by itself solve the problem of ana- 
lyzing variation ascribable to cultural 
change. In fact, if a repeated-measure- 
ments study extends over a long period 
of time, conclusions drawn from it may 
be of limited generalizability to later 
and changed cultural settings. How- 
ever, the longitudinal researcher is in a 
better position to assess the impact of 
shifts in milieu than is the investigator 
who studies at one time the behavior of 
age-differing groups of children. 

³There are, of course, cases in which va- 
riation assignable to cultural shifts need not 
be assessed or reduced; the educational psy- 
chologist planning a curriculum for third and 
fourth graders is interested in how well they 
can read, not in whether differences between 
the two groups are “age-determined” or “cul- 
ture-determined.” 
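
The confounding of age with cultural change can be made explicit by noting that, in a study conducted in a single year, birth year is fixed by age. The sketch below uses an invented study year and an invented birth year, and its function names are assumptions. In the cross-sectional plan each age group belongs to a different birth cohort, whereas in the longitudinal plan the cohort is constant and the year of observation varies instead.

```python
def cross_sectional_cohorts(study_year, ages):
    """Birth year of each age group when all are observed in one year."""
    return {age: study_year - age for age in ages}


def longitudinal_cohorts(birth_year, ages):
    """(birth year, year of observation) for one cohort followed over age."""
    return {age: (birth_year, birth_year + age) for age in ages}


ages = [3, 8]  # the 3-year-olds and 8-year-olds of the text
cross = cross_sectional_cohorts(1960, ages)  # 1960 is an arbitrary study year
longi = longitudinal_cohorts(1952, ages)     # 1952 is an arbitrary birth year
```

Neither plan separates age from historical change completely: the first varies birth year along with age, and the second varies the year of observation along with age.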

In summary, statements of the gen- 
eral form R = f(A) require for their 
appropriate investigation the selection 
of age-differing groups which vary as 
little as possible on dimensions other 
than age. Homogeneity of groups can 
usually be increased by the use of re- 
peated measurements on the same chil- 
dren, but the decision for longitudinal 
or cross-sectional methods must be 
made on the further criteria of research 
economy and the necessity of evaluat- 
ing cultural trends. Beyond this, the 
recognition by psychologists that much 
of the child behavior that interests them
varies over age in different ways for 
different subpopulations of children has 
resulted in a striking reduction in re- 
search emphasis on generalizations of 
the bald R = f(A) sort. 


RESPONSE VARIATION 
WITH AGE IN LIMITED 
POPULATIONS: R = f(A, P)


The simple age-function relationships 
discussed in the preceding section are 
usually extended or supplemented by 
statements of developmental variation in 
subgroups defined by some special characteristic.
The most usual criterion for
separation into different groups is sex, 
and the literature is replete with de- 
scriptions of developmental phenomena 
plotted separately for boys and girls 
(Terman and Tyler, 1954). Analogous 
subgroups may be established on other 
criteria, e.g., hypothesized genetic variation,
intelligence differences, birth order,
etc., to establish generalizations of
the order R = f(A, P). Goodenough’s 
(1931) investigation of sex differences 
in the occurrence of anger outbursts 
and Shuttleworth’s (1939) description 
of mental age changes in early- and 
late-maturing boys may serve as typi- 
cal representatives of studies leading to 
R = f(A, P) statements. 

Choice of Research Design. The con- 
siderations relevant in the selection of 
design for the investigation of simple 
age-functional relationships are relevant 
to the study of development in sub- 
populations. The basic problem of ex- 
traneous variation is compounded, of 
course, because the researcher must aim 
at reducing not only the variation in
age-differing groups but must also try 
to guard against additional sources of 
variation on the dimension that defines 
special groups. On occasion, this latter 
issue may be given negligible attention. 
For example, a study of physical growth 
in first- and second-born children will 
be relatively free of systematic varia- 
tion from sources other than age and 
birth order. It is extremely difficult, 
however, to extend this list of “safe” 
splits in population very far; even in 
the classical study of sex differences the 
differential history of boys and girls 
exerts an incalculable effect on the 
course of development. When one shifts 
to a study of racial differences or dif- 
ferences in social class, the number of 
confounding variables becomes enor- 
mous. In a sense, R = f(A, P) gener- 
alizations are shorthand for the more 
accurate expression R = f(A, P_1, ..., P_n),
in which variables correlated with the 
variable under consideration are recog- 
nized as relevant to the conclusions 
drawn from an investigation of sub- 
populations of children. It is convenient 
to express obtained differences in the 
behavior, say, of boys and girls as a 
simple function of age and sex; it is 
obviously far different to conclude that 
the groups do not differ on other di- 
mensions as well. 

¹ Once more, it should be noted that for
some usually practical purposes contamination
from correlated variables does not have to be
reduced. An educator in New York may be
concerned simply with how well 8-year-old
Puerto Ricans understand English, not with the
sources of their difference from native-born
children.

Once the complexity of R = f(A, P)
generalizations is recognized, the psychologist
can proceed in his study, taking
the precautions or the risks discussed
earlier in regard to sampling and
the choice between longitudinal and 
cross-sectional data-collection tech- 
niques. There are, however, two spe- 
cial and important instances of the 
R = f(A, P) sort that almost invariably 
demand longitudinal study—the case of 
the “lost sign” and the case of differ- 
ential rates of change. 

To illustrate the group of problems 
included in the category of “lost signs,” 
compare the study of sex differences 
with the study of early differences in 
general activity. When sex is the sub- 
population under study, the researcher 
has no difficulty in deciding the assign- 
ment of his subjects; at birth, at age 
6, or at age 16, boys are discriminably 
separable from girls. In the study of 
general activity, however, the re- 
searcher is unable to tell whether his 
16-year-old subjects belong in the “high 
early activity” group or the “low early 
activity” group. Segregation on dimensions
of this variety involves a “lost sign”
in that the investigator cannot deter- 
mine, at some later age, the character- 
istics of his population at an earlier age. 
In order to assess the differential devel- 
opment of children who vary in early 
general activity, it is necessary to set 
up the subpopulations at an early age 
and to make repeated measurements on 
the same subjects over the course of 
time. Quite a few characteristics of 
children that interest the student of 
child development fall into this cate- 
gory, e.g., the relation to later behavior 
of early intellectual differences, of au- 
toerotic activity in infancy, of degree 
of disturbance at puberty, or of re- 
sponse to the first months of school. 


The study of populations assigned on 
these dimensions ordinarily requires 
longitudinal study. Suppose, for exam- 
ple, that a researcher is interested in the 
relationship between intensity of 
stranger anxiety and dependence on the 
mother in the first days of nursery 
school. To get his groups for study, he 
must observe children during the sec- 
ond half of the first year, make his 
evaluations of degree of stranger anx- 
iety, then wait for the children to age. 
Only when they have entered nursery 
school and their dependency can be as- 
sessed will the research be completed. 

There is an alternative to the longi- 
tudinal study of behavior related to 
“lost signs” that depends on the report 
of behavior rather than on its observa- 
tion. In the example of stranger anx- 
iety the researcher may observe his 
subjects in nursery school to determine 
dependency score and interview the 
children’s mothers to establish the level 
of stranger anxiety observed in the past. 
In a study based completely on report 
the researcher may assess both depend- 
ency and anxiety by examination of ret- 
rospective maternal report. There is no 
doubt that the use of maternal recollec- 
tion introduces enormous savings of 
time and effort in the study of child 
development. One does not have to wait 
for children to grow, and one can get 
information rather quickly on many 
aspects of the child’s behavior, infor- 
mation that would be laboriously ob- 
tained if one drew it from observation 
of the children themselves. 

These remarkable gains from the 
analysis of report rather than the analy- 
sis of observations of behavior have 
brought the maternal interview to the
forefront of substitutes for longitudinal
study. However, several questions must
be addressed to the researcher using this
technique. The first concerns the accuracy
of maternal report about contemporaneous
events; how well is the
mother able to describe or categorize
her child’s behavior at any moment? 
Will she, for example, make evaluations 
of dependency in nursery school that 
bear a systematic relationship to teach- 
er’s report or to the summary of the 
psychologist’s observations of the child? 
Fortunately, this is a testable question 
in many cases, so that the investigator 
concerned with the validity of maternal 
accounts of child behavior can compare
what he sees with what the mother says 
about her child. It is less fortunate for
our assurance on this issue that checks 
of validity are infrequently carried out, 
although Gordon (1957) and Smith 
(1958), among others, have provided
some evidence on the accuracy of maternal
report. Usually, the mother’s report
is accepted as valid without external
support, or else its validity is assumed
from its stability, i.e., from the
mother’s tendency to be consistent in 
her account. It should be noted, how- 
ever, that the child psychologist fre- 
quently has no choice in this regard; 
there are responses of the child that the 
mother alone (or some other adult 
member of the family) is competent to 
report on. Unless the observer can 
maintain a twenty-four hour vigil, he 
must rely on the mother to tell him 
about the child’s response to normal 
routines in feeding and sleep, about the 
child’s reaction to special events such 
as the visit of relatives or the introduc- 
tion of a pet, and about many aspects 
of the child’s life that are considered
relevant to his development. 

A satisfactory solution to the prob- 
lem of validity of contemporaneous re- 
port is difficult to achieve, but two safe- 
guards seem reasonable; first, to check 
maternal report against the report of 
other family members or, better, against 
observation of the child whenever this 
is possible and, second, to attempt an 
evaluation of the likelihood of error in 
maternal report. This precaution would 
probably demand attention both to in- 
dividual differences in tendency to dis- 
tort and to variations in distorting as a 
function of the particular behavior un- 
der study. It is to be expected that some 
mothers will distort more than others 
and that some responses of the child 
will be more liable to distorted report 
than others. Research related to the 
validity of maternal accounts is funda- 
mental to the study of children because 
even the most industrious investigator 
will not be able to make the variety and 
number of observations that a mother 
can. 
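
The first safeguard, checking maternal report against observation, comes down to an agreement computation. The sketch below is only illustrative: the ratings are invented, the two-category "high"/"low" scale is an assumption, and Cohen's kappa is used as one convenient chance-corrected index (the text does not name a particular statistic).

```python
from collections import Counter

# Hypothetical paired ratings of the same ten children ("high"/"low"
# dependency): one from the mother's report, one from direct observation.
mother = ["high", "high", "low", "low", "high",
          "low", "high", "low", "low", "high"]
observer = ["high", "low", "low", "low", "high",
            "low", "high", "high", "low", "high"]


def agreement(a, b):
    """Proportion of children on whom the two sources give the same rating."""
    return sum(x == y for x, y in zip(a, b)) / len(a)


def cohen_kappa(a, b):
    """Agreement corrected for the agreement expected by chance alone."""
    n = len(a)
    observed = agreement(a, b)
    count_a, count_b = Counter(a), Counter(b)
    categories = set(a) | set(b)
    expected = sum(count_a[c] * count_b[c] for c in categories) / (n * n)
    return (observed - expected) / (1 - expected)


print("raw agreement:", agreement(mother, observer))
print("kappa:", round(cohen_kappa(mother, observer), 3))
```

Running the same computation separately for each behavior under study, or for each mother across several behaviors, would give a rough picture of the two kinds of variation in distortion mentioned above.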

The validity of report about contemporaneous
events is a problem of equal
standing for both longitudinal and
cross-sectional methods of data collection,
but the second general question
about information obtained from parents—the
validity of retrospective report—is
peculiar to single-measurement
studies. Suppose, for example, that the
researcher studying stranger anxiety 
and dependency in nursery school is sat- 
isfied that his dependency measure is 
sound, either because he has made care- 
ful observations himself or because he 
has evidence of the accuracy of ma- 
ternal reports of contemporaneous 
events. What confidence can he place
in the mothers’ retrospective evaluation of their
children’s stranger anxiety? Once again,
the problem is a complex one, with 
several considerations relevant to the 
researcher's decision. There is, of 
course, the fact that retrospective re- 
port is based on events at a greater dis- 
tance in time than events reported con- 
temporaneously, and some decrease in 
accuracy as a function of time may be 
expected. But beyond this obvious 
source of error there are several further 
infusions of potential error in mothers’ 
retrospective accounts of child behavior. 
For one, child psychologists frequently 
depend on maternal recall for informa- 
tion about behavior in the child that is 
likely to be far from neutral in its im- 
pact on the mother. We may ask about 
thumb sucking, or responses to wean- 
ing, or reactions to the birth of a sibling 
—questions that are probably different 
in their effect from questions about 
physical growth, or age of walking, or 
responses to the family cat. If the 
mother had been in conflict about her 
own behavior at crucial points in the 
child’s life and if she has some “theory” 
about what the “right” answer should 
be, it is to be expected that her recall 
will be influenced by factors other than 
the behavior of her child. It is un- 
doubtedly risky for the child psycholo- 
gist to depend on the mother of a 6- 
year-old to tell him what the child’s 
behavior was when toilet training was 
begun at 9 months; we are ignorant of 
the relation between her statement and 
the statement that would have been 
made by an observer (even the mother) 
of the child at 9 months. 

Beyond the distortions to be expected
in the report of so-called crucial events, 
there is a less obvious source of error 
in retrospective accounts—the error of 
the limited sample. Unless the reporting 
mother has had some professional con- 
tact with children, her knowledge 
about “normal” or “average” behavior 
is likely to be slim indeed. In response 
to questions about the child’s response 
to toilet training or about the intensity 
and duration of his early crying, the 
untrained mother has no normative con- 
text from which to draw her answer. 
The same duration and intensity of cry- 
ing may be for one mother “a lot” and 
for another “a little,” not because of 
profound personality variation but be- 
cause of ignorance about how much 
children cry. Of course, sophisticated 
researchers take pains to set up inter- 
views that reduce gross errors of this 
sort, but there will be some influence on 
maternal report derived from naïveté, 
at least so long as child psychologists 
depend on data that resist simple nu- 
merical statement. 

For these several reasons, the child 
psychologist interested in the sequelae 
of “lost signs” will do well to examine 
the assumptions he must make in using 
retrospective maternal accounts to place 
his subjects in subpopulations for study. 
There is some danger that he may not 
be studying the development of chil- 
dren but rather the relationships that 
exist between different responses of 
mothers. As was noted earlier, most of 
the problems raised here are susceptible 
to empirical study, and much suggestive 
research on children can be more adequately
evaluated when we have evidence
on the accuracy of maternal report,
contemporaneous and retrospective.
In the meantime, the investigator
of this special group of R = f(A, P)
generalizations should consider the
utility of longitudinal study in reducing
error.

The repeated-measurement technique 
is also the method of choice in studies 
that deal with rates of change of be- 
havior or state. For example, in the 
study of the spurt in height near me- 
narche, a cross-sectional investigation 
will show a smooth and gradual change 
in increments of height over age. If, 
however, longitudinal data are available, 
it can be demonstrated that this representation
poorly expresses the dramatic
shifts seen in individual curves (Tanner,
1955). By replotting the data around 
age of menarche instead of along a sim- 
ple chronological baseline, a fairer pic- 
ture is provided of adolescent growth. 
It should be noted, of course, that if
the researcher is interested only in the
mean increment in height at each age,
a cross-sectional study will do the job as
well as a longitudinal one. If a representation
that reflects the pattern of individual 
growth of behavior is desired and there 
is reason to suspect variation among 
subjects in the timing of change, then 
the longitudinal method is to be pre- 
ferred over the cross-sectional. 
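
The smoothing effect of averaging over chronological age can be shown with a small numerical sketch. Nothing here is drawn from Tanner's data: the three girls, their peak ages (10, 12, and 14), and the increment values are invented, and the age of peak growth stands in for age of menarche.

```python
# Hypothetical yearly height increments (cm) for three girls whose growth
# spurts peak at different ages. All numbers are invented for illustration.
BASE, SPURT = 5.0, 4.0  # baseline yearly gain and extra gain in the peak year


def increment(age, peak_age):
    """Yearly gain: baseline plus a triangular spurt centred on peak_age."""
    distance = abs(age - peak_age)
    return BASE + max(0.0, SPURT - 2.0 * distance)


ages = list(range(8, 17))
peak_ages = [10, 12, 14]
curves = [[increment(a, p) for a in ages] for p in peak_ages]

# Chronological baseline: mean increment at each age across the girls.
by_age = [sum(c[i] for c in curves) / len(curves) for i in range(len(ages))]

# Realigned baseline: mean increment at each offset from the girl's own peak.
offsets = [-2, -1, 0, 1, 2]
by_offset = [
    sum(increment(p + k, p) for p in peak_ages) / len(peak_ages)
    for k in offsets
]

print("largest mean gain by age:   ", round(max(by_age), 2))
print("largest mean gain by offset:", round(max(by_offset), 2))
```

In this invented case every girl gains 9 cm in her peak year, yet the age-averaged curve never rises above about 6.3 cm; only the realigned curve recovers the individual spurt.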

A related design, in which there is 
no substitute for longitudinal study, is 
that in which the subpopulations are 
established by examination of rate of 
change. If one wants to know about 
personality changes in fast-maturing
boys as compared with slow-maturing 
boys (Jones and Bayley, 1950), it is 
necessary to determine with repeated 
measurement the appropriate assignment
to groups. Similarly, there are
suggestions in the literature that rate of 
intellectual development may show wide 
individual variation that warrants fur- 
ther study of associated behavior 
(Bayley, 1955). It is clear that repeated 
measures are necessary to assess rate of 
change, and studies of this order will 
always require longitudinal investiga- 
tion. 
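
Why rate of change cannot be had from a single measurement can be made concrete in a few lines. The following sketch rests on assumptions: the children, ages, and scores are invented, and a median split is only one possible rule for forming "fast" and "slow" groups.

```python
from statistics import median

# Hypothetical repeated measurements: child -> [(age_in_years, score), ...].
records = {
    "child_a": [(10.0, 40.0), (12.0, 52.0)],
    "child_b": [(10.0, 42.0), (12.0, 46.0)],
    "child_c": [(10.5, 38.0), (12.5, 54.0)],
    "child_d": [(10.0, 45.0), (12.0, 50.0)],
}


def rate_of_change(measurements):
    """Change in score per year between the first and last measurement."""
    (age0, score0), (age1, score1) = measurements[0], measurements[-1]
    return (score1 - score0) / (age1 - age0)


rates = {child: rate_of_change(m) for child, m in records.items()}

# Assumed grouping rule: children above the median rate are called "fast".
cutoff = median(rates.values())
groups = {child: ("fast" if r > cutoff else "slow")
          for child, r in rates.items()}

print(rates)
print(groups)
```

A child measured only once yields a zero age interval, and the function fails with a division by zero, which is the computational counterpart of the point that such groups can be formed only from repeated measures.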

In summary, statements of the R = 
f(A, P) kind are somewhat more com- 
plicated extensions of the simple age- 
functional relationship. They can be 
established by cross-sectional study 
when it is possible to segregate sub- 
populations on the variable under study 
(P) across the age range being investi- 
gated. For two special classes of R = 
f(A, P) generalizations—the study of 
the sequelae of lost signs and the analy- 
sis of sequence or rate of change of 
behavior and state—the longitudinal 
method is usually preferable to the 
cross-sectional.


RESPONSE VARIATION 
WITH AGE, INTERACTING 
WITH ENVIRONMENTAL 
CHANGE: R = f(A, S)


Control of behavior becomes possible 
and prediction of behavior often be- 
comes more precise when the psychol- 
ogist is able to relate behavioral change 
to some discriminable aspect of the en- 
vironment. As the broad outlines of 
development are sketched, there is an 
increasing interest in the isolation of 
manipulable antecedents in the behavior 
of children. The results of studies with 
such aims can usually be stated in the 
form R = f(A, S).

In general, our knowledge of the in- 
teractive effect of age and environment 
derives from two kinds of research setting,
one involving experimental manipulation
and the other involving variation
in conditions. The first of these
settings is given detailed treatment in 
Chapter 4, and only a cursory summary 
is given here of its place in the study 
of development. As in all experimental 
study, the chief advantage of this tech- 
nique for the study of children is the 
possibility of random assignment to 
groups treated differently; there is no 
need to worry about correlated differ- 
ences when assignment to subgroups is 
made purely by chance. It deserves it- 
eration that an experiment is not de- 
fined by manipulation but rather by 
manipulation applied to one of a ran- 
domly selected set of subject groups. 
Thus a study of learning in high- and 
low-dependency children does not meet 
this basic criterion; this is rather a study 
of interaction among environmental 
change, dependency, and whatever 
variables correlate with the dependency 
segregation. Stated in the notation 
adopted here, such a study is of the 
R = f(A, P, S) variety and not a “pure”
experiment. If studies like this are excluded
from consideration as examples
of R = f(A, S) generalizations, the
number of developmental researches 
that fit this category is seen to be quite 
small. Most are concerned with learning
over relatively short periods of time
(McGinnis, 1929; Bijou, 1957) and are,
like all learning studies, longitudinal
(i.e., repeated-measurement) rather than
cross-sectional. Exceptions to this
general statement are investigations of
training effects in twins (McGraw, 
1935; Gesell and Thompson, 1929). 
Psychologists are usually blocked in
their study of development by experimental
techniques because they lack
the power to effect important manipulations
or because of ethical proscriptions.
It is at this point that animal research on
developmental problems can provide us
with useful data, and important information
is beginning to enter the
literature from comparative developmental
studies using experimental methods
(Beach and Jaynes, 1954).
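
The random assignment that defines an experiment in the sense used above can be sketched briefly. The subject labels, the two group names, and the seed are all hypothetical; the point is only that chance, and not any characteristic of the child, decides the group.

```python
import random


def assign_at_random(subjects, groups=("treatment", "control"), seed=None):
    """Shuffle the subjects, then deal them into the groups in rotation."""
    rng = random.Random(seed)
    shuffled = list(subjects)
    rng.shuffle(shuffled)
    assignment = {name: [] for name in groups}
    for i, subject in enumerate(shuffled):
        assignment[groups[i % len(groups)]].append(subject)
    return assignment


# Twenty hypothetical subjects; the seed only makes the example repeatable.
subjects = [f"child_{i:02d}" for i in range(1, 21)]
assignment = assign_at_random(subjects, seed=1960)

for name, members in assignment.items():
    print(name, len(members), sorted(members)[:3], "...")
```

Sorting children into high- and low-dependency groups, by contrast, hands the assignment to a property of the child, so that whatever correlates with dependency travels along with it.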

A caution should be sounded in the 
experimental study of children. It is
almost always necessary to have a control
group that does not receive experimental
treatment. The need for an
“aging” control is clear enough in the 
study of practice effects on “maturing”
behavior; it is not perhaps so obvious 
in the case of short-term longitudinal 
studies such as those of learning. How- 
ever, until we are in possession of better 
information about the changes that take 
place in a child’s behavior over varying 
periods of time in the absence of special 
“training” or learning, it will be wise to 
run a “maturational” control group in 
experimental studies that extend beyond 
quite brief durations. Once more, there 
is no cookbook answer for how brief 
is “brief”; for the study of postado- 
lescent learning, important “matura- 
tional” changes would be unlikely over 
months, whereas, for the study of new- 
born learning, a control group would be 
necessary for an investigation that ran 
only for several hours. In the absence 
of reliable information on the safe dura- 
tion of a study, the researcher probably 
should err on the side of caution and 
arrange for a developmental control 
whenever he suspects that there may be 
“aging” effects. 
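
The arithmetic behind the “maturational” control is simple enough to sketch. The scores below are invented, and subtracting the control group's mean gain from the trained group's mean gain is offered as one plain way of separating the two influences, not as a prescribed analysis.

```python
from statistics import mean

# Hypothetical (before, after) scores; every value is invented.
trained = [(10, 18), (12, 19), (9, 17), (11, 20)]  # received training
control = [(10, 14), (11, 15), (9, 12), (12, 17)]  # only grew older


def mean_gain(pairs):
    """Average change from the first to the second measurement."""
    return mean(after - before for before, after in pairs)


trained_gain = mean_gain(trained)  # training plus "aging"
control_gain = mean_gain(control)  # "aging" alone
training_effect = trained_gain - control_gain

print("gain with training:   ", trained_gain)
print("gain without training:", control_gain)
print("gain left to training:", training_effect)
```

Without the control group, the whole of the trained group's gain would be credited to training, although in this invented case half of it appears in children who received none.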

The second research setting for establishing
R = f(A, S) statements belongs
in a class quite different from that
containing the experiment. Although
they aim at describing variations in behavior
as a function of environmental
change, the studies of variation in conditions
are almost always perfectly
analogous with R = f(A, P) studies and
share the design problems of this group.
Research on the effects of institutionalization,
on the impact of the pathological
mother, or on variation in
child-rearing practice is usually summarized
in the R = f(A, S) paradigm,
but the critical absence of the possibility 
of random assignment to subgroups 
makes these so-called “natural experi- 
ments” incomparable with the experi- 
ments already discussed. Take the case 
of infantile institutionalization as an ex- 
ample. Even if a “control” group of 
noninstitutionalized children is set up 
for comparison, there is usually no 
guarantee that the two groups will be 
drawn from the same basic population, 
nor is there usually any precise descrip- 
tion of the dimensions other than resi- 
dence in an institution along which 
they may vary from one another. This 
consideration represents one of the 
fundamental difficulties in studying 
variation in conditions—the establish- 
ment of relevant comparison groups. 
Whether our interest is in enriched 
versus deprived environments, in social- 
class differences in child-rearing proce- 
dures, or in the effects of early exposure 
to sex differences, we face the task of 
ascertaining the further differences that 
may exist among the groups under 
study. 

Nevertheless, if we recognize that
investigations of variation in conditions 
cannot be considered as experiments but 
rather must be understood as close 
cousins of R = f(A, P) studies, it is pos- 
sible to state several safeguards against 
serious error. It should be noted in 
passing that many studies of variation 
in conditions permit less ambiguous as- 
signment to groups than is possible in 
R = f(A, P) research. There are public
records of institutionalization, there are 
external criteria for the assignment into 
social class, there are pathological 
mothers who often remain pathological 
across the child’s development, and so 
on. But, once more, many of the most 
interesting conditions determining de- 
velopment are in the nature of “lost 
signs,” e.g., handling of toilet training
or feeding, parental tolerance for ag- 
gression, and early sibling interactions. 
For such cases, dependence must often 
be laid on parental retrospective report, 
with all the doubts that have been al- 
ready noted. Moreover, in the special 
instance of child-rearing practices there 
is evidence that parental responses tend 
to go together in clusters (Sears, et al., 
1957), making it difficult to assess de- 
termination, even if one is satisfied with 
the validity of retrospective report. 
For these several reasons, chiefly the 
difficulty of establishing initial com- 
parability and the problem of interpret- 
ing complex interactions of variables, 
there is sound reason for choosing a 
repeated-measurements design in the 
investigation of variation in conditions. 
Such longitudinal study offers no guar- 
antee that all research problems will be 
solved, but the thoughtful researcher 
will be made more tellingly aware of 
other variations that follow along with 
the one he is studying. To return to the
case of institutionalization, it is possible 
to perform a single-measurement study, 
observing at age 15 children known to 
have been institutionalized in infancy. 
There may be, however, a grave tempta- 
tion to make too much of the research 
outcome, a temptation that would be 
sharply reduced had these same chil- 
dren been seen as they entered the in- 
stitution and as they developed (usually 
in settings far different from the “nor- 
mal”) after they left institutional care. 
Longitudinal study is laborious and time 
consuming, but it exerts a calming in- 
fluence on the tendency to make simple 
what is very complicated in child de- 
velopment. Just as it was necessary to 
expand the statement of R = f(A, P)
generalizations, so is it proper to think
of statements drawn from “natural
experiments” as R = f(A, S_1, ..., S_n),
in which the environmental change 
on which the researcher is focused 
represents only one of an indetermi- 
nantly large number of variations in 
the environment of the child. We shall 
be more certain of our conclusions 
about environmental control of devel- 
opment to the degree that evaluation of 
these supposedly ancillary variations is 
possible. 

In summary, the most powerful state- 
ments for the control of behavior that 
can be made about developmental 
change are those that take into account 
environmental variation. In experi- 
mental settings permitting random as- 
signment to groups rigorous analysis is 
possible. In the commoner study of 
variation in conditions problems of 
comparability of groups and of complex
interactions arise in which an examination
much like that discussed in
relation to R = f(A, P) studies is required.
Oftentimes, if only as a safeguard
against oversimplification, the
longitudinal method is to be preferred
to a single-measurement design with
recourse to public record or retrospective
report.


Theoretical Principles in the
Study of Developmental 
Change 


The analysis of developmental change 
presented in the foregoing pages has 
bypassed a group of fundamental problems
in the study of children. Although
there is a gain in convenience when we 
talk about types of developmental gen- 
eralizations, such a classificatory scheme 
tends to hide the impact of preconcep- 
tion, or, more elegantly, of theory in 
the selection of research design and 
procedures. No psychologist starts his 
investigation of child behavior with- 
out an intricate set of assumptions, 
guesses, and prejudices about techniques 
and about what is important in child 
behavior. There are, for example, sev- 
eral kinds of “longitudinal” studies— 
what determines the selection of one 
rather than another? There is an in- 
finite variety of behavior that could 
be fitted to the developmental formula 
—what are the grounds for selecting 
among them? There are many different 
ways of organizing developmental find- 
ings—how do these variations influence 
the design of research? Perhaps because 
child behavior can be so readily de- 
scribed in “natural” terms and because 
its study has long been dominated by
a purely empirical orientation, insufficient
attention has been given to the
operation in developmental research of 
theoretical principles. No brief survey 
can encompass the complexities of 
theorizing, explicit and hidden, that 
limit the study of children, but an 
attempt can be made to review some
of the factors in the design of research 
that lie behind the statement of em- 
pirical generalizations. 


GOALS OF THE RESEARCH 


Exploration. The simplest definition 
of a research program in child psychol- 
ogy derives from an interest in “seeing 
what happens” to some aspect of be- 
havior over age. Such an interest may 
arise from casual observation, from a 
hunch about the way children develop, 
or from an articulated theory, but the 
aim of the researcher is not a systematic 
and detailed statement of a functional 
relationship; he is concerned rather with 
a preliminary survey, or exploration, of
some response system. The establish- 
ment of a natural history of child 
behavior on the basis of such explora- 
tion has much to recommend it; the 
changes we see as children grow are 
so various and complicated that a primi- 
tive taxonomy of behavior provides 
initial categories that we may use in 
building more refined treatments. Over 
the years, a substantial part of developmental
research has been of this kind,
dominated by studies of physical 
growth and of test performance. 

The use of exploratory survey is not 
limited to any particular aspect of 
child behavior; whether one is interested
in newborn behavior, in the effect of 
nursery school, in the psychological 
changes that occur in adolescence, or 
in social-class variation in rearing pat- 
terns, it is usually appropriate to seek 
out the edges of the problem by an 
exploratory survey. 

For most of the studies that are de- 
signed to provide general guidelines for 
later intensive study, the cross-sectional 
method is almost ideally suited. For 
a relatively small expenditure of time 
and effort, the general character of a 
developmental pattern can be discerned. 
For example, if a psychologist is in- 
terested in the behavior of children 
when their mothers become pregnant 
again, he can reasonably set up an inter- 
view schedule or an observational pro- 
cedure to study their behavior. His 
comparison group could be obtained 
from children whose mothers are not 
pregnant, perhaps with the precaution 
of matching this exploratory “control” 
group with the first on age and sex, 
but, by and large, the exploratory re- 
searcher organizes his study to show up 
relatively gross effects, leaving a more 
subtle analysis for later on. Similarly, 
the psychologist who is interested in 
children’s handling of number concepts 
might profitably survey this area by 
observing at more or less the same time 
groups of children in different school 
grades. He probably would not gain 
from such an exploration a precise state- 
ment of stages of change or of details 
of development, but he would have 
put some helpful limitation on his next 
try, and he would have learned a great 
deal about techniques of study, changes 
in aspects of the child’s behavior other 
than the use of numbers, and so on. 
In general, for the purpose of staking
out an area of child behavior for
study, the cross-sectional method usu- 
ally has an advantage over the repeated- 
measurements method in permitting 
the rapid collection of data. This ad- 
vantage is, of course, much greater 
when the behavior under study changes 
across a wide span of time, e.g., develop- 
ment of fears, changes in attitude 
toward parental authority, and de- 
pendence on peers. Longitudinal study 
is normally too expensive and too haz- 
ardous to be optimally effective in the 
rough exploratory assessment under 
discussion here. For some problems 
noted earlier, there is no alternative to 
repeated-measures investigation, but it 
is generally the case that such problems 
have less ambiguous definition, i.e., they 
are not simply exploratory. It is un- 
fortunate that the notion of longitudinal 
study has become for many psychol- 
ogists too closely identified with ill- 
defined attempts to collect as many data 
over as long a developmental period as 
possible. The repeated-measurements 
method by no means demands such 
omnivorous planning and is probably 
badly suited for it. Moreover, as we 
shall see later, there is too much that can 
be observed and described in human be- 
havior to set out on a study of develop- 
mental phenomena without some def- 
inition of intent. When this definition 
is limited by a general interest in some 
developmental behavior but without ex- 
plicit hypotheses concerning subpopula- 
tion variation or the details of change 
(e.g., most “exploratory” studies of the
simple R = f(A) sort), then the researcher
should look to cross-sectional
study to provide the rough outlines of
a phenomenon. When the information
obtained in this way is analyzed and 
used as a source of more specific hy- 
potheses, the researcher may proceed to 
a longitudinal analysis with more con- 
fidence and with a markedly increased 
chance of finding significant relation- 
ships. 

Variation Study. Once the over-all 
picture of a developmental sequence has 
been drawn by cross-sectional survey, 
the psychologist’s interest often turns 
to a closer analysis of variation in the 
behavior. He sets up subpopulations for 
the study of differences within his age
groups, the research becomes more focused,
and, although the researcher cannot
anticipate his findings in detail, he
knows at least the kinds of observations 
he should make. Two kinds of specifica- 
tion are usually involved here—the more 
precise definition of the observations 
of child behavior to be made and the 
selection of subpopulations on P or S
dimensions. Thus in the case of the 
child’s responses to maternal pregnancy 
the researcher may decide from his 
earlier survey that a sensitive index of 
change in response is the child’s concern 
with his own bodily functions; anal- 
ogously, the researcher may have a 
suspicion that sex and age will influence 
the character of the child’s behavior 
markedly. As a result of these re- 
finements of his original interest, he 
may fruitfully set up a study that is 
limited to observation of a particular 
set of responses in groups of subjects 
selected on explicit criteria. 

When the child psychologist has put 
some limitation on his goals, he faces 
once more the choice between longi- 
tudinal and cross-sectional study. It will 
be clear from what has already been
noted that no simple rule of thumb can 
dictate a universally satisfactory answer. 
However, it can be maintained that 
variation study represents the most ef- 
fective present-day use of longitudinal 
study. In research on physical growth 
(Tanner, 1955), on test performance 
(Hofstaetter, 1954), and on the stability 
of behavior (Morris et al., 1954) the 
problems of repeated-measurements 
are reduced by the existence of a
primitive but important theoretical
stance—the concentration on a limited 
set of observations and on limited sub- 
populations of subjects. With research 
specified in this way, the advantages
that come from studying
“matched” subjects can be claimed.
These advantages, it must be remembered,
are often quite striking. If it turns
out, for example, that dependent chil- 
dren become more dependent during 
mother’s pregnancy and that inde- 
pendent children become more in- 
dependent, then cross-sectional study 
can tell you only that the variance of 
dependency behavior changes, never 
that there is a systematic direction in the 
change. Similarly, and doubtlessly of
even greater importance, having the 
same children under observation over a 
period of time increases the likelihood 
that the researcher will notice relation- 
ships that would escape him in cross- 
sectional study. To return to the “ef- 
fects-of-pregnancy” study for another 
illustration, most children may show no 
systematic response to maternal preg- 
nancy but a small subgroup may show 
marked behavior change. Ordinary 
statistical analysis may not demonstrate 
a difference when the full groups are
compared, but the longitudinal researcher
has a chance to look back to his
earlier observations to determine 
whether there is anything systematically 
different about the earlier behavior of 
overresponders. 
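
The dependency illustration can be put in numerical form. What follows is a minimal sketch on simulated scores, not data from any study; the function names, the scale of the scores, and the rule that each child moves half again as far from the group mean are all assumptions made for the illustration. The group mean and spread stand for what a cross-sectional comparison could report; the per-child changes stand for what only repeated measurement of the same children can supply.

```python
import random
import statistics


def simulate(n=200, seed=0):
    """Simulate hypothetical dependency scores before and during pregnancy."""
    rng = random.Random(seed)
    before = [rng.gauss(50, 10) for _ in range(n)]
    mean_before = statistics.mean(before)
    # Assumed rule: every child moves further from the group mean, so
    # dependent children rise and independent children fall.
    after = [score + 0.5 * (score - mean_before) for score in before]
    return before, after


def summarize(before, after):
    """Return group-level summaries and per-child change by initial standing."""
    mean_before = statistics.mean(before)
    changes = [a - b for b, a in zip(before, after)]
    high = [c for b, c in zip(before, changes) if b > mean_before]
    low = [c for b, c in zip(before, changes) if b <= mean_before]
    return {
        # Visible to a cross-sectional comparison of two age groups:
        "mean_before": mean_before,
        "mean_after": statistics.mean(after),
        "sd_before": statistics.stdev(before),
        "sd_after": statistics.stdev(after),
        # Visible only when the same children are measured twice:
        "mean_change_high": statistics.mean(high),
        "mean_change_low": statistics.mean(low),
    }


if __name__ == "__main__":
    before, after = simulate()
    for name, value in summarize(before, after).items():
        print(f"{name}: {value:.2f}")
```

Under these assumptions the group mean does not move and only the standard deviation grows, while the two per-child averages carry opposite signs; the systematic direction of change appears only in the matched records.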

This susceptibility to after-the-fact 
analysis is one of the chief merits of re- 
peated-measurements study. In an in- 
vestigation of psychological changes 
during adolescence one may find, during 
the third year of study, that several of 
the children show autistic symptoms. 
This behavior in a sense defines a new 
group for study, and the researcher 
with observations on the same children 
over the two preceding years may ex- 
plore his material to determine what 
earlier data might be related to the 
development of autistic behavior. It 
should be stated at once that this gain 
for longitudinal study carries with it a 
serious danger. If there is a sizable
quantity of information relating to a
group of children studied over a period 
of time and if the investigator examines 
it in a post hoc search for regularities, 
he may very well find them and they 
may very well be fortuitous. The hy- 
pothetical researcher’s autistic adoles- 
cents may have been unique in that they 
were early-maturers, lost a parent be- 
fore age 15, and had only one sibling, 
a pattern that a skillful psychologist 
could work up into a plausible “ex- 
planation” for later autism. Here we 
run against a perennial problem in psy- 
chology at large and child psychology 
in particular. Our theoretical principles 
are often flexible enough to encompass 
rather diverse findings, and in all in- 
nocence we may be led to explanations 
that are no more than personally satisfying.
The major safeguard to be built
against this danger of “discovering”
merely accidental relationships in post
hoc longitudinal analysis is replication
or cross-validation. The researcher who 
notices discriminating characteristics 
of his autistic adolescents cannot stop 
there; he must proceed to check this 
profile of characteristics in a new group 
to determine how stable the relationship 
is. 
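
The danger and its remedy can both be shown in a small simulation. This is a sketch under stated assumptions: every earlier measure is pure random noise by construction, the sample sizes are arbitrary, and the function names are invented for the example. The search picks the earlier measure most strongly related to the later outcome in the original group, then checks that same measure in a freshly drawn group.

```python
import random
import statistics


def correlation(xs, ys):
    """Pearson product-moment correlation of two equal-length lists."""
    mx, my = statistics.mean(xs), statistics.mean(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / (sxx * syy) ** 0.5


def make_sample(rng, n_children, n_measures):
    """Draw an outcome and a set of earlier measures, all unrelated noise."""
    outcome = [rng.gauss(0, 1) for _ in range(n_children)]
    measures = [
        [rng.gauss(0, 1) for _ in range(n_children)]
        for _ in range(n_measures)
    ]
    return outcome, measures


def post_hoc_search(seed=1, n_children=30, n_measures=40):
    """Find the 'best' predictor after the fact, then test it in a new group."""
    rng = random.Random(seed)
    outcome, measures = make_sample(rng, n_children, n_measures)
    rs = [correlation(m, outcome) for m in measures]
    best = max(range(n_measures), key=lambda i: abs(rs[i]))
    # Replication: the same measure, observed in children not yet examined.
    new_outcome, new_measures = make_sample(rng, n_children, n_measures)
    replicated = correlation(new_measures[best], new_outcome)
    return best, rs[best], replicated


if __name__ == "__main__":
    best, found, replicated = post_hoc_search()
    print(f"measure {best}: r = {found:.2f} in the original group")
    print(f"measure {best}: r = {replicated:.2f} in the new group")
```

Because the largest of many chance correlations is selected, the value in the original group will tend to look impressive even though nothing real lies behind it; the value in the new group is an unselected estimate and is the one to trust.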

The replication safeguard against 
error in post hoc analysis is an elemental 
principle of competent research, and 
the failure of child psychologists always 
to respect it may be understood by con- 
sideration of two points. First, the in- 
vestigator who has invested a great 
deal of time and effort in the collection 
of his longitudinal data will be humanly 
reluctant to begin again before he has 
stated publicly the results of his work. 
Most researchers are careful to note the 
absence of replication and the danger 
of inappropriate generalization, but the 
promised validation often does not take 
place. The second deterrent to carefully 
designed replication of findings from 
longitudinal (or, occasionally, cross- 
sectional) analysis is subtler and more 
persuasive. Having “found” a relation- 
ship, it is altogether too easy for the 
researcher to construct a convincing 
explanation, and his achievement of this 
step will tend to make doubts and 
potential criticisms less obvious to him. 
Moreover, in writing up his results, 
his commitment to a particular under- 
standing of his data may frequently 
lead to a further constriction in the 
range of alternative explanations. Noth- 
ing in this critique should suggest that 
psychologists set out to present and
defend dubious relationships, but we
should recognize with great clarity 
the pitfalls that grow from the human 
desire to impose order on a set of data. 
The sole guarantee against error of this 
variety is the establishment of a new 
test or the examination of unanalyzed 
data to determine stability of a relation- 
ship. 

With these cautions in view, the
place of longitudinal study of variation
can be better assessed. It provides
precise assessment of change, it supplies 
the researcher with suggestions for 
further study, and it permits post hoc 
developmental analysis of phenomena 
that appear in the course of study. 
These are striking advantages over 
single-measurement techniques, and the 
longitudinal method will be the method 
of power in studying variation to the 
degree that precautions are taken against 
improper generalization. 

Testing of Explicit and Detailed 
Hypotheses. Realistic examples are hard 
to come by in our present stage of 
knowledge of children, but it is hoped 
that one day child-development research 
can set about the testing of quite precise 
hypotheses. When clear-cut predictions 
can be stated, much of the objection to 
the cumbersomeness and inefficiency of 
longitudinal study will be answered. For 
example, if the psychologist can state 
an explicit and detailed hypothesis about 
the relationship between behavior at 
weaning and separation problems in 
school, he will be able to make two sets 
of observations widely separated in 
time, without the need to record every- 
thing he can about the behavior of his 
subjects and their families in the interval 
between observations. The analogue of
this procedure is the study of learning
in animals and adults. A very limited 
number of observations of behavior are 
made, and it is clear in advance what 
will constitute a confirmation of the 
hypothesis. We still will not usually 
have the central advantage of random 
assignment available to learning studies, 
and most developmental research will 
spread out over much longer periods of 
time than are typical in animal work, 
but the advance over our present cir- 
cumstances is clear. The powerful use 
of longitudinal methods in the testing
of explicit hypotheses demands not only 
the continued careful collection of data 
but also the building of appropriate 
and meaningful theories of child devel- 
opment. 


SELECTION OF RELEVANT 
OBSERVATIONS 


At no point in the design of child- 
development research does the role of 
guiding principles bulk larger than in 
the selection of what behavior to ob- 
serve. Decisions in this regard are by no 
means routine or obvious; it is necessary 
only to reflect on the infinite variety of 
observations that could be made to rec- 
ognize the operation of selective rules. 
The behavior of human beings can be 
classified in many different ways, with 
the psychologist isolating one aspect 
or another of responding, and it is plau- 
sible to believe that many of these be- 
havior categories will show change 
over age. Some hint can be had of the 
range of potential observations by examining
the brief history of child psychology.
Compare Stern’s (1924) treatment
of infancy with Freud’s (1905),
Koffka’s (1925) observations of development
with Gesell’s (1941), or a “classical”
text such as Karl Bühler’s (1930)
with a recent text such as Hurlock’s 
(1956) or Mussen and Conger’s (1956) 
and the message of variety will become 
almost unbearably clear. If child psy- 
chology is in a natural-history phase of 
its growth, one may suspect that our be- 
havioral botanists are looking at quite 
different species, and it will not be 
surprising if they come up with quite 
different conclusions about the flora. 
This is to recommend neither a shift in 
research on children toward “rawer
data” nor a limitation of research to that 
derived from available theoretical posi- 
tions. The first is regressive—it is im- 
possible to observe all behavior—and 
the second is at best premature; but 
there may be merit in attempting to 
sketch out some of the principles that 
determine the selection of particular 
observations. 

Selection Based on Natural Occurrence.
In some ways, the problems of
a science are not selected—they are in- 
herited. Particularly in the early decades 
of systematic study the major de- 
terminant of what constitutes important 
or relevant observation is Everyman's 
opinion. Landowners set the first prob- 
lems for geometers, gamblers set the 
first problems for statisticians, and
parents set the first problems for child
psychologists. Will Johnny be ready
for second grade in September? Why is 
he so hard to train? Why is he so much 
like his father? Do all boys like to 
wrestle? Why can't he read? Over the 
years, there is a tendency for the sci- 
entific discipline to draw back from 
such practical problems to an increasing
interest in the general statement of
issues and the precise formulation of 
research problems, but child psychol- 
ogy is still relatively close to a “natural” 
or common-sense definition of its aims. 
Parents and child psychologists remain 
interested in discipline, language devel- 
opment, school success, psychopathol- 
ogy, and a host of other problem 
areas that are practical in their defini- 
tion. Orientation toward issues of this 
order seems to produce a particular 
brand of research, which has as its chief 
functions classification and diagnosis.
In order to assess prospects for school 
success more adequately, it is necessary 
to develop standard assessment tech- 
niques; an understanding of psy- 
chopathology requires a system of 
categories and reliable routines for 
assignment into these categories; and 
so on.

The psychologist is called on to refine 
and extend categories or clusters of 
behavior that already have a primitive 
statement in the psychology of the 
layman. It is interesting to note, too, 
that our first ways of conceptualizing 
research problems derive in large meas- 
ure from common sense or what has 
been called “protopsychology” (Mandler
and Kessen, 1959). For example,
early studies of language development 
took as their categories of description 
the grammarian’s classification into 
nouns, verbs, and so on, a procedure 
that inadequately reflects fundamental
changes in verbal behavior (Brown,
1958; Skinner, 1957). Similarly, classical 
studies of infantile emotion borrowed 
from common speech the distinctions of 
“rage” and “fear” that were held to be 
somehow fundamental. 


There are several consequences of 
this early dependence of systematic 
child psychology on the child psychology
of the street and the home. First,
there are so many immediately im- 
portant problems delivered by a practi- 
cal view of child behavior that our 
energies could be consumed for many 
years in the work of making them more 
readily susceptible to systematic study. 
Second, there is no clear evidence that 
a practical definition of any scientific 
discipline is the most strategic program
in the long haul; there is, if anything,
evidence in the history of other sciences
to suggest that this is a very inefficient
procedure and that crucial extensions of
our knowledge take place when a disconnection
or detachment from the obviously
practical takes place. Third, the
split allegiance of child psychologists, 
on one hand to the job of helping 
parents and teachers, on the other hand 
of establishing reliable and meaningful 
general statements about child behavior, 
has not permitted a sharp definition of 
our goals. This is not to say that we can
or should turn our backs on the realities 
of child behavior in a free-living state; 
much of our productive work will 
grow from a recognition of practical 
distinctions and systematic attempts to 
account for them. But it should be 
equally clear that we cannot stop at the 
boundaries of “natural” description and 
explanation; the innovations of Freud, 
Watson, and Piaget are convincing 
evidence that the dimensions of child 
behavior which provide for stable and 
important prediction need not be the 
dimensions of common sense. In any 
case, the work of classification and 
diagnosis of child behavior will continue
to make up a sizable chunk of
developmental research; it will be supportive
of long-range scientific goals
as long as child psychologists are alert to 
its limitations and keep their sights on 
statements of regularities and on ex- 
planatory notions that do not remain 
limited by “natural occurrence.”

Selection Based on a Lawful Relationship
to Age. Among the criteria
that may be suggested for the selection 
of observations, there is one which has 
high standing in the canons of science:
the regularity or lawfulness of occurrence.
The search for stability in the
behavior of children goes beyond the 
criterion of natural occurrence and puts 
the question: What behavior is con- 
sistently related to age, to some other 
characteristic of the child, or to some 
characteristic of the environment? Con- 
ceived most broadly, this is a brief state- 
ment of the basic scientific aim of 
establishing supported generalizations 
and will be seconded by almost all psy- 
chologists. Nonetheless, the existence in 
child psychology of a single, easily 
determined measure—age—has made 
the program of research resulting from 
this fundamental premise somewhat 
biased. Behavior came under investigation
not because of a criterion of significance,
whether based on common
sense or on general theoretical con- 
siderations, but rather because it seemed 
to have some systematic relationship 
with age. Take, as an example, the 
excellent research on grasping patterns 
by Halverson (1931), in which there 
is presented in rich detail the sequence 
of manual positions that children use 
to pick up small objects. Why were 
these particular observations chosen for
study? The most reasonable answer
would seem to be that a steady and
consistent progression in these responses
can be detected in most children, i.e.,
the behavior is lawfully related to
chronological age. A researcher interested
in grasping in young children
might alternatively have observed speed
of response or success in picking up
small objects, or he may have concentrated
on subpopulation variation in the
behavior of grasping. 

Such hypothetical studies may have 
as much significance as the sequential 
motor analysis of Halverson, but they 
would not have the face validity that 
has come to be ascribed to any child- 
development research that states a re- 
lationship to chronological age. Again, 
it is a nicely balanced conflict for the 
researcher looking toward a future sys- 
tematic child psychology. We should 
have as much information as can 
feasibly be had about the regularity of 
changes in behavior over age, but this 
is not the same thing as maintaining 
that the collection of age-response regu- 
larities will lead us to a fruitful set of 
general principles. On the contrary, the 
existence of a “safe” area of research, in 
which the psychologist can be rel- 
atively certain that most behavior will 
change with age, may tend to reduce 
the research commitment to novel or 
uncertain hypotheses. The trend away 
from simple age-response research prob- 
ably indicates a consensus among child 
psychologists that effective generaliza- 
tions will not grow automatically from 
the establishment of lawful relation- 
ships with age. 

In the field of infant testing it is 
possible to see in more detail the consequences
of depending too exclusively
on a criterion of regularity in the selection
of observations. It is known that
the correlations between tests of early
development and later tests of intelligence
are remarkably low (Maurer,
1946). An examination of the pro- 
cedures used in selecting items, i.e., in 
selecting relevant observations, will 
show why this negative finding should 
be no great surprise. Tests of in- 
telligence among school-age children 
are set up on the basis of their relation- 
ship to some external criterion, usually 
school performance, with an introduc- 
tion of new observations to ascertain 
the most effective way of predicting 
from the test to the external criterion. 
Infant tests, on the other hand, are built 
largely on the basis of lawful and re- 
liable relationships with chronological 
age. This is by no means their sole 
justification—designers of infant tests 
certainly mean them to be somehow 
representative of the child’s everyday 
behavior—but they are different from 
tests of older children in that they are 
not usually related to an external 
criterion. There is, in a word, no school
performance that would permit us to 
judge whether or not the 2-year-old’s 
intellectual development is adequately 
reflected in his test score, and to include 
two such different procedures under 
the descriptive label “test of develop- 
ment” is hardly justified. The point of 
importance here, however, is the ready- 
made trap that exists in studies of human 
development—the face-valid but some- 
times misleading dominance of chron- 
ological age as a variable. To put the 
argument in capsule form, a lawful
relationship with age is not a sufficient
criterion for the fruitful selection of
observations of child behavior; such
fundamental description is significant
when it is related to variables other
than age and to explicit theory.

Selection on the Basis of Available
Instruments. Still another ground for 
the selection of observations of child 
behavior is the availability, usually from 
general psychology, of particular re- 
search instruments. The most obvious 
example of this criterion is the extension
upward and downward of early
intelligence tests; an instrument found 
useful for school-aged children was 
applied, with more or less sophistication, 
to adults and to younger children. The
result of this particular extension of a
technique has been productive of im- 
portant advances in our knowledge of 
human development; it is too early to say 
whether current research with projec- 
tive tests and lever boxes will also im- 
prove our understanding of child 
behavior. What is relevant in the present 
context is the recognition that applica- 
tion of a procedure established for the 
study of adults or animals does not 
guarantee important findings in chil- 
dren. It goes without saying that there 
is no legitimate justification for con- 
sidering extensions of technique im- 
proper or unscientific; it does deserve 
mention that their scientific utility cannot
be justified merely because the techniques
are available.

Selection Based on Explicit Hy- 
pothesis. The criteria reviewed in the 
foregoing paragraphs have the function 
of putting some definition on the infinite 
variety of description possible in the 
observation of children’s development.
They rarely operate singly, and they 
certainly are not so “pure” as the 
present treatment makes them sound, 
but they outline a rough classification 
of starting points for the systematic ob- 
servation of children. Often these 
criteria are supplemented by yet an- 
other—an explicit statement of theoreti- 
cal principles. This is the ideal goal of 
systematic study; observations are made 
not because the behavior happens to be 
in view but rather because the re- 
searcher has an integrated set of general 
principles, which prescribe in a clear- 
cut fashion the kinds of observations it 
would be most strategic to study at 
any particular moment. Like most 
human ideals, this one is not often 
achieved, but it takes only a casual 
look at the history of child psychology 
to see the revolutions that were in- 
troduced into the field by the specula- 
tions of Freud and Watson. Of course, 
theories are not independent of prior 
observation, but it is striking what a 
difference is made in the distribution of 
energy and the organization of research 
in a field when the opportunity is pre- 
sented to test a hypothesis drawn from 
a wide-reaching set of principles. In 
child psychology today we are so used 
to the infusion of psychoanalysis and 
learning theory that it is difficult to con- 
ceive what the discipline would be like 
without the structure laid down by 
these two general systems.¹


¹ There are others, of course; theoretical
settings of importance for child study have
been proposed by members of the Gestalt
group, by Piaget, and by contemporary physiological
psychologists, to name only a few.

The technical advantage that comes
from explicit hypotheses in delimiting 
the scope of observation has already 
been mentioned; it will be seen later 
that this is no mean gain in solving the 
practical problems of the developmental 
researcher. Beyond this, several further 
supports for theory-directed research 
can be stated. The first has to do with 
the broad scientific question of interre- 
lationships among empirical findings. 
An explicit set of hypotheses permits 
the researcher to set out on a program 
of study, with the end in view of bring- 
ing his data together in a coherent 
statement. Traditional studies of child 
development may be liable to the charge 
that they lack organization and present 
a failure to establish linkages between 
one set of relationships and another.
We know a great deal about the de- 
velopment of children; what sometimes 
appears to be missing is a conceptual 
framework on which this knowledge 
can be ordered and extended. Clearly, 
the achievement of this goal will not 
occur merely because someone pro- 
poses a speculative set of general state- 
ments, but such a set does tend to give 
direction to research, to aid the search 
for data that can reasonably be related 
to other data, and to prescribe in a more 
or less precise way what the appropriate 
next step in research should be. 

Corollary to the chief gain from the 
organization of study which a set of 
principles imposes are several somewhat 
less obvious advantages. Commitment 
to a particular theory (and it seems at 
the present stage of child psychology 
not to matter a great deal which theory) 
binds a group of researchers together,
providing the support of communica- 
tion and exchange that makes a sub- 
stantial difference in the probability 
of doing meaningful research. Shared 
commitment may also increase the like- 
lihood of replication of observations, 
with a corresponding increase in the 
assurance we have about the stability of 
our findings. 

Another gain from theory-directed 
research in child development lies in 
the possibility that it will move child 
psychology more rapidly into a sig- 
nificant relationship with the rest of the 
behavior sciences. Whether child psy- 
chology is a discipline unto itself, re- 
quiring its special techniques and 
special theories, is in large measure an 
empirical question, but it is a strategy 
consistent with the usual growth of 
science to search out the ways in which 
the study of children may be meaning- 
fully related to the study of animals 
and human adults. The bridges among 
the divisions of psychology will be built 
on general theory; without prejudicing 
the issue of whether they will take the 
weight they are constructed to carry, it 
seems a legitimate part of our job to 
sketch out working plans. 

In short, the selection of observations 
in the study of children is by no means 
an automatic or obvious procedure. The 
researcher will be confined by limita- 
tions ranging from the casual to the 
highly elaborated, and these limitations 
will determine in part the kind of child 
psychology he will produce and the 
decisions he will make in regard to the 
design and procedures of research. It 
may reasonably be expected that re- 
search presentations would profit from 
an explicit statement by the investigator
of the grounds he has used for the selection
of a particular set of observations.


THE ROLE OF THEORETICAL
CONSTRUCTS

Even though there is available no 
generally accepted and comprehensive 
theory of child development, child
psychologists use a large number of
theoretical constructs in their reports
and discussions. Terms like “aggression,”
“dependency,” and “emotion” do
not merely give names to the investigator’s
operations; they go beyond simple
operational status and become involved 
in statements of assumed relationships
or hypotheses. Two important notions
of this sort are widely used in the literature
of child psychology: “maturation”
and “developmental stage.” Although 
both terms are useful to the psychol- 
ogist, permitting him to talk about some 
research findings in an abbreviated 
fashion, it is important in the present 
context to note that they are not mere 
descriptive names, that is, labels for 
some observable aspect of child be- 
havior. In order to use the terms ap- 
propriately, one must know something 
about the theoretical principles in which 
the words occur (Hebb, 1958). Much 
of the contention between the matura- 
tional and environmental positions in 
child psychology does not concern facts 
but rather the interpretation and use of 
facts. We know that all normal children 
smile, but it is a theoretical presupposi- 
tion that leads us to call the behavior 
maturational or learned, and we will do 
different kinds of research on smiling 
according to this presupposition. 

An argument of similar force can be 
applied to the construct developmental
stage. The issues involved in discussing 
psychosexual stages (Freud, 1905) or 
stages of intelligence (Piaget, 1952) are 
not designed to determine only that 
child behavior can be categorized into 
clusters of responses that change with 
age; there is also at issue an understand- 
ing of how the “stage” notion fits into 
Freud’s and Piaget’s over-all theory of 
child behavior (Wolf, 1957). 

Briefly, the point under discussion 
here is that the words a child psychologist
uses in talking about his results
are not conventional names for aspects 
of child behavior; they are words which 
call attention to observations, but they 
also function in theoretical principles. 
We shall fully understand the use of 
maturation, developmental stage, and 
many other terms in the language of 
child psychology only when we know 
their place in hypothetical generaliza- 
tions as well as their relation to em- 
pirical operations. This double aspect 
of our most important constructs sup- 
ports once more the need for a public 
expression of preconception and as- 
sumption in the study of children. 


Practical Problems 


The psychologist familiar with the 
realities of child development research 
will recognize that the distinctions 
about statement-form and theory elab- 
orated earlier do not represent fairly 
the workaday business of studying chil- 
dren. At almost every stage of a re- 
search project there arise problems of 
a technical or practical sort, problems 
that must be tackled and solved before
the investigation can be brought to a
successful conclusion. In treating some 
of these problems in the following 
pages, recourse is often had to psycho- 
logical lore rather than law, with par- 
ticular emphasis on the longitudinal 
method and its sometimes peculiar prop- 
erties. Moreover, because the line be- 
tween practical and theoretical is a thin 
one indeed, there is occasional reference 
back to some of the questions raised 
earlier. In general, however, the discus- 
sion here assumes that the researcher 
has solved his theoretical and financial 
problems and now needs only to collect 
good data. 


SELECTING AND MAINTAINING 
A SAMPLE 


Beyond the issue of representativeness 
raised in the treatment of sampling 
procedures there arise several further 
problems in setting up and maintaining 
a group of subjects for developmental 
research. One of the toughest of these 
problems lies in determining the char- 
acteristics of the subjects who are not 
in the study—the ones who got away. 
If, for example, a research is set up to 
study patterns of mother-child interac- 
tion in the first years, the researcher will 
normally recruit from a group of 
pregnant women or new mothers. 
Whatever technique is used to enlist 
a sample, there will almost invariably 
be some subjects who refuse to partici- 
pate or who drop out of the study later 
on. How does this reduction in sample 
affect one’s conclusions? Often there 
exists no feasible way to tell whether or 
not such “drop-outs” are different from 
the maintained sample, but, at the very
least, the researcher should keep care-
ful records of the proportion of such
losses and keep his eyes open for any in- 
dication of a systematic variation be- 
tween the subjects who remain in the 
study and those who leave it. Occasion- 
ally, if his maintained sample shows 
some apparently unusual characteristic, 
e.g., a high proportion of feeding prob- 
lems, he may be able to track down 
his drop-outs to get at least a superficial 
picture of how their children have been 
developing. 

When there is a large sample under 
study and the proportion of drop-outs 
is low, this problem is probably trivial, 
but intensive longitudinal study often 
requires restriction to a relatively small 
number of cases, so that each refusal or 
loss may count a great deal. Under these 
circumstances, it is important in inter- 
preting the results of repeated-measure- 
ments research to know as much as 
possible about the rate and circum- 
stances of attrition in subject popula- 
tion. 
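
The record keeping recommended above can be sketched in a few lines. The following is only an illustration under stated assumptions: the `Subject` fields, the function name `attrition_summary`, and the five invented records are hypothetical and do not come from any study. It tallies the proportion of refusals and drop-outs and compares one early characteristic (here, a feeding problem) between families who stayed and families who left.

```python
from dataclasses import dataclass


@dataclass
class Subject:
    subject_id: str
    enrolled: bool         # False if the family refused at recruitment
    dropped_out: bool      # True if the family left after enrolling
    feeding_problem: bool  # hypothetical characteristic noted at first contact


def attrition_summary(subjects):
    """Summarize refusals, drop-outs, and one characteristic by retention status."""
    approached = len(subjects)
    enrolled = [s for s in subjects if s.enrolled]
    refused = [s for s in subjects if not s.enrolled]
    lost = [s for s in enrolled if s.dropped_out]
    retained = [s for s in enrolled if not s.dropped_out]

    def rate(group):
        # Proportion of the group showing the characteristic; None if the group is empty.
        return sum(s.feeding_problem for s in group) / len(group) if group else None

    return {
        "refusal_rate": len(refused) / approached if approached else None,
        "dropout_rate": len(lost) / len(enrolled) if enrolled else None,
        "feeding_problem_rate_retained": rate(retained),
        "feeding_problem_rate_lost": rate(lost),
    }


# Invented records, for illustration only.
records = [
    Subject("A", True, False, True),
    Subject("B", True, False, True),
    Subject("C", True, True, False),
    Subject("D", True, False, False),
    Subject("E", False, False, False),
]
summary = attrition_summary(records)
```

With samples as small as those of intensive longitudinal work, a difference between the two rates is a signal to look further, not a test of anything.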

The occurrence of attrition in longi- 
tudinal study samples poses one of the 
most interesting questions in develop- 
mental research, and one which is of 
great practical implication: Why do 
people enter and stay in a long-term 
child development research project? 
When no special demands are made on 
parents or children (e.g., when observa-
tions take place in school), there is no
difficulty in understanding the coopera-
tion of subjects, but what of those
studies in which the parent must bring
her child to the researcher, in which
home visits are made routinely, or in
which the child must be seen for rel-
atively long sessions? No systematic
data are available on this point, but
several answers can be suggested if it 
is recognized that wide variation will 
exist from one group of subjects to 
another in the weighting of motivations 
for participation in a research project. 

Some parents will agree to enter a 
longitudinal study because of a con- 
viction that research with children is 
important. It is unlikely that this is a 
major argument for more than a small
proportion of parents, but the current
Zeitgeist clearly supports an interest in
the scientific study of development, at 
least in the community of well-educated 
parents. At a far remove from such in- 
tellectual commitment are the parents 
who participate in a project because of 
the gain for them in services. Free 
pediatric care may be provided, a reduc- 
tion in nursery school fees, or an op- 
portunity to consult on school 
problems, but in each case the partici- 
pating parent gets something from the 
research that he would not otherwise 
have had. 

Whatever role these two sources of 
support for participation may play, they 
clearly do not account for all cases 
of participation in longitudinal research 
and may not in themselves be enough 
to account for more than a few. To 
understand the willingness of parents 
to suffer what would normally be in- 
trusions into privacy and to put up with 
the unusual demands of psychologists, 
it is necessary to call on another group 
of factors. Different psychologists will 
use different terms in discussing these 
factors, but they seem to turn on some 
notion of “psychological support” or 
“rapport.” Parents are probably as 
puzzled by the behavior of their chil-
dren as psychologists are, and the op-
portunity to talk to an objective and
competent professional about their con-
cerns and their doubts is valuable to
many parents. The popularity of books
on child care and the range of questions
addressed to newspaper and magazine
columns about children give a rough
estimate of the hesitations and confu-
sions of the American parent. Given
a chance to clarify these confusions
with a child psychologist or pediatri-
cian, parents are often quite willing to
accept the demands of developmental
research.

If the foregoing observations are cor-
rect, then the successful maintenance
of an intensive longitudinal contact de- 
pends in large measure on the degree 
to which the researcher supports the 
need of the parent for reassurance, 
advice, and a listening ear. When such a 
relationship is established, there is reason 
to believe that parents will not only go 
along with the researcher but will ac- 
cept his questions and tests with good 
grace.*

* Apart from the reluctance of parents to
participate, a major source of attrition in
longitudinal study samples is residential in-
stability. The psychologist with free choice
would be wise to work in an area in which
there is low geographical mobility among his
potential subjects.

Another practical decision the re-
searcher must make in selecting his
sample has to do with the optimal num-
ber of cases. It is a fairly good rule of
thumb that longitudinal researchers at
the outset of their work plan to study
more children than they can possibly
manage. There are several correctives
to this tendency to overestimate one’s
ability to handle data collection. First,
it is wise to count, not children, but
occasions of observation. A sample of
ten babies may not seem very large, but, 
if one plans to see the child every month 
or so, to interview and test both parents, 
and to integrate whatever pediatric in- 
formation may be available, the number 
of observations made and the number 
of problems for data analysis that arise 
are enormous. Second, insofar as it can 
be arranged, detailed schedules of ob- 
servation and of data analysis should be 
laid out in advance. Only when the psy- 
chologist can see who will be available 
to test babies while someone else is 
interviewing mothers while someone 
else is analyzing the data already col- 
lected will he be able to make a rational 
estimate of optimal sample size. As has 
been the case in so many of the prob- 
lems raised in this chapter, decisions 
about the number of cases to be studied 
are the result of several considerations 
and no fixed answer can be given. 
Perhaps the only generalization that all 
child-development researchers would 
agree on is that one never overestimates 
the effort required to collect repeated- 
measurements data. 
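
The advice to count occasions of observation, not children, amounts to simple arithmetic. The sketch below is illustrative only: the function name, the two-year duration, and the figure of two parent sessions per child are assumptions chosen for the example, not values given in the text.

```python
def occasions_of_observation(children, months, visits_per_month=1,
                             parent_sessions_per_child=2):
    """Count planned contacts: repeated child visits plus parent sessions.

    All default values are assumptions made for illustration.
    """
    child_visits = children * months * visits_per_month
    parent_sessions = children * parent_sessions_per_child
    return child_visits + parent_sessions


# Ten babies seen about once a month for an assumed two years,
# with one interview-and-test session for each of two parents.
total = occasions_of_observation(children=10, months=24)
```

Under these assumptions a sample of ten babies already implies 260 separate contacts, before any pediatric records are integrated or any analysis is begun.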


CHANGES IN PROCEDURE 
DURING LONGITUDINAL STUDY 


In considering how to deal with 
changes in procedure during a longi- 
tudinal study of children, it must be 
remembered that repeated-measure- 
ments research can vary widely in scope 
and duration. For those investigations 
in which sharply limited aspects of 
behavior are observed, or in which the 
time span covered is brief, there usually 
are no major practical problems involv- 


ing revision of procedure. Such is not 
the case in longitudinal research stretch- 
ing over a long period of the child’s life, 
or that which is designed to study many 
facets of the child’s development. It is 
into the latter contexts that the present 
discussion fits; however, the issues raised
will apply mutatis mutandis to less
far-ranging longitudinal investigations.

Among the reasons why a standard
set of research procedures cannot be
used throughout a developmental re-
search program, one that ranks high
on the list is the need for flexibility in
following up suggestive relationships as
they appear in the course of study.
Longitudinal research serves not only
as a data-collection device but also as
a source of hypotheses for the re-
searcher. Although independent veri-
fication will normally be necessary to
confirm such hypotheses, one of the
ancillary advantages of “big” longi-
tudinal research is the opening up of
new areas for fruitful study. To make
optimal use of this characteristic, the
investigator will often have to leave
some ambiguity in the first definition
of his procedures.*

* It is necessary in this connection to em-
phasize the distinction between longitudinal
study as an exploratory search for relation-
ships and as a technique of confirmation. See
the earlier discussion of goals for develop-
mental research.

There is another compelling reason
for the occurrence of procedural shifts
in repeated-measurements study—the
absence of a single observational tech-
nique that is applicable across the de-
velopmental span. One may want to
record general activity or reflex re-
sponsiveness in the newborn, but such
observations will not provide adequate
measures of variability in the [?]-year-
old; character of social interaction with
peers may be an important dimension
of adolescent behavior, but it is less
likely to be discriminating in the study
of the toddler; and so on. Particularly
in the observation of the child from
birth to school-age, it is necessary to
change techniques used to gather data
as the child grows older. This con-
sideration poses a serious problem for
the understanding of development—
how are we to translate from one ob-
servational technique to another? To
put the question in another way, what
constitutes the “same” behavior at dif-
ferent ages? A 6-month-old is judged
to be active on the basis of observations
quite different from those used to assign
a 5-year-old a “high activity” score.
Similarly, the evaluation of quality of 
mother-child interaction will depend on 
markedly different criteria when the 
child is 1 year old and when he is 7. An 
irreducible problem of developmental 
research is that we must modify our 
data-gathering techniques as the child 
and his environment change. More- 
over, it is no solution merely to give the 
same name to the results of different 
observational procedures; an “anxious” 
2-year-old is not similar to an “anxious” 
third-grader because we use the same 
word to describe their behavior. 

The difficulty underlying shifts in 
observational procedure is not likely 
to be easily removed; solution will de- 
mand both a sharpening of our con- 
ceptualizations of development and a 
great deal of research comparing dif-
ferent observational modes. Perhaps the
potentially most profitable approach to
this issue is the overlapping of different
techniques; at an age at which two or
more procedures may be appropriate,
they can be used simultaneously or in
close succession to evaluate the relation-
ships among them (Bell, 1953). For the
area of developmental testing, there are
data of this sort available, and the care-
ful investigation of relationships among
tests (Maurer, 1946; Hofstaetter, 1954)
has provided leads for a better under-
standing of what goes on when we
modify our devices for collecting data
about developmental status or intel-
ligence. Related investigation is in order
for establishing correspondence among
different ways of assessing social re-
sponsiveness, quality of mother-child
interaction, emotional development, and
many other aspects of children’s devel-
opment. Short of empirical evidence
about rules for translating the results of
varying observational techniques, the
developmental researcher would seem
well advised to respect the operation- 
alist’s injunction against assuming the 
equivalence of different procedures. 
The tangle that results from being 
forced to change modes of observation 
in order to keep up with changes in 
the child will be teased apart only with 
the help of further empirical examina- 
tion of research methods. 
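
One simple way to examine two overlapping techniques is to correlate the scores they yield for the same children at the age where both apply. The sketch below is offered as an illustration, not as the procedure of the studies cited: the function name and the two score lists are invented, and a product-moment correlation is only one of several indices an investigator might choose.

```python
from math import sqrt


def pearson_r(x, y):
    """Product-moment correlation between two equal-length lists of scores."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two score lists of equal length, at least 2 cases")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("scores on one technique do not vary")
    return sxy / sqrt(sxx * syy)


# Invented scores for five children, each assessed by both techniques
# at an age where the two procedures overlap.
technique_a = [1, 2, 3, 4, 5]
technique_b = [2, 1, 4, 3, 5]
r = pearson_r(technique_a, technique_b)
```

A high coefficient gives some ground for treating the later technique as a continuation of the earlier one; a low coefficient is a warning against giving the two sets of results the same name.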


THE EFFECT 
OF THE INVESTIGATOR 
ON HIS SUBJECTS 


The relationship between a longi- 
tudinal researcher and his subjects is not 
simply an exchange between a source of 
data and a data-recording system. It is
a social interaction that can become,
in long-term studies, fairly intimate and
complicated. It was suggested earlier 
that subjects stay in longitudinal studies 
at least in part for the support the re- 
searcher provides as a competent and 
attentive professional. If this is the case, 
he will not remain totally inconse- 
quential in his effect on the participating 
parent. Some interviewers will be more 
successful than others in gaining a 
subject's confidence, some testers will 
be more frequently asked to “explain” 
the child’s behavior, and so on. Beyond 
this, child psychologists are probably 
unable to hide completely their convic- 
tions about appropriate rearing tech- 
niques behind a screen of impassive neu- 
trality. As the investigator gets to know 
the child and his parents, and as they 
get to know him, there arises the pos- 
sibility that they will change their “nor- 
mal” behavior or their reports to 
conform to his position, however mildly 
expressed. If this kind of distortion 
occurs at a high rate, then the validity 
of long-term longitudinal studies of de- 
velopment is in jeopardy. 

Yet in a sense the difficulty is in- 
soluble—the researcher cannot avoid 
asking his questions and making his ob- 
servations, and a refusal to discuss the 
parent’s problems may weaken the rap- 
port necessary to maintain an informa- 
tive contact. Some precautions can be 
taken, nonetheless; rather than pretend- 
ing to be above or beyond evaluative 
opinion about his subjects, it would 
appear wiser for the psychologist to be 
as explicit as he can to himself and his 
colleagues in the research about his 
prejudices and antipathies. In this way, 
he will be alerted to signs that his sub-
jects are “coming into line” and can
take further precautions against apply-
ing improper pressure on them. Beyond
this, the longitudinal researcher can 
take whatever comfort is available in 
the evidence that major changes in 
parents’ rearing practices are unlikely 
to occur in response to the researcher's 
weak expression of his attitudes. As 
Wolf has pointed out in a report on the 
Yale Child Study Center longitudinal 
research project, 


People’s behavior is not so easily 
changed in vital matters as some attitude- 
change studies lead us to believe and as 
optimistic educators hope for. . . . Only 
two of our (ten) cases breastfed for any 
length of time. A number of our study 
mothers spanked their children during 
their first year, most of them in the course 
of the first two years. The toilet-train- 
ing procedures of our study mothers are 
certainly not marked by the stamp “ap- 
proved by child-development experts” 
(1957, p. 47 f.).

The longitudinal researcher must rec- 
ognize that he is not seeing his subjects 
in a perfectly natural state and that they 
will occasionally be influenced in their 
behavior as a result of often-repeated 
interactions with him. But as long as he
is tuned to the possibility of distortion
and discreet in the social exchange that
is part of research on human behavior, 
it is unlikely that he will seriously re- 
duce the validity of his findings by mak- 
ing his subjects too responsive to his 
own preconceptions. 


THE ANALYSIS 
OF LONGITUDINAL DATA 


After the longitudinal researcher has 
fought his way through the web of 
problems that surround the setting up 


and execution of an extended study 
of child development, he faces one 
last task—the analysis of his data. On
occasion, there is nothing in this process 
special to longitudinal study; when the 
research is of limited scope and when 
there are established conventions for 
handling data analysis, the investigator 


goes about his work with no more 


hesitation than is usual for students of
human behavior. Once again, examples
from the study of test performance
(Bayley, 1933) and of physical growth
(Stolz and Stolz, 1951) best illustrate
this stage of developmental research.
But the interest of child psychologists 
in more intricate aspects of human 
development leads to the collection of 


data for which there are no simple rules 
for analysis and summary. The innova- 
tor, in particular, setting out to describe 
and account for behavior that has not 
been studied in detail before, cannot 
solve his problems of data analysis by 
reference to agreed-on principles. This 
ambiguity, necessary as it is in ex-
ploratory phases of study, gives rise to
several problems for the longitudinal re- 
searcher in the midst of his protocols. 
Two of these issues deserve further 
comment—the problem of too much 
information and the problem of con- 
tamination. 

It is somewhat curious to speak of 
having too much information about a
group of subjects, but as the longi-
tudinal researcher works over a period
of years to collect observations that he 
considers relevant to his research goals 
he may find himself in possession of a 
staggering array of data and with no 
end in sight. The issue is sometimes 
further complicated by the fact that
the researcher cannot turn to the
analysis of the data he has already col-
lected because he is spending full time
collecting new material on his subjects.
For those studies that extend over dec-
ades rather than years, the danger of
being overwhelmed by the weight of
data is by no means mythical. In a
vise like this the researcher may either 
postpone his report indefinitely or 
present only a case-historical sample of 
what he has achieved (Fries and Woolf, 
1953; Murphy, 1956). 

One corrective to this difficulty is 
implied in what has already been said
about longitudinal research—that the
goals of the project must be carefully
thought through in the planning stage
and careful estimates made of the time
and effort that will be required to keep
abreast not only of the child and his
environment but of one’s data as 
well. This sort of calculation may re- 
sult in a restriction of the range of be- 
havior observed, in a limitation on dura- 
tion of the study, in a reduction in the 
size of anticipated sample, or in an in- 
crease in professional personnel sched- 
uled for the project, but it will also 
make more likely the presentation of a 
terminal report. 

Child psychologists are not seers, 
however, and the intricacy of children’s 
development demands a good deal of 
flexibility in the planning of research. 
Assuming that, in spite of appropriate 
and considered planning, the researcher 
finds his data coming in faster than he 
can analyze them, what can he do? 
One possibility, which has additional 
advantages to be mentioned shortly, is 
to set up a two-team organization for 
the research. One group of workers 


collects data and maintains contact with 
the subjects; the other sets itself the 
task of coding, summarizing, and ana-
lyzing observations. With divided re-
sponsibility of this order, some balance
will be established between the collec- 
tion of data and their analysis. More- 
over, the occurrence of analysis parallel 
with observation would provide sug- 
gestions to the data-collection team 
about the kinds of observations that 
would be relevant to the emerging pic- 
ture of development. 

Finally, the problem of “too much” 
can be put in better perspective if the 
double function of longitudinal re-
search is remembered. Extended ob-
servation of children with an eye to
including a wide range of behavior
serves the important purpose of out-
lining a series of problems for later
concentrated study. “Big” longitudinal 
research has to date primarily served 
this necessary exploratory function 
(MacFarlane, 1938; Jones, 1943). It may 
be most fruitful for child-development 
research in the long run for psychol- 
ogists to recognize frankly the pre- 
liminary character of current long-term 
longitudinal research on the develop- 
ment of the child and to accept the 
data obtained as an unrivaled source 
of hypotheses rather than as a finished 
set of observations that must be ana- 
lyzed in detail. This orientation would 
reduce our distress at being unable to 
have in public view all the results of 
longitudinal research, and it might speed 
the “second-stage” collection of stable 
generalizable relationships in child de- 
velopment. 

The longitudinal researcher who can-
not, because of the exploratory nature
of his study, state in advance how he
will analyze and present his data runs 
the risk of another stumbling block— 
contamination. If, for example, a re- 
searcher has observed the development 
of a child from birth to 4 years and at 
the end of this period sets about his 
analysis and summary, he cannot ex- 
amine his information about the child 
as an infant free of influence from his 
knowledge of the child’s later behavior. 
If he knows the 4-year-old to be de- 
pendent, anxious, or active, the in- 
vestigator’s appraisal of the child’s in- 
fancy will be directed in part by this 
knowledge. Nor are adequate checks 
always available when a group of chil- 
dren is under study because the long- 
term longitudinal sample will often 
not be large enough to preclude selec- 
tive treatment of the data. As was noted 
in the discussion of post hoc analysis, 
it is often possible to “see” in a mass of 
protocols the continuing thread of 
some characteristic. There is enough 
known about human perception to be 
certain that contamination will not be 
avoided simply because the researcher 
tries to avoid it. In his contacts with 
his subjects he builds an implicit por-
trait of them, a set of expectations about
their behavior, and it would be surpris- 
ing if this portrait did not influence the 
work of analysis and summary. More 
than that, it is important for the re- 
searcher to use his contaminating in- 
formation when he is in search of hy- 
potheses; by moving back and forth 
through his protocols, he will fre- 
quently come up with useful directions 
for new study. But when the goal of 
the research is the statement of re- 
peatable relationships, contamination 


from the researcher’s “theory of the 
child” is a confusing source of error. 
The two-team approach to longitudinal 
study, which permits the analysis of 
data as they are collected, stands as one 
technique for reducing contamination 
of this sort. When the behavior of the 
4-year-old is observed, whatever can be 
said about his infancy has been said 
some time earlier, and the investigator's 
response to later behavior cannot in- 
fluence the report of the child’s infancy. 

Short of concurrent analysis, the 
problem of contamination can be dealt 
with by replication. When a relation- 
ship is found in an overview of develop- 
mental data collected in longitudinal 
study, it is usually appropriate to check 
its generality by an observation of new 
cases. The creative use of longitudinal 
observation is an important part of 
child-development research, and it per-
mits the presentation of an integrated
sensible picture of the developing child. 
However, this exploratory function 
must be supported by more limited and 
precise study under circumstances in 
which contamination does not introduce 
the possibility of error. 


Summary 


The study of child development in- 
volves two interlocking goals—the 
statement of reliable relationships be- 
tween behavior and age and the anal- 
ysis of these relationships to provide 
better prediction and control of child 
behavior. The achievement of these 
goals requires the continuing collec- 
tion of empirical evidence, particularly 
about age-functional changes in be-
havior among different subpopulations
of children and in response to environ-
mental change, and a refinement of
theoretical principles about develop-
ment.

The intricacy of child development,
and the existence of wide individual
variation in behavior, have brought to
prominence the longitudinal study of
child behavior. There is much to recom-
mend longitudinal procedures, both as a
source of hypotheses and as a sensitive
method of confirmation. There are
risks, however, in depending on choice
of research design to provide stable
answers to our questions about human
development. The psychologist study-
ing children must bear in mind the
presuppositions of his research, the kind
of generalization he is aiming for, and
the relevance of his procedures to other
research on child development.


Observational


This chapter is about research methods
that leave nature and society to their 
own devices. These methods rest upon 
direct observation as a scientific prac- 
tice that includes observing and associ- 
ated recording and analysis of naturally 
occurring things and events. Observa- 
tional child study is a collective name 
for all such methods in psychological 
research with children. 

Observational child study has been 
complicated at times by facilities that 
range from notebooks and movie cam- 
eras to elaborate check lists and rating 
scales to designs for experiments in na- 
ture. At bottom, though, this must be 
the simplest way of all to study child 
behavior. One gets within seeing and 
hearing distance of a child, observes and 
records something of his behavior or 
situation or both, and then scores, clas- 
sifies, summarizes, freely interprets, or 
otherwise does something with the re-
corded observations. A group is some-
times observed instead of a single child,
but the basic routines are the same in 
either case. 

Direct observation here is direct in 
two ways. (1) No planned arrange- 
ments stand between the observer and 
his target phenomena, and (2) neither 
does appreciable time: recording closely 
follows observing. The first of these 
limitations excludes tests, interviews, 
and questionnaires as well as manipula- 
tive experiments, and the second ex- 
cludes especially the retrospective de- 
scriptions of case studies. What remains 
is study of spontaneous and ongoing 
child behavior in the settings of every- 
day life. 

Consider now some data on rate of 
output in this line of research. Four- 
teen hundred nine empirical studies 
whose subjects include children and 
adolescents have been reported since 
1890 in The Pedagogical Seminary, The 
Journal of Genetic Psychology, Genetic 
Psychology Monographs, Child Devel- 
opment, and Monographs of the So- 
ciety for Research in Child Develop- 
ment. These investigations are classified 
in Table 3-1, which shows that only 
110, or 8 per cent, of them meet the 

present criteria of observational child
study and that this fraction has not been 
exceeded greatly during any of the 
seven decades after 1890. 

These figures cannot be surprising. 
They only document the safe generali- 
zation that psychology in its every 
branch has done comparatively little 
watching and recording and examining 
of events left to happen as they may. 
Psychological science began with a leap 
from the armchair to the laboratory and 
has since generally preferred to do 
things with its subjects, to give them
tasks or problems, to interrogate them,
to test them, or at least to draw them
into prearranged situations.


TABLE 3-1. EMPIRICAL STUDIES IN
CHILD PSYCHOLOGY FROM
1890 THROUGH 1958

Period        All     Observational
                      Number  Per Cent

1890-1899      35        3        9
1900-1909      63        8       13
1910-1919      48        4        8
1920-1929      82        3        4
1930-1939     491       53       11
1940-1949     328       20        6
1950-1958     362       19        5

Total        1409      110        8


Note. The data are from all issues
since 1890 of The Pedagogical Seminary
and the Journal of Genetic Psychology,
Genetic Psychology Monographs, Child
Development, and Monographs of the So-
ciety for Research in Child Development.
Evident factors contributing to the star-
tling upsurge of studies in the 1930s in-
clude the founding then of Child Devel-
opment, with its associated monograph
series, and the often noted increase during
these years of financial support for be-
havioral research with children.
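
The percentages in Table 3-1 can be rechecked from the counts. The sketch below is a reader's check, not part of the original tabulation; the variable and function names are hypothetical. It assumes that the count of all studies for 1930-1939 is 491, the value required for the column to sum to the stated total of 1409.

```python
# (all studies, observational studies) per period, as read from Table 3-1.
# The 1930-1939 count of 491 is an assumption implied by the stated total.
table_3_1 = {
    "1890-1899": (35, 3),
    "1900-1909": (63, 8),
    "1910-1919": (48, 4),
    "1920-1929": (82, 3),
    "1930-1939": (491, 53),
    "1940-1949": (328, 20),
    "1950-1958": (362, 19),
}


def percent_observational(all_studies, observational):
    """Observational studies as a whole-number percentage of all studies."""
    return round(100 * observational / all_studies)


total_all = sum(a for a, _ in table_3_1.values())
total_observational = sum(o for _, o in table_3_1.values())
percents = [percent_observational(a, o) for a, o in table_3_1.values()]
overall_percent = percent_observational(total_all, total_observational)
```

On these figures the column totals and every rounded percentage agree with the printed table, including the overall figure of 8 per cent.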

Direct observation, on the other hand, 
has been a methodological mainstay in 
many other sciences. It has stood as the 
basic method of astronomy and the 
earth sciences, of the natural history 
disciplines in biology, and of anthro- 
pology and sociology. Its contributions 
may not have been revolutionary so 
often as procedures that manipulate 
events; and yet they have been sub- 
stantial over the long run of progress 
in science. Purely observational meth- 
ods have enabled scientists to record 


the true appearance and the distribution
in nature of countless phenomena, to
discover new things from microorgan-
isms to galaxies, to find new problems,
to anticipate the needs of theories for
hard empirical facts, hence to avoid the
vulgarities of easy speculation, and even
to test theories where experimental ar-
rangements were difficult or impos-
sible.

These characteristically slow but sure
scientific gains alone recommend a re-
view of psychological methods in child
development with the common base of
direct observation. What differing ways
of studying child behavior while leav-
ing it alone have been made available
by research in child psychology? And 
what are the advantages and disadvan- 
tages of such methods? The chapter 
asks these questions. 


Outline of Methods 


Two phases of investigation are fore- 
most in our preface to this kind of re- 
search in child development: recording 
of behavior and analyzing the obtained 
record. A third major phase remains 
implicit: the beginning one of devising 
a plan for sampling the universe of child 
behavior. The sampling plan for a par- 
ticular investigation generally needs tac- 
tics of a statistical nature that suit only 
its own subjects, measures, and aims. 
But broader requirements of strategy 
in sampling that can be generalized as 
continuum coverage and material cov- 
erage fall on all observational studies of 
behavior. These requirements are set 
first by problems of content rather than
method, and yet they are bound fast to
the task of getting good and sufficient
data from an always complex field of
study.

The behavior of a person is a lifelong
continuum. It is in the nature of a
stream that can never be seen in its 


entirety. To observe it, therefore, one 
has to divide it into observable lengths.
One can look in on the stream, say, once
a day for an hour, once an hour for a
minute, for as long as a quarrel or a
greeting lasts, or from awakening time
to bedtime of a day through as many 


minutes, quarrels, greetings, or 
days as one may choose. Continuum 
coverage refers to the lengths or parts 
into which the stream of behavior is 
divided for purposes of observation. 
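
Continuum coverage of the fixed-interval kind ("once an hour for a minute") can be laid out as a schedule before observation begins. The sketch below is a minimal illustration: the function name, the calendar date, and the 7:00 to 19:00 waking day are assumptions made for the example.

```python
from datetime import datetime, timedelta


def observation_windows(start, end, every, duration):
    """List (begin, finish) windows of length `duration`, one every `every`,
    keeping only windows that end no later than `end`."""
    windows = []
    begin = start
    while begin + duration <= end:
        windows.append((begin, begin + duration))
        begin += every
    return windows


# Hypothetical waking day, 7:00 to 19:00, observed for one minute each hour.
day_start = datetime(1960, 1, 1, 7, 0)
day_end = datetime(1960, 1, 1, 19, 0)
windows = observation_windows(day_start, day_end,
                              every=timedelta(hours=1),
                              duration=timedelta(minutes=1))
observed = sum((finish - begin for begin, finish in windows), timedelta())
```

Such a plan observes twelve minutes out of a twelve-hour day, which makes plain how small a length of the behavior stream a fixed-interval plan actually covers.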

The behavior stream is enormously
complex. Observing all of its many cur- 
rents and ripples for the shortest time 
is so difficult that limits are usually set 
again. One can look during the hour a 
day only for such grand current phe- 
nomena as taking first steps or saying 
first words. One can look during the
minute once an hour for little ripple 
phenomena on the order of restless ac- 
tions in a classroom; or during the 
greeting for degrees along such a be- 
havior dimension as emotional warmth; 
or during the day, as precedent truly 
stands (see p. 83), for everything in 
sight in the behavior and situation of 
the person. Material coverage refers to 
what and how much at a time the ob- 
server tries to see in the stream of be- 
havior. 

Through what continuum units are 
observations made and of what order 
and scope are the observed phenomena? 
How are the observations recorded?
And what is done with the recorded
material? 

Answers made in research to these 
questions of sampling strategy, record- 
ing technique, and analysis procedure 
give a basis for sorting and describing 
different methods of observational child 
study. One can count as a separate 
method a particular combination of 
answers to the three questions. Six 
methods stand out in the literature on 
this basis. These methods, each con- 
veniently named and identified by an 
early representative writing, are diary 
description (Tiedemann, 1787), speci- 
men description (Dresslar, 1901), time 
sampling (Olson, 1929), event sampling 
(Dawe, 1934), field unit analysis 
(Bishop, 1951), and trait rating (Hag-
gerty, Olson, and Wickman, 1930).
Another sort of observational investi- 
gating in child development will be 
added later (pp. 114-117). But these six 
methods accommodate the main find- 
ings of our canvass. Table 3-2 may help 
here. It outlines most of the survey by 
summarizing the sampling, recording, 
and analysis provisions of each method. 
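
The idea that a method is "a particular combination of answers to the three questions" can be pictured as a small record type. What follows is only a sketch: the class and field names are assumptions made for illustration, and the entries paraphrase Table 3-2 rather than add anything to it.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Method:
    name: str
    continuum_coverage: str  # sampling: lengths of the stream observed
    material_coverage: str   # sampling: what is looked for in each length
    recording: str           # recording technique
    analysis: str            # analysis procedure


SCORING = "scoring and statistical analysis"
SPOT = "on-the-spot coding or narration or both"

METHODS = [
    Method("diary description",
           "more or less regular day-to-day intervals",
           "successive steps in behavioral growth, with life episodes",
           "itemization of growth changes and summary narration",
           "classification and interpretive study"),
    Method("specimen description",
           "continuous behavior sequences",
           "'everything' of ongoing behavior and situation",
           "detailed sequential narration",
           "interpretive study or coding, scoring, and statistics"),
    Method("time sampling",
           "intermittent short and uniform time units",
           "selected variables of behavior or situation or both",
           SPOT, SCORING),
    Method("event sampling",
           "event time spans",
           "behavioral events of a given class",
           SPOT, SCORING),
    Method("trait rating",
           "continuous behavior sequences",
           "selected dimensions of behavior",
           "rating based on cumulative direct observation",
           SCORING),
    Method("field unit analysis",
           "successive behavior units",
           "selected variables of behavior or situation or both",
           "on-the-spot coding",
           SCORING),
]

# Each method is a distinct combination of answers to the three questions.
combinations = {
    (m.continuum_coverage, m.material_coverage, m.recording, m.analysis)
    for m in METHODS
}
all_distinct = len(combinations) == len(METHODS)
```

Several methods share a single answer (three of them share one recording technique, for instance), which is why the combination, and not any one provision, is what marks a method off.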

Both the nature of the target material 
and the observed lengths of the behavior 
stream vary definitively from one to 
another of the different methods. See 
how this holds by checking the follow- 
ing statements against the entries under 
Sampling Plan in Table 3-2. Diary de- 
scription traces developmental changes 
as these occur at biographically sampled 
intervals. Specimen description covers 
intensively and continuously the be- 
havior and situation of the child during 
more or less extended behavior se-
quences. Time-sampling records se-
lected aspects of behavior if and as they
happen within precisely limited time
spans. Event sampling singles out natu-
rally segregated behavioral events of
one or another class and records these
events as they arise and unfold. Field
unit analysis divides the behavior stream
into consecutive units as they occur and
describes each unit in turn while the
iron is still hot. Trait rating selects di-
mensions of behavior and bases judg-
ments about them on observations dur-
ing extended sequences of behavior. Re-
cording techniques and analysis proce-
dures, noted in the table, are made to
suit these different sampling plans.


TABLE 3-2. A SURVEY OF METHODS IN
OBSERVATIONAL CHILD STUDY

Open Methods

Diary description
  Sampling Plan, Continuum Coverage: More or less regular day to day intervals
  Sampling Plan, Material Coverage: Successive steps in behavioral growth and associated life episodes
  Recording Technique: Itemization of growth changes and summary narration
  Analysis Procedure: Classification and interpretive study

Specimen description
  Sampling Plan, Continuum Coverage: Continuous behavior sequences
  Sampling Plan, Material Coverage: "Everything" of on-going behavior and situation
  Recording Technique: Detailed sequential narration
  Analysis Procedure: Interpretive study or coding, scoring, and statistical analysis

Closed Methods

Time sampling
  Sampling Plan, Continuum Coverage: Intermittent short and uniform time units
  Sampling Plan, Material Coverage: Selected variables of behavior or situation or both
  Recording Technique: On-the-spot coding or narration or both
  Analysis Procedure: Scoring and statistical analysis

Event sampling
  Sampling Plan, Continuum Coverage: Event time spans
  Sampling Plan, Material Coverage: Behavioral events of a given class (as arguments)
  Recording Technique: On-the-spot coding or narration or both
  Analysis Procedure: Scoring and statistical analysis

Trait rating
  Sampling Plan, Continuum Coverage: Continuous behavior sequences
  Sampling Plan, Material Coverage: Selected dimensions of behavior
  Recording Technique: Rating based on cumulative direct observation
  Analysis Procedure: Scoring and statistical analysis

Field unit analysis
  Sampling Plan, Continuum Coverage: Successive behavior units
  Sampling Plan, Material Coverage: Selected variables of behavior or situation or both
  Recording Technique: On-the-spot coding
  Analysis Procedure: Scoring and statistical analysis

Attention is called to the distinction 
in Table 3-2 between open and closed 
methods. Field unit analysis is not 
placed in either of these categories be- 
cause it is open in some applications and 
closed in others. Diary description and 
specimen description are indeed open 
to diverse phenomena. Time and event 
sampling and trait rating, on the other 
hand, are closed to everything but 
rather uniform and sharply limited phe- 
nomena. The open methods are broadly 
comparable to those of the biographer, 
the historian, or the biological field ex- 
plorer. The closed methods are more 
like those of the classical experimenter 
or the field observer in biology with 
eyes for only one descriptive target. 
Closed direct observing resembles ex- 
perimenting in narrowness of observa- 
tional set. 


Some observational techniques of these 
methods have often been used in manipu- 
lative experimental studies. These tech- 
niques, as applied in such studies, have 
pertinence here. However, space limits 
force nearly complete exclusion of re-
search that does manipulate behavior or
situation. A clear boundary surrounds
“purely” observational methods. If one 
crosses that boundary, stopping becomes 
difficult. 


Subjects 


The change-with-age emphasis of 
problems in child development makes it 
important to consider that these meth- 
ods have not been applied with any- 
thing like equal frequency at all age 
levels. Table 3-3 shows the number and 


TABLE 3-3. NUMBER AND PER CENT OF 
OBSERVATIONAL AND OTHER EMPIR- 
ICAL STUDIES SINCE 1890 WITH PRE- 
SCHOOL CHILDREN, SCHOOL CHILDREN, 
AND ADOLESCENTS AS SUBJECTS 


               Observational            Other
              Pre   Sch   Ado     Pre   Sch   Ado
Number        104    12     3     643   600   427
Per Cent       94    11     3      50    46    33


Note. Pre: preschool children, ages under 
6. Sch: school children, ages 6-12. Ado: ado- 
lescents, ages about 13-19. The N of observa- 
tional studies is 110. A given study often used 
subjects of more than one age level with the 
result that each per cent frame exceeds 100. 
See Table 3-1 for data sources. 


per cent of the studies in our sample
of Table 3-1 in which preschool chil- 
dren, school children, and adolescents 
have been subjects. It may be seen that 
the percentages at these respective age 
levels stand at 94, 11, and 3 for observa- 
tional as against 50, 46, and 33 for every 
other kind of empirical child study. 

In both of these bracketings of re- 
search, then, young children and ado- 
lescents, respectively, have served most 
and least often as subjects. The plainest
indication here is that methods of study-
ing child behavior by direct observa-
tion have been used a good deal more
than other methods in child psychology 
with subjects under 6 years of age. One 
of the reasons for this may well be an 
opinion, expressed rather often in the 
literature, that the younger the child 
the more natural his behavior in the 
presence of an observer. Other possible 
reasons may include institutional poli- 
cies against frequent participation by 
younger children in experimental pro- 
cedures and the popularity among ex- 
perimenters of research problems call- 
ing for knowledge and skills of the 
older child. It is clear, in any case, that 
methods of observational child study 
have been shaped most by research on 
behavior in early childhood. 


Settings and Situations 


Table 3-4 shows the number and per 
cent of our 110 observational studies in 
which children have been observed in 
designated life settings. The figures in- 
dicate that, just as preschool children 
are the preferred subjects, preschools 
are the favored scenes of this research. 
Observers have ventured into home set- 
tings in less than a fourth of these in- 
vestigations. They have gone into hos- 
pitals and maternity homes in 7 per cent 
and into elementary schools in 6 per 
cent of the different studies. In only 
two of the 110 investigations were chil- 
dren observed as they went about from 
place to place in entire communities. 
There are left over finally a miscellany 
of settings, such as summer camps, 
clinics, and orphanages, that figured in 
13 per cent of the studies. 
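
The percentages quoted here follow from the counts by simple division over the 110 studies. The sketch below checks that arithmetic; it assumes ordinary rounding to the nearest whole per cent, which the text does not state, and it leaves out the home row because that count is incomplete in the table as reproduced.

```python
N_STUDIES = 110  # number of observational studies in the sample

# Counts of studies per setting, as given in Table 3-4.
counts = {
    "Nursery schools": 62,
    "Hospitals and maternity homes": 8,
    "Elementary schools": 7,
    "Communities-at-large": 2,
    "Miscellaneous": 14,
}

# Per cent of the 110 studies, rounded to the nearest whole number
# (the rounding rule is an assumption of this sketch).
per_cent = {
    setting: round(100 * n / N_STUDIES) for setting, n in counts.items()
}
```

Because some investigations observed children in more than one setting, the column of percentages is not expected to sum to 100.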


TABLE 3-4. NUMBER AND PER CENT OF
OBSERVATIONAL STUDIES IN WHICH
CHILDREN HAVE BEEN OBSERVED IN
SPECIFIED LIFE SETTINGS

                                       Studies
Settings                           Number   Per Cent

Nursery schools                      62        56
Homes                                 2
Hospitals and maternity homes         8         7
Elementary schools                    7         6
Communities-at-large                  2
Miscellaneous                        14        13

Note. The N of studies is 110. In certain
of these investigations children were observed
in more than one setting. See Table 3-1 for
data sources.


These scenes of observation have been
called interchangeably settings and sit-
uations. We shall go on calling them
settings in the meaning of milieu parts
that provide time, place, and facilities
that are suited to certain kinds of hu-
man behavior.1 At the same time, in re-
ferring to situations, we shall mean
states of affairs that arise within set-
tings and that are made up of the
momentary conditions in an individual
child and his environment under which
the actions of individual behavior
episodes occur. A nursery school, again,
is a setting. The complex of conditions
that includes, in a child X, a present
need to have fun and, in the environ-
ment, a cheerful teacher, a sandbox, and
a toy shovel, all of which make a differ-
ence to X while he is digging a hole in
the sand, is a situation. A situation, in
short, is the relevant person-environ-
ment context of a particular action at
a particular time. Three assumptions are
made here.


1. Certain sorts of situations rather
than others tend to arise in a given set-
ting.

2. Yet settings and situations are by
no means univocally related. On the
contrary, (a) very radically differing
situations can occur in the same setting,
and (b) virtually identical situations can
occur in different settings.

3. The behavior of the child at a par-
ticular time does indeed depend more
directly upon the situation than upon
the setting in which it takes place.


1 The intention in this statement is only to
place a common everyday concept that is used
at many points in the chapter. The closely
related but more restricted offspring concept
of behavior setting is represented on pp. 114-
117.


Now, the present significance of these 
distinctions is that our six methods and 
variants of each differ in provisions for 
observing and recording of settings, sit-
uations, and behavior. Aspects of be-
havior often are described entirely out
of context. In these and other instances
settings may be barely identified, with
nothing recorded as to their distinctive
milieu characteristics or associated be-
havior patterns. In still other cases gen-
eral comment (Biber et al., 1942; Isaacs,
1930) or supplemental methods (Barker
and Wright, 1955; Gump and Sutton-
Smith, 1955b) point up special charac-
teristics of settings in which the chil-
dren behave. A few procedures (Olson
and Wilkinson, 1932; Baldwin, Kalhorn, 
and Breese, 1945) focus on the situation 
of the child while leaving what he does 
for secondary consideration. Finally, 
there are some procedures in which 
situation and behavior are observed, re-
corded, and examined together. An ef-
fort will be made to represent the dif-
ferent methods with enough regard for
these dissimilarities and some of their 
consequences. 

It seems a likely guess, meanwhile, 
that observational methods in child de- 
velopment would now be suited to more 
problems if they owed less to the 
American nursery school and more to 
stores, streets, parks, vacant lots, swim- 
ming pools, and other settings that child 
psychologists rarely visit for research 
purposes. As for subjects, Gellert (1955, 
p. 193) points out that most of the
children observed in nursery schools 
have been of “above average intelli-
gence, from high socioeconomic strata, and
of Judean-Christian cultural and ethnic
heritage.” Consider further that this de-
scription applies in a preschool setting,
not only to the pupils as subjects, but 
also to these same children as social 
behavior objects in the situations of one 
another. In general, a suggestion by 
Gellert that methods of direct observa- 
tion need to be extended more to chil- 
dren of other sorts and ages and to a 
larger “variety of environments” seems 
well founded. 


Application Aims 


One might expect to find unity of 
purpose in methods with a common 
focus on naturally occurring behavior. 
Outsiders, if no others, could reason- 
ably look here for study of naturally 
occurring behavior per se. Field his- 
torians among biological scientists, for 
example, might well suppose that the 
main idea in observing children “in the 
field” must always be to describe the
natural habitats and behavior of chil-
dren. But one quickly discovers that
these naturalistic methods have by no
means always served naturalistic pur-
poses.

A natural history objective or eco- 
logical aim can indeed be found in ob- 
servational study of relationships be- 
tween behavior and conditions that 
vary from part to part of the real 
world. Baldwin, Kalhorn, and Breese 
(1945) describe parental behavior pat- 
terns in different types of homes and 
show some of their consequences for 
social and intellectual growth. Barker 
and Wright (1949, 1955) survey psy- 
chological living conditions and be- 
havior of children in the American 
small town of Midwest. Gump and 
Schoggen (1955a, 1955b) compare 
home and camp in the life of a single
urban child. Wright and others (1957) 
are inquiring into life situations and 
behavior of children in communities 
differing in size. These studies try to 
link child behavior with conditions 
anchored in natural habitats. They are 
uncommon. Ecological ends as such 
have rarely been sought in any field of 
psychological research. 

A normative aim is more prominent 
by far in observational child studies. 
The research goal here is precisely the 
same as in normative applications of 
psychological tests and laboratory 
measures. Children are classified as to 
age or whatever and then observed di- 
rectly for the purpose of disclosing 
central behavior tendencies in 2-year- 
olds, 5-year-olds, boys, girls, or the like. 
Goodenough (1928), Thomas et al. 
(1929, 1933), Arrington (1939), and
others have argued here that norms
based upon direct observation of “spon-
taneous behavior” may have greater
validity than those established by tests
or other “interference techniques,” es-
pecially in “the social behavior field.”

A systematic aim is evident in obser-
vational studies of relationships between
universal behavior variables. Theory
steers investigation toward general psy-
chological laws in this research. For ex-
ample, Lippitt, Polansky, and Rosen, in
“a field study of social influence in
groups of children,” contribute to a
“systematic theory of social power in
face-to-face groups.” They use socio-
metric measures to get for each boy of
a camp population an index of attrib-
uted power that expresses judgments
by the child’s associates of his ability
to influence others. Then, using a time-
sampling method, they observe directly
and count direct influence attempts and
instances of behavioral contagion in in- 
teractions of the children as group 
members. Correlations of the socio- 
metric power measures with the obser- 
vational scores on social influence dem- 
onstrate that, among other things: 


The group member is more likely to
imitate the behavior of those to whom
he attributes high power in the group.
. . . Members with high attributed power
initiate more social influence attempts
than do low power members . . . (1952,
p. 59).


These are not ecological statements 
about “camp behavior” or normative 
statements about “boy behavior.” They 
are generalizations without limiting ref- 
erences to specified habitats or classes 
of children. Observational research with 
a systematic aim, like so much of labo-
ratory research, turns on such generali-
zations.

Finally, an idiographic aim occurs in 
observational studies of particular chil- 
dren as individual persons. Virginia 
Axline introduces such a study with
these remarks. 


Children in a free play experience tell 
us many things about themselves. As we 
observe them . . . we see the expansive-
ness and restriction in different children.
We see the acceptance and rejection of the
challenging situation. We notice that the
child’s way of meeting the new situation
is as varied as the personalities of the
children (1951, p. 358).


Direct observations of children are 
used in this study, as in others to be 
mentioned, much as psychological tests 
or interviews are often used for diag- 
nostic purposes. 

These different aims are not logically 
foreign to one another. Their end re- 
sults certainly can overlap. Placing the 
outcomes of investigation with a nor- 
mative aim in a separate class by them- 
selves is especially open to question. A 
developmental norm, for example, is an 
empirical generalization that connects 
behavior with age, and, as such, it does 
not differ strictly from many empirical 
generalizations of research that we are 
identifying as systematic in aim. Again, 
the findings of ecological studies are 
normative insofar as they reduce to 
averages that describe designated popu- 
lations or environments. Practically im- 
portant differences in emphasis and 
form of generalization are nevertheless 
represented by these objectives, which, 
therefore, we shall take into account in 
considering each method.


Survey of Methods


The summaries of Table 3-2 will now 
be developed. 


It would be possible to examine one at a 
time the separate provisions in Table 3-2 
for sampling, recording, and data analysis. 
Then, with this accomplished, one could 
re-examine the complete methods. Such a 
survey plan offers systematic advantages. 
But severe dismantling of research cases 
in point is avoided and seeing more of the 
woods than the trees may be assured by 
our leaving each method intact as a way 
to go from start to finish in a research 
undertaking. 


DIARY DESCRIPTION 


This oldest of methods in child de-
velopment could also be the most sim-
ple minded. As the reader is forewarned,
however, we believe that it may de- 
serve re-examination and even practical 
attention by students of child develop- 
ment whose households include young 
children.1

The method is well known. It em-
ploys mainly the recording technique
of the lay diary to draw a sequential 
account of growth changes and behav- 
ior episodes from the life history of the 
child. Notations are made from day to 
day of what happens when develop- 
ment proceeds, and episodes that il- 
lustrate various behavioral processes are 
narratively described. Then, as a rule,
the recorded material is treated only
by topical classification and characteris-
tically free qualitative analysis. The no-
tations and descriptions necessarily are
selective in the face of innumerable
events that pass in an endless procession.
Yet they are not capriciously selective
in the sense that the observer sets down
whatever is only arresting. The entries
in diary records are ruled on the con-
trary by a stable and objective principle
of limitation, namely, that of recording
in sequence new behavioral events in
the behavior continuum of one subject,
usually an infant or a child of preschool
age. A special feature of the method
has always been close and essentially
continuous contact between the subject
and the observer, who have invariably
lived together as child and parent or
other guardian.


1 I inject here without delay as respectively
great and small argumenta ad homines the two
facts that Kurt Lewin urged me twenty-one
years ago to keep a diary record on my son
(now 21) and that I wish I had. Here is some-
thing many could do.


The open character of diary descrip- 
tion differs in breadth of focus as be- 
tween the comprehensive diary, which 
records in order as much of everything 
new as it can, and the topical diary, 
which excludes everything but phenomena 
in one or a few developmental channels. 
Beginners in child development read about, 
although one doubts that many of them 
have read lately, such comprehensive diary 
studies as those of Tiedemann (1787), 
Darwin (1877), Preyer (1888-1889), 
Sully (1895), Shinn (1900), and the 
Scupins (1907). Topical diaries include 
those kept by Taine (1877) and Bateman 
(1916) on language behavior, Buchner 
(1909) on emotional behavior, Dearborn 
(1910) on motor and sensory functions, 
Brainard (1927) on “learning and in- 
stincts,” and Navarra (1955) on concept 
formation. 

Diary records are not all products of
direct observation alone. There are in
many of them reports of casual interview 
procedures and even of improvised 
psychological tests (Major, 1906; Hogan, 
1898). Some also present such quick ex-
periments as the one reported and inter-
preted engagingly by Darwin as follows in
the diary he kept on his son. 


On the seventh day, I touched the
naked sole of his foot with a bit of
paper, and he jerked it away, curling
at the same time his toes, like a much
older child when tickled. The perfec-
tion of these reflex movements shows
that the extreme imperfection of the
voluntary ones is not due to the state
of the muscles or of the coordinating
centers but to that of the seat of the
will (1877, p. 285).


Such an experiment barely breaks the 
rules and would seem to suit the spirit of 
direct observation. 

But most of these records report chiefly 
(1) such developmental items as “eighty-
ninth week, running is awkward, but
falling rare,” from Preyer (1888-1889, p.
xxxi), and (2) such occurrences as the one 
related and interpreted by Sully in the 
following graphic passage. 


One day (end of seventh month [of 
the third year]) when, after being very 
naughty, his mother had to carry him 
upstairs, he broke into a more than 
usually violent fit of crying. His mother 
asked him what he meant by making 
such a noise on being carried upstairs; 
whereupon he replied, “’Cause you 
carry me like a pig.” . . . This is sig-
nificant as showing how . . . the con-
scious sense of self was developing . . .
on its moral side as a sense of personal
dignity . . . (1895, p. 452).


Spontaneous growth indices and be- 
havior episodes—these, narratively de- 
scribed, make up the bulk of the child 
diaries.

Diary records have often been turned
to practical account in modern text-
books and journals as object lessons in
how not to do psychological research 
with children. They have been charged 
with defects which include biased se- 
lection, unreliable recording, inefficient 
gathering and processing of data, and 
unwarranted interpretation. However, 
these alleged shortcomings are not 
bound to the method with any logical 
necessity; nor would there appear to be
insuperable reasons why all of them
could not be substantially remedied. In
any case, whatever its faults in practice,
the basic method has unique advantages
in principle. It commands the breadth,
richness, subtlety, and permanency of
the written word, compared with which 
the best of check lists and rating scales 
are limited instruments. It gives a mul- 
tidimensional picture of simultaneous 
and successive factors in the behavior 
and circumstances of an individual 
child. It saves for later study at any 
time and for any purpose comparatively 
intact specimens of behavior with its 
context. Diary records understandably
have not often described in very great 
detail the situations in single life epi- 
sodes. Their extended coverage pro- 
hibits continuous and unselective mo- 
ment-to-moment recording of situation 
and behavior. Yet the method does per- 
mit sketchy treatment of situational fac- 
tors in developmental changes. Finally, 
diary description takes into account the 
continuity of behavior as this can be 
done only by observation of children 
one by one over extended periods of 
time. 

Some of the early diary keepers made 
much of the last point. Milicent Shinn,
for example, wrote as follows in intro-
ducing the record of development of
her niece through the first year. 


The biographical method of child study 
has the inestimable advantage of showing 
the process of evolution going on, the 
actual unfolding of one stage after another, 
and the steps by which the changes come 
about. No amount of comparative statistics 
could give this. If I should find out that
a thousand babies learned to stand at an 
average age of forty-six weeks and two 
days, I should not know as much that is 
important about standing, as a stage in 
human progress, as I should after watch-
ing a single baby carefully through the
whole process . . . (1900, p. 11). 


The longitudinal principle in this 
statement has been endorsed often by 
contemporary students of child devel- 
opment; and it could be that no method 
stands to implement the principle bet- 
ter than diary reporting. Other longi- 
tudinal methods have generally used 
large groups of subjects who have been 
visited or summoned, observed, inter- 
viewed, and tested at rather widely 
separated intervals of time. Because
they lack the feature of continuous con- 
tact between investigator and subject, 
these methods inevitably miss much that 
a diary kept at home stands to reveal of 
“the actual unfolding of one stage after 
another” in the life of a child. 

The foremost aim of the child diaries 
has been not the idiographic one of de- 
scribing and understanding the special 
behavior of each child subject but the
normative one of discovering behavior 
traits of children by and large at differ- 
ent ages and stages of development. 
For example, the end results of the 
classical work by Preyer (1888-1889),
which is based upon a record of
“mental growth” in his son from birth
through age 3, are summarized in a 
conspectus that looks and reads astound- 
ingly like familiar developmental sum- 
maries of contemporary child psy- 
chology. But the diarists have gathered 
and worked over their material also 
with a systematic aim. Many of them 
theorize often in educing generaliza- 
tions that relate behavior with its fun- 
damental determinants, as in the instance 
in the foregoing text note from Darwin. 

Diary studies of children have shown 
little or no explicit regard for the eco- 
logical problem. Their preoccupations 
with universal norms and general laws 
all but crowd out entirely deliberate 
attempts to relate behavior with the 
psychological living conditions of dif- 
ferent times, places, and cultures. One 
thinks, however, of the possibility that 
the available child diaries, kept as most 
of them have been over the time span 
of more than a century on children of 
Germany, England, Italy, Portugal, 
France, the United States, and other 
widely ranging localities (Dennis, 
1936), might advantageously be scruti- 
nized for differences across cultures in 
behavior and its conditions. The chiefly 
normative study by Dennis and Dennis 
(1937) of baby biographies from vari- 
ous countries could be extended in this 
direction. 

It may indeed be that some new diary 
records would be useful. Trained 
students of child behavior could keep 
them with new safeguards against bias 
and error. Colleagues or responsible lay 
persons could be called in at moot points 
for tests of observer agreement. New 
refinements in recording methods could 
be applied and so also could new ways 
of coding and quantifying qualitative
material. Even standardized tests and
minor experiments that threaten only 
small violence to nature could be 
worked in at intervals, although some 
observers might prefer only to make the 
most of direct observation. The data
could be sorted, plumbed, tabled,
graphed, Chi-squared, and interpreted
with the additional help of descriptive
categories and conceptual frameworks
that were not available to the early
diarists. And each record could be
mined with explicit regard for norma-
tive, systematic, and ecological prob-
lems, to say nothing of idiographic
problems with domestic implications for
the diarist. Any part of the age span 
and any one channel or all channels of 
development could be opened to study. 
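
A test of observer agreement of the kind suggested above can be as plain as the percentage of entries on which two observers assign the same code. The sketch below assumes that simple index, which is only one of several in use and is not prescribed in the text; the function name and the coded entries are invented for illustration and are not data from any diary.

```python
def percent_agreement(codes_a, codes_b):
    """Per cent of paired entries on which two observers agree."""
    if len(codes_a) != len(codes_b):
        raise ValueError("both observers must code the same entries")
    if not codes_a:
        raise ValueError("at least one coded entry is required")
    matches = sum(a == b for a, b in zip(codes_a, codes_b))
    return 100.0 * matches / len(codes_a)


# Hypothetical codes given by the diarist and by a colleague
# to the same five diary entries (invented for illustration).
diarist = ["new word", "no change", "first step", "no change", "new word"]
colleague = ["new word", "no change", "no change", "no change", "new word"]

agreement = percent_agreement(diarist, colleague)
```

Here the two observers agree on four entries of five. An index of this sort takes no account of agreement expected by chance, so it is better read as a first check than as a finished measure of reliability.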

Psychologists have still done little to 
lay things and events of the behavior 
continuum end to end. So much of their 
research on growth problems has been 
piecework in the absence of our best 
assembly line—the growth stages of a 
single growing child. The Prousts and 
Twains and Rollands and other literary 
biographers may well have accom- 
plished more here than scientists in un- 
raveling social and emotional strands in 
development and in revealing con- 
cretely the lines of connection between 
interdependent factors in the growth 
process. Diary description has much in 
common with literary biography. It, 
too, is well adapted to problems of 
continuity and interdependency, well 
enough, perhaps, that this unique way 
of doing longitudinal research in child 
development should be renovated and 
tried again. 


Diary description and what can rightly 
be termed anecdotal description have often
been given the same names, with each
being called either a “diary” method or
an “anecdotal” method; and both have
been subjected also to much the same
criticisms. Both report observed behavior
narratively, and each has relied upon 
qualitative study. But, instead of recording 
with some regularity the behavioral 
growth of a single child, anecdotal re- 
ports set down at circumstantial times 
whatever invites attention in the be- 
havior of any child who happens along. 

An enchanting model case of this way 
to investigate child behavior—which it 
seems incorrect to feature here as a now 
significant method—is presented by the 
work of Russell (1892) and some of his 
students and collaborators, including 
Burnham, 1891, Brown, 1892, and Haskell, 
1906, who, in the time of G. Stanley Hall 
and with his vigorous encouragement 
(Hall, 1892), collected and classified under 
such headings as imitation, imagination, 
and reasoning no fewer than 79,000 be- 
havior samples. Here is an example from 
Brown that is typical in length, style, 
jocularity, and religious sentiment. The 
subject is a boy, R, of five years. 


R came into the house eating a horse- 
chestnut. Grandma: “Well, R, if you 
eat that horse-chestnut you'll die and go 
to heaven with your mother, and then 
I shan’t have any R.” (His mother was 
dead.) R: “Well, I'll go out and get four 
horse-chestnuts, one for grandpa, one 
for you, one for Aunt H, and one for 
me. I’ll eat mine first, then I'll die and 
go up to heaven first. Then grandpa ’ll
eat his and he'll die and go to heaven,
then Aunt H ’ll eat hers, then she'll die,
then you'll eat yours and we'll all be up 
there together. Won’t that be nice, 
grandma?” (1892, p. 383). 


In stating a rationale for his amassing 
of such material, Russell (1892) wrote 
that “. . . if the science of psychology is 
to be advanced on the objective side, as
need for vast masses of facts, which for
their multitude can only be collected by 
an army of investigators working in many 
quarters, under various conditions, and 
through a long period of time” (p. 346). At 
least a corporal’s guard of developmental 
psychologists will agree now that great 
quantities of facts, straight from nature, 
assembled from widespread and various 
life settings, and on the order of many in 
the Russell anecdotes, are still needed in 
developmental psychology. But we are 
concerned here with suitable ways to meet 
this need. 

Anecdotal reports lack the limiting 
objective principle of observing and re- 
cording in sequence new behavioral events 
as they appear in the life continuum of 
one child. Nor are they limited by any 
clear rules. Records that accept anything 
and everything arresting in the behavior 
of any child as fair game can yield at 
most a miscellany of happenings that de- 
pend for their content upon spur-of-the- 
moment interests and implicit theories of 
the observer. It is for this reason, we sug- 
gest, that the proscribing charge of sub- 
jective bias, so often brought against nar- 
rative records in general, can be sustained 
against these easy reports. 


SPECIMEN DESCRIPTION 


This method begins with the sched- 
uled and continuous observing and nar- 
rative recording of a behavior sequence 
under chosen conditions of time and life 
setting. A child to observe and a time 
and particular place in which to ob- 
serve are selected to suit special inter- 
ests. But the observer in the field is 
in no way deliberately selective. He is 
instead deliberately unselective in the 
sense that he aims to make a faith- 
ful record of “everything” as it comes 
in the behavior and situation of the
child. The obtained record may be ex- 
hibited as a behavior documentary. But 
that seldom is all. The recorded mate- 
rial is usually subjected to study that 
varies among different applications from 
ever so free interpretation to somewhat 
strict and thoroughgoing quantitative 
analysis. 

The end results of specimen descrip- 
tion depend upon its field records, 
which are therefore considered first and 
independently. These behavior ac- 
counts, to add another terminological 
note, have often been called “diary 
records.” Their narrative form under- 
standably suggests this identification, 
and enough of them pieced together 
would make a diary. But they neither 
select nor span developmental changes 
in behavior, and, part by part, they 
are far more intensive than typical 
diary descriptions. These records have 
been made and studied, not in the con- 
texts of individual life histories, but in 
the temporally more limited contexts of 
particular life settings. They provide 
lasting specimens of the behavior and 
immediate situations of children in these 
settings. Accordingly, following Barker 
and Wright (1949), we shall call them 
specimen records, from which the name 
of the method as a whole follows. “Let 
us now get a segment thus-and-so long
and as nearly complete as possible from 
the behavior continuum of child X in 
habitat Y. Then, later, we shall see what 
can be done with it.” Such is the main 
idea of the specimen record. 

Records of this type have differed a 
good deal in setting or locale. They have 
described behavior of children at home 
(Dresslar, 1901; McFarland, 1938; Lafore,
1945), in nursery schools (Andrus,
1924; Berne, 1930; Isaacs, 1930, 1933;
Moore, 1931; Shallit, 1932; Fales, 1937a,
b; Slater, 1939; Tucker, 1940; Fite, 1940;
Woodcock, 1941; Hartley, Frank, and
Goldenson, 1952), in an elementary
school (Biber et al., 1942), in a summer
camp (Gump, Schoggen, and Redl,
1957), and in communities-at-large
(Barker and Wright, 1951, 1955; Gump
and Schoggen, 1955b). The recorded
sequences have ranged in length from
“moments snatched” (in fact, irregularly
scheduled periods lasting usually
a few minutes: Isaacs, 1930, 1933),
through the duration of a “baby’s morning”
(Dresslar, 1901), up to a full day
(Barker and Wright, 1951; Gump and
Schoggen, 1955a, 1955b). Sequences
that span the natural time limits of classroom
sessions, parties, and other such
come-and-go settings also have been
recorded (Barker and Wright, 1949;
Gump and Sutton-Smith, 1955).

The recording technique and in- 
tended content of specimen records are 
evident in instructions to observers who 
have made them. Below, then, is a par- 
tial abstract of pooled directives on ob- 
serving and recording from Stone 
(1941), Biber and associates (1942), 
Hartley, Frank, and Goldenson (1952), 
Barker and Wright (1955), and other 
investigators. 


• Begin in reporting each observed sequence
with a description of the scene,
the actors, and the ongoing action. Report 
throughout in everyday language. Describe 
the situation as fully as the behavior. Thus 
include “everything the child says and 
does”; but include also “everything said 
and done to him.” 


Describe the larger “adaptive actions”
of the child, but weave in as well the
“hows” of these actions as far as possible.
“Nonadaptive aspects of behavior” (Stone,
1941) are important on this account. These
include hunchings, turnings, twistings, and
the like, which often illumine such properties
of behavior as intensity, affect, and
efficiency.

Do not substitute interpretations that
generalize about behavior for descriptions
of behavior, but add such interpretations
when they point to possibilities of fact that
might otherwise be missed. Segregate
every interpretation by some such device
as indentation or bracketing. Straight reporting
must be left to stand out.


Procedures in recording that have 
been followed and recommended offer 
several possibilities: 


• Notes on the scene of observation,
which obviously are needed for sufficiently 
detailed and accurate description, can be 
kept in improvised shorthand. These field 
notes can be enlarged in writing im- 
mediately after each observation period, 
or they can serve then as a base for a 
dictated narration of the observed be- 
havior sequence. Also, a co-worker can 
hear this account through and at once 
question the observer where it is thin or 
unclear. The original dictation plus the 
interrogation and the observer’s responses 
can be sound recorded. All of the re- 
corded material can then be copied and 
revised in an improved running account. 
Observations can be timed to permit 
various measures of duration. Timing of 
the field notes at intervals of approximately 
one minute or even thirty seconds (Jersild 
and Fite, 1939; McFarland, 1938; Fite, 
1940) has been found practicable. When 
long records are made, observers can 
work in rotation; and the time of each 
observing period can be regulated to 
minimize effects of fatigue upon ability to 
see and remember an always fast train
of events.


The following is a part of a specimen 
day record from Gump and Schoggen 
(1955a), who were joined in the recording
process by three others. All five
observers took turns of thirty minutes 
each, in rotation, from 7:23 A.M. to 
9:04 P.M. of the recorded day. The ob- 
servations were recorded by the dic- 
tate-interrogate-revise technique, as de- 
scribed above. The setting is a summer 
camp, and the subject is 9-year-old
Wally, whom the first observer finds 
asleep in his cabin. 


7:23 Wally rolls over in the bed, 
opens his eyes, props himself up on 
one elbow, and looks with a sort of be- 
wildered stare at me. 

I am across the room from him. 
Then he smiles briefly, as if he has 
caught on to what is going on. 


Noting O 


He says to me, but also to others 
around, looking at them intermit- 
tently, but more at me, “Gee, do you 
know what happened to me last 
night?” 

He says this again with more earnest- 
ness, “Do you know what happened 
to me last night? I had a dream.” 
Wally now sits straight up in the 
bed and begins to tell his dream in a 
serious, impressed tone of voice. 

“I dreamed that my Dad and I were 
riding in this big truck and we were 
coming down this hill and the driver, 
he was knocked out, and we couldn’t 
drive the truck and the truck was 
going faster and we landed in the 
river with a big crash! Jeeze!” 

I nod but nobody else makes any 
comment on this dream, which was so 
vivid in Wally’s mind when he awoke. 


Telling About Dream 


Specimen records do not all read like 
this one. Some are less detailed. Others 
hold back more on partly inferential
observing; a few generalize more about 
whys. But all have the common base 
of sequential, unselective, plain, narra- 
tive description of behavior with some 
of its conditions. 

A researcher who reads one of these 
running accounts is almost sure to ask 
whether specimen records are reliable. 
Several studies have approached this is- 
sue via tests that show level of agree- 
ment between simultaneous and inde- 
pendent observers. One finds among results
of these tests agreement per cents¹
of 92 for notation of “social contacts 
... between two or more children” 
(Jersild and Fite, 1939); 95 for nota- 
tion of “verbal or physical communi- 
cation between parent and child” (La- 
fore, 1945); 90 for notation of “large 
activities . . . in the . . . development 
of interests” (Tucker, 1940); 91 to 95 
for notation of “major body shifts,” 
“social contacts . . . or attempts at so- 
cial contacts,” and “extraneous or mar- 
ginal events” (e.g. “departure of the 
teacher”) as factors in “episodes of ac- 
tion” (Biber et al., 1942); 84 for nota- 
tion of definite “breaks” in the “con- 
tinuity of ... play” (Slater, 1939). 
Comparable observer agreement has 
been found where specimen narration 
is used as a recording technique in time 
sampling studies. Jersild and Markey 
(1935), Markey (1935), Nesbitt (1943), 
and Heathers (1955) present examples. 


¹The agreement per cent is a common
measure in observational child studies. It has
been calculated by formulas that differ some
in stringency and yet give comparable values.
Examples of these formulas are in Arrington
(1932a), Festinger and Katz (1953), Barker
and Wright (1955), and Marshall and McCandless
(1957a).
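The agreement per cent can be illustrated with a minimal sketch. The two formulas below are assumptions chosen for illustration and are not taken from any of the sources cited in the footnote; they only show how two calculations can differ in stringency on the same pair of observer records. The event format, a set of (minute, label) pairs, and the function names are likewise hypothetical.

```python
def agreement_percent(observer_a, observer_b):
    """Stricter form: agreements / (agreements + disagreements) * 100.

    Each argument is a collection of hashable event notations
    (hypothetical format), e.g. (minute, "social contact").
    An agreement is a notation made by both observers; a disagreement
    is a notation made by only one of them.
    """
    a, b = set(observer_a), set(observer_b)
    agreements = len(a & b)
    disagreements = len(a ^ b)
    total = agreements + disagreements
    if total == 0:
        raise ValueError("no events noted by either observer")
    return 100.0 * agreements / total


def agreement_percent_lenient(observer_a, observer_b):
    """More lenient form: 2 * agreements / (notations by A + notations by B) * 100."""
    a, b = set(observer_a), set(observer_b)
    if not a and not b:
        raise ValueError("no events noted by either observer")
    return 100.0 * 2 * len(a & b) / (len(a) + len(b))


# Two observers note social contacts by minute; they agree on three
# notations and each records one that the other does not.
first = {(1, "contact"), (2, "contact"), (3, "contact"), (4, "contact")}
second = {(1, "contact"), (2, "contact"), (3, "contact"), (5, "contact")}
print(agreement_percent(first, second))          # 60.0
print(agreement_percent_lenient(first, second))  # 75.0
```

The same pair of records yields a different per cent under each formula, which is why a reported figure is best read together with the formula behind it.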


So far so good. But these findings do 
not warrant a broadside conclusion that 
specimen records are reliable. This is a
difficult problem. Such records need as
many reliability coefficients as they
contain classes of statements about different
facets of behavior and situation.
Their reliabilities must differ from one
to another of such categories as things
used, persons encountered, games played,
jokes told, tasks undertaken, ideas expressed,
dangers met, fears shown, competitions
entered, failures suffered,
problems solved, successes enjoyed,
moods displayed, dreams related, utterances
made in “a serious, impressed tone
of voice” (see above what is said about
what Wally said), and so on through a
great list of things that everyone wants
to learn about and that many investigators
ask and write about on the basis of
material in specimen records. What this
comes to is that the many needed coefficients
demand analytical techniques
for the identification and measurement
of these and other behavior and situation
variables. Likely techniques are
mentioned later. Each uses a limited
number of descriptive categories. Large
inroads on this problem can be made
only by applying such categories in
great number, one by one, in tests
of agreement between independent observers.
Of course, a choice can be
made to use only the parts of a specimen
record for which reliability has
been firmly established, although it is
hard to find a precedent for caution so
aseptic, and some would consider this
wasteful.

Specimen records cannot now claim
direct proof of reliability for all of
their applications. Yet some things remain
to be said for a generous working
trust in them. The balance of social
experience indicates that their method 
of narration in conventional language, 
the standard method of recording be- 
havior in communications of man with 
man, can be depended upon as a way 
to register faithfully many kinds of be- 
havioral data. So, too, does the more 
immediate experience of reading such 
a record as the one sampled above 
through consecutive descriptions by 
different observers of the same child; 
for this practical test gives prima facie 
evidence that these observers were in- 
deed seeing the same child and describ- 
ing the same stream of behavior. Also, 
systematic analysis of the recorded ob- 
servations has shown meaningful and 
internally consistent relationships be- 
tween many behavior and situation 
variables. One would not expect such 
results from analysis of unreliable ma- 
terial. 

Whatever the exact limits of their 
accuracy, specimen records have assets 
that encourage making the most of 
them without shackling anxiety about 
just how accurate they are. They re- 
mind one here of how long scientists 
made good progress with thumbs and 
sun dials before they had dependable 
thermometers and clocks. 

First, these records can register al- 
most everything observers can see of 
behavior. This follows from the range, 
variety, wealth, and subtlety of plain 
words—called to attention above by 
diary description. It does seem doubt- 
ful that any instrument short of a 
sound-movie camera can surpass lay 
language as a means of recording the 
range, variety, wealth, and subtlety of 
behavior itself. This tool has been suited 
to the purpose by centuries of daily 
practice. Allport and Odbert (1936)
tell us that there are in English 17,953 
words for as many ways of behaving. 
Recording behavior with only a much 
more limited set of symbols may some- 
times gain less in reliability and appar- 
ent precision than it loses in validity 
and genuine refinement.

Specimen records describe behavior 
in context by recording behavior with 
situation. Try to make one without 
bringing in situational factors. It is in- 
teresting that good sense forbids this. 
One simply does not report that “John 
said ‘Ouch’” without saying he was 
stuck with a pin, say. Or report that
Mary behaved aggressively or submis- 
sively or affectionately without iden- 
tifying the person objects of her ag- 
gression or submission or affection or 
without saying something about their 
treatment of Mary. Everyone knows, 
at least intuitively, that the meaning 
and significance of an action and 
even its occurrence depend directly 
(if not primarily, as for Lewin) upon 
the coexisting situation. So, with no 
check lists or rating scales or precoded 
categories to confine him, an observer 
who makes a specimen record naturally 
works the situation into his running account,
moment by moment and action
by action. Specimen records differ here. 
Some are richer than others on the situa- 
tion side (on the action side, too, for 
that matter), but none reports behavior 
alone, whereas records made by closed 
methods of direct observation often do 
this. 
For developmental psychology, with
its interest in behavioral change over
years rather than minutes, hours, or
days, specimen records tell less about
the continuity of behavior than diaries.
Yet continuous accounting for behavior 
and situation is a first requirement of 
their recording technique. The result is 
that these records present for analysis 
finely woven strands of successive units 
and conditions of behavior. A further 
result is that they permit intensive study 
of sequence relations in behavior. Given 
S (a child) and O (a mother), what 
may S do to O at noon if at 11:30 O 
spanks S? And how may the relations 
of the two develop and change from 
moment to moment in the intervening 
time? Specimen records permit system- 
atic answers to such questions. Here is 
a much neglected opportunity, although 
a paper by Barker and others (1955) 
shows promise in it. 
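A question of this sequential kind can be put to a coded record in a few lines. The sketch below is only an illustration under stated assumptions: the event format (minute of day, actor, target, action), the action labels, and the function name are all hypothetical, and the record is presumed to have been coded into such events beforehand.

```python
from collections import Counter


def following_actions(events, antecedent, window_minutes):
    """Tally what the target does back to the actor after an antecedent action.

    events: list of (minute, actor, target, action), sorted by minute.
    antecedent: (actor, target, action) to look for, e.g. ("O", "S", "spanks").
    window_minutes: how long after the antecedent to keep looking.
    """
    counts = Counter()
    for i, (t0, actor, target, action) in enumerate(events):
        if (actor, target, action) != antecedent:
            continue
        for t1, actor1, target1, action1 in events[i + 1:]:
            if t1 - t0 > window_minutes:
                break  # events are time ordered, so nothing later qualifies
            if actor1 == target and target1 == actor:
                counts[action1] += 1
    return counts


# Minutes are counted from midnight: 690 is 11:30, 720 is noon.
record = [
    (690, "O", "S", "spanks"),
    (695, "S", "O", "cries"),
    (700, "O", "S", "comforts"),
    (720, "S", "O", "avoids"),
    (730, "S", "O", "hugs"),
]
print(following_actions(record, ("O", "S", "spanks"), 30))
```

With a thirty-minute window, the tally covers what S does to O from 11:30 through noon and leaves out the later event.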

Another possibility is that specimen 
recording could be done along with 
diary recording in work on problems of 
continuity and sequence relations in be- 
havior. A fantasy which the reader may 
pardon is that of an eight-volume work 
about a child, C: Volume I, a specimen 
day record, C’s First Day; Volumes II 
through VII, each a specimen day rec- 
ord made near a birthday (two of 
which, note, would occur in school 
years); Volume VIII, a comprehensive 
diary record, C’s First Seven Years. 
Teachers and students as well as re- 
searchers could use the recorded material.

A specimen record preserves much 
of behavior as it appears without either 
the omissions of selective observation or 
the alterations of theory. It therefore 
goes far to secure whatever advantages 
theoretically neutral data have to con- 
fer upon a discipline with more guesses 
than facts. One is encouraged here by
the fact that about every science but
psychology has gone to great pains to
store such data. Natural history mu- 
seums, herbaria, astronomical charts and 
photographs, and anthropological ar- 
chives give examples. 

Finally, like diary records and like
materials in the museums and archives
of other sciences, specimen records have
permanency. They will keep as long as
common words retain their standard
meanings, and they can be studied for
that long on behalf of innumerable interests.
The same cannot be said for the
more typical records of closed observa- 
tional methods. A marked and analyzed 
check list, for example, may as well be 
thrown away, whereas a written and 
analyzed specimen record can be put 
back in the files and used over and over 
again. In fact, specimen records increase 
in value with age. Consider that the 
older they get the greater the number 
of possible comparisons between them 
and other available records for the eco- 
logical purpose of studying change 
from time to time and place to place 
in children’s psychological living con- 
ditions and behavior. 

Specimen records provide in no way 
beforehand for quantitative description. 
Making them must seldom be an effi- 
cient way of getting only the data 
needed to test a particular hypothesis. 
They are costly in research time and 
manpower. These can be serious prac- 
tical drawbacks that one should weigh 
carefully before undertaking to use 
specimen records on any considerable 
scale in a research program. Yet it seems 
to us that a research tool which records 
behavior (1) comprehensively, (2) sen- 
sitively, (3) in context, (4) in continuous
lengths, (5) without theoretical
bias, and (6) with permanence ought 
to meet needs of many problems in 
child development. 

What besides exhibiting them can be 
done with these records? This is to ask 
about analysis procedure as a stage in 
specimen description. 

Differing kinds of qualitative speci- 
men study are to be found in the litera- 
ture. One consists in educing from the 
recorded material chiefly normative 
generalizations about the behavior of 
classified children in particular life set- 
tings. Biber and associates (1942), for 
example, draw “pictures” of “seven- 
year-oldness” from specimen records 
(in their terminology, “contextual rec- 
ords”) on the behavior of 7-year-old 
children in the settings of “reading, 
writing, painting, clay, shop, rhythms, 
and roof play” of an urban school as a 
“psychological community”; and the 
same general type of analysis is illus- 
trated by the earlier mentioned research 
of Woodcock (1941) and Hartley and 
associates (1952). Another sort of quali- 
tative study here consists in searching 
among the specimen facts of behavior 
and situation for properties and laws of 
basic psychological processes. Susan 
Isaacs has contributed cases in point by
using specimen records on the behavior 
of young children in the daily situations 
of an experimental school to develop 
theoretical formulations of interdepend- 
encies in intellectual functioning (1930) 
and social behavior (1933) at different 
stages of growth. 

These measurement-free undertakings 
give impressions that specimen records 
are rich in behavioral data and that 
scrutiny of these data without benefit 
of numbers reveals much of interest
that no presently available techniques of 
measurement seem likely to pin down. 
They often support a belief that it 
would be unfortunate if there were no 
child psychologists, somewhere be- 
tween the novelists and the laboratory 
experimenters, with liking and talent for 
research that first describes carefully 
and then only classifies and interprets 
without the narrowing strictures of 
quantification. The cited illustrative 
studies at least document the behavior 
of children authentically; and it would 
seem bad policy for child psychology 
to relinquish this documenting func- 
tion to literary and journalistic ob- 
servers. 

Efforts to quantify the raw material 
of specimen description have been made 
with a number of analytical schemes 
that yield scores on variables of be- 
havior or situation, or both, from rec- 
ords on individual children. We can 
merely identify rudiments in some of 
these schemes as they apply to the 
single record, which has often spliced 
together running accounts of sequences 
observed at different times. Only basal 
measures are represented. 


1. The record as a whole is scanned as a 
geologist might study an intact rock speci- 
men. Then one of two procedures is 
followed. (a) Ratings are made on the 
prevalence of various ways of behaving, 
and scale values of these ratings are 
summed. Andrus (1924), to take a pioneer 
example, applies to records on children in 
a nursery school an inventory of 410 
questions, such as: Does the child giggle? 
Use adverbs? Walk steadily? Try to boss? 
Answers of very often, sometimes, never 
to these questions give values of two, one, 
and zero, which are added in scores on 
emotional, mental, motor, and social-moral
divisions of the inventory. (b) Straight 
frequency counts are made on different 
kinds of behavior. McFarland (1938) tallies 
the instances per record of submission, 
resistance, helping, giving, and other such 
responses to one another by 3-to-6-year- 
old sisters at home. Slater (1939) counts 
changes of activity, play materials used, 
words spoken, nonverbal contacts with 
other children, contacts with adults, oc- 
currences of associative play, and a num- 
ber of other interesting things in records 
of uniform length (125 minutes) on 2-to- 
3-year-olds of a nursery. 
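The rating procedure under (a) amounts to simple arithmetic, sketched here under assumptions. The rating values of two, one, and zero follow the description above; the assignment of each sample question to a division of the inventory is a guess made only for illustration, and the names in the code are hypothetical.

```python
# Values for the three answers, as described in the text.
RATING_VALUES = {"very often": 2, "sometimes": 1, "never": 0}

# Hypothetical assignment of sample questions to inventory divisions.
DIVISION_OF = {
    "giggles": "emotional",
    "uses adverbs": "mental",
    "walks steadily": "motor",
    "tries to boss": "social-moral",
}


def division_scores(answers, division_of=DIVISION_OF):
    """Sum rating values per division for one child's record.

    answers: dict mapping each question to "very often", "sometimes", or "never".
    """
    scores = {}
    for question, rating in answers.items():
        division = division_of[question]
        scores[division] = scores.get(division, 0) + RATING_VALUES[rating]
    return scores


child = {
    "giggles": "very often",
    "uses adverbs": "sometimes",
    "walks steadily": "very often",
    "tries to boss": "never",
}
print(division_scores(child))
# {'emotional': 2, 'mental': 1, 'motor': 2, 'social-moral': 0}
```

A straight frequency count, as under (b), would replace the three-step rating with a tally of instances per record.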


Sarason, Davidson, Lighthall, and Waite 
(1958) analyze specimen records of a sort by 
an interesting technique that includes scanning 
of the intact account. Their subjects are chil- 
dren in classrooms of grades 2 through 5. Each 
child is observed for one hour, whereupon the 
observer first records narratively “the major 
events” in the behavior sequence and then 
draws “as integrated a personality description 
as... possible” from the summary record. 
Analysts count the number of children who 
are characterized thus and so on variables 
that include need for achievement and aca- 
demic ability. Also, tallies are made of refer- 
ences to nail biting and other such behavior 
in the uninterpreted record. But for making 
these tallies, the investigators (1) watch be- 
havior for a while, (2) record it in summary 
fashion, and then (3) extract from the record 
judgments about the behaving children. This 
seems adventuresome. Reliability of observa- 
tion is affirmed on the ground that inde- 
pendent observers do very well at blind 
matching of their total reports on the same 
subjects. But could such records be so ex- 
tremely unreliable as not even to give clues 
that would nearly guarantee success in such 
matching? What one would like to know is 
whether the particular observations behind 
judgments on particular variables, academic 
ability, say, are reliable. 


2. The record is divided into uniform 
short time intervals, much as if the analyst
were a time sampler in the field. Different
procedures have been followed here also. 
(a) Duration scores are obtained by 
summing the time spans of selected re- 
curring behavioral phenomena. Thus 
McFarland adds the thirty-second intervals 
during which paired sisters of the study 
just cited engage in such interaction as 
sharing, dramatic play, and cooperation. 
(b) Frequency scores are obtained by 
summing the number rather than the time 
of intervals that encompass certain phe- 
nomena, as when Jersild and Fite (1939) 
count all of the thirty-second lengths 
during which individual children in a 
nursery school enter into social contacts 
with their child associates. 
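Both kinds of interval score can be sketched in a few lines, assuming the record has already been cut into uniform intervals and each interval coded with the phenomena it contains. The data layout (a list of sets, one per interval), the labels, and the function name are hypothetical.

```python
def interval_scores(intervals, phenomenon, interval_seconds=30):
    """Frequency and duration scores for one phenomenon.

    intervals: list of sets, one set of coded phenomena per uniform interval.
    Frequency is the number of intervals that contain the phenomenon;
    duration is the summed time of those intervals.
    """
    hits = sum(1 for codes in intervals if phenomenon in codes)
    return {"frequency": hits, "duration_seconds": hits * interval_seconds}


# Four consecutive thirty-second intervals from a record on paired sisters.
coded = [
    {"sharing"},
    set(),
    {"sharing", "cooperation"},
    {"dramatic play"},
]
print(interval_scores(coded, "sharing"))
# {'frequency': 2, 'duration_seconds': 60}
```

With intervals of uniform length the two scores differ only by a constant factor, so the choice between them is mainly one of the unit in which results are reported.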

3. The record is divided into natural be- 
ginning-to-end units that are anchored 
to restricted psychological content and 
are therefore suited to study of phe- 
nomena within the chosen content limits. 
These units are treated as convenient ex- 
amination targets; and frequencies, in the 
total record, of different phenomena which 
they contain are determined. 

Lafore (1945) divides records on pre- 
school children at home into parent-child 
contacts that are delimited by the be- 
ginning and end points of “verbal or 
physical communications.” She then 
counts the instances per record, during 
these contacts, of such parental practices 
as bestows affection, blames the child, 
and praises the child and such acts of the 
children as helps the parent and cries. 
Tucker (1940) divides records on children 
in a cooperative nursery into situations 
that involve the child and his mother as 
associates in “the solution of a problem 
or the development of an interest.” She 
then determines the frequencies per record 
in these situations of control tactics by 
the mothers, such as urges, directs, im- 
pedes, warns, and of cooperation, re- 
sistance, combativeness, and other such 
reactions to these tactics by the children. 
4. The record is divided into natural beginning-to-end
units that override differences
in psychological content, hence
are suited to the study of diverse behavior 
and situation variables and their relation- 
ships. These units are treated as both 
examination targets and counting integers. 
Dimensional and discrete-item categories 
for description of situation and behavior 
variables are applied to each unit. Scale 
measures per unit and scores that tell the 
number of units in which different phe- 
nomena occur are derived. 

Barker and Wright (1955) divide speci- 
men day records on children in the settings 
of an American town into behavior epi- 
sodes, defined as units of action and co- 
existing situation, such that in each the 
action has a constant direction and falls 
within the behavior perspective (roughly, 
evident intention span) of the child. These 
units are in no way restricted to particular 
kinds of behavioral content. Noting O and 
Telling About Dream, as marked off by 
brackets in the margin of the record ex- 
cerpt on p. 85, are examples of episodes. 
Occurrence rates of these behavior-situa- 
tion units are studied. So also are their 
positional or structural relations, as when 
episodes overlap, so that the child does two 
or more things at once (pp. 225-302). In 
one application of the method (pp. 323- 
457; Wright, 1956) categories are used to 
describe various aspects of the social be- 
havior and situation of the child, and a 
larger set of categories (Wright and 
Barker, 1950) has also been developed for 
more comprehensive study of situation and 
behavior variables. Examples follow 
shortly. 
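The treatment of units as both examination targets and counting integers can be sketched as follows. This is not the Barker and Wright procedure itself, only a minimal illustration under assumptions: the episode times, the category labels, and the third episode are invented, and the names in the code are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Episode:
    title: str
    start: float  # minutes from the start of the record (hypothetical values)
    end: float
    categories: set = field(default_factory=set)


def units_with(episodes, category):
    """Number of episodes in which a given category has been marked."""
    return sum(1 for e in episodes if category in e.categories)


def overlapping_pairs(episodes):
    """Pairs of episodes whose time spans overlap, in order of start time."""
    ordered = sorted(episodes, key=lambda e: e.start)
    pairs = []
    for i, first in enumerate(ordered):
        for second in ordered[i + 1:]:
            if second.start >= first.end:
                break  # later episodes start later still
            pairs.append((first.title, second.title))
    return pairs


record = [
    Episode("Noting O", 0.0, 0.5, {"social"}),
    Episode("Telling About Dream", 0.4, 2.0, {"social", "verbal"}),
    Episode("Dressing", 2.0, 5.0, {"nonsocial"}),  # invented for illustration
]
print(units_with(record, "social"))  # 2
print(overlapping_pairs(record))     # [('Noting O', 'Telling About Dream')]
```

Scale measures per unit would be added as further fields on each episode and summarized in the same way.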


Space limitations forbid discussion of 
the ways in which these analytical 
schemes have been elaborated or of 
their outcomes in empirical findings. 
But some things about them which bear
on assessment of specimen description
in general can be mentioned. 


1. Independent analysts make similar 
discriminations when, as in schemes 3 
and 4, records are divided into psy- 
chologically defined units. 

2. Quite consistently high estimates 
of accuracy in applications of behavior 
and situation categories appear gener- 
ally when judgments of independent 
analysts are compared. 

3. Satisfactory indices of internal 
consistency are shown by measures of 
change over time in variables with pre- 
sumably high stability (Jersild and Fite, 
1939; Lafore, 1945). 

4. A diversity of behavioral variables 
usually is accommodated by each 
scheme. Some of the special advantages 
in multivariable research designs are 
represented in these procedures. 

5. The reported results meet well the 
common (and so often final) tests of 
intelligibility, congruence of independ- 
ent measures, and agreement with sense. 

6. Some of the findings also meet 
tests of agreement with outside criteria. 
To illustrate by an early result that may 
be as exemplary and suggestive in this 
respect as any, Andrus (1924) reports 
a correlation of .66 (±.09) between
specimen-record measures of intelli- 
gence level in play activities and scores 
on the 1922 Kuhlmann Revision of the 
Binet. 


We turn now to application aims in 
specimen recording and analysis. Speci- 
men description, as a method that takes 
what comes in the field of observation, 
seems well suited to studies with the 
ecological aim of exploring and interrelating
spontaneous behavior and psychological
living conditions of children.
Curiously enough, however, the method 
has not often been applied for quite 
such a declared purpose. The Midwest 
research, cited under analysis scheme 4, 
has made considerable use of specimen 
records to this end and so have subse- 
quent studies of Gump and his col- 
leagues (Gump and Schoggen, 1955a,
1955b; Gump and Sutton-Smith, 1955a;
Gump, Schoggen, and Redl, 1957; and 
Gump, Sutton-Smith, and Redl, 1955). 
The ecological applications of specimen 
description to date, however, are only 
small beginnings whose values can be 
multiplied by comparative studies in a
variety of communities and cultures. 
The method has often been adapted 
to our systematic aim by investigators 
with primarily ecological interests. It 
offers here the possibility of exploring 
lawful relations between many and 
greatly differing variables of behavior 
and situation that are not easily caught 
in the same matrix by experiments or 
closed observational procedures. Labo- 
ratory strings, especially, cannot readily 
be pulled to bring under quantitative 
assay within, say, a day of a child’s life 
such conditions of psychological situa- 
tion as bona fide affection, aggression, 
dominance, submission, resistance, ap- 
peal, nurturance, and avoidance by a 
mother, father, brother, neighbor,
friend, or any such associate; waxing
and waning in the social power of 
others; varying kinds and degrees of 
social pressure, mood, and evaluation 
by associates, together with concomi- 
tant values for all of these in the be- 
havior of the child, plus changes in 
cooperation, competition, conflict, and 


related forms of social interplay be- 
tween the child and interacting persons. 
This has been done after a fashion by 
the method of specimen description. Re- 
lations between simultaneous behavior 
and situation factors have been investi- 
gated, and sequential interdependencies 
could be studied by the same proce- 
dures. 

We have referred earlier to norma- 
tive applications of specimen descrip- 
tion. A thing to be considered here is 
that some arrays of norms in develop- 
mental psychology are so characteris- 
tically like time tables and other dry 
civil accounts. Despite their great use- 
fulness, they often look thin against the 
landscape of child behavior at any age.
Work cited earlier here is pertinent.
Thus the Biber team’s “pictures . . . of
seven-year-oldness,” Woodcock’s simi-
lar descriptions in Life and Ways of the
Two Year Old, and the portrayals of
young children at play by Hartley and
associates help to fill in, an age level at
a time, some enormous gaps in the
orderly normative summaries from psy-
chological tests and laboratory meas-
ures. These descriptions frequently sug-
gest also that specimen accounts of
behavior, even if they were only to be
kept on file, should often be good for
idiographic purposes; and one finds in-
deed that they have at times been so
applied (Axline, 1951; Reckless, 1929;
and Slater, 1939).


TIME SAMPLING 


This closed procedure fixes attention 
of observer and analyst upon selected
aspects of the behavior stream as they
occur within uniform and short time
intervals. The length, spacing, and num-
ber of intervals are intended to secure
representative time samples of the tar- 
get phenomena. As a rule with excep- 
tions, descriptive categories are coded 
in advance for quick and precise judg- 
ments in the field and later efficient 
scoring. The subjects are invariably as- 
sembled children of a school setting or 
other institutional environment. Chil- 
dren en masse have been observed in a 
few studies (Olson, 1929). Usually, 
however, the observer proceeds in ro- 
tation from child to child through the 
scheduled intervals of time. 

Here is a method of some thirty years’
standing that appears to be indigenous
to research in child development. Its 
applications have indeed been numerous 
and varied. But we shall be concerned 
less with differences between these ap- 
plications than with their common fea- 
tures and with issues that turn upon 
basic traits of the method. These issues 
are often clearest in the earlier studies, 
which, therefore, are featured in the dis- 
cussion. We believe that getting into 
these issues would alone justify careful 
examination of the method. 


Groundworkers in time sampling, first 
named, as here, by John Anderson (in a 
foreword to Olson, 1929), include Olson 
(1929), Goodenough (1928), Parten (1932, 
1933a, 1933b), Thomas (1929, 1932), and,
among associates of Thomas, Barker 
(1930), Loomis (1931), Beaver (1932), and 
Arrington (1932a). The method evidently 
was applied first, to nervous habits in
school children, by Olson in 1929. Cri-
tiques with review of the literature have
been made by Olson and Cunningham 
(1934) and Arrington (1943). See also, 
for references and appraisal, Goodenough 


and Anderson (1931), Heyns and Lippitt
(1954), Murphy and Murphy (1935),
Arrington (1939), Jersild and Meigs
(1939), Sells and Ellis (1948, 1951), and
Gellert (1955). Applications continue in
current journals after reaching a fre-
quency peak in the 1930s.


It will be convenient here to follow
routinely the scheme of Table 3-2 for
description of observational methods.
This scheme is applied to illustrative
time sampling studies in Table 3-5,
which also presents examples of results
from each application.

Consider first that the range of time-
sampled behavioral phenomena is wide.
So the variables under material cover-
age in Table 3-5 suggest and so do time
samplings of imaginative behavior
(Markey, 1935), friendships (Challman,
1932), friendships and quarrels (Green,
1933a), techniques of successful social
interaction (Mallay, 1935b), “approach-
withdrawal patterns” in intragroup be-
havior (White and Williams, 1939), re-
sponses of teachers to children (Olson
and Wilkinson, 1932), resistance and
acquiescence in social transactions
(Caille, 1933), interactions of adoles-
cents in clubs of differing class level
(Maas, 1954), thumbsucking in infants
(Heering, 1932), and tics or “nervous
habits” (Olson, 1929; Koch, 1935). One
might expect to come upon the com-
paratively small units of molecular be-
havior the more often in short time
samples. But it turns out otherwise.
After Olson’s study of nervous habits,
time samplers usually have worked with
comparatively large phenomena of mo-
lar behavior. Also, as Table 3-5 sug-
gests, these phenomena are predomi-
nantly social in the specific sense of
their involving the child in interactions
with peers and adults. 

This method is defined primarily by 
its temporally uniform observation in- 
tervals. These time units have varied 
remarkably in duration from one study 
to the next. Note among the examples 
in Table 3-5: one minute for recording 
of solitary through parallel to coopera- 
tive play, five minutes for facial and 
vocal expressions, and three minutes for 
emotionally dependent and independent 
actions. One finds a total range of five
seconds (Challman, 1932) to twenty 
minutes (Manwell and Mengert, 1934) 
per observation unit. Consistent rela- 
tionships between length of interval 
within this range and kind of behavior 
observed are not apparent. Usually, 
spacing of the intervals is regulated pre- 
cisely. Many investigators schedule no 
more than one unit per child per day 
and keep all observations within certain 
diurnal limits. Units in a day rarely ex-
ceed two or three, and these are widely
separated by observing the subject 
children in rotation. Periods of five 
minutes or less are commonest. The 
typical time sampling schedule is a 
series of far-between flashes on a be- 
havior stream that otherwise flows in 
the dark. 
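A schedule of the kind just described can be sketched in a few lines. The sketch below is only an illustration under assumed values: the child labels, the five-minute interval, the starting hour, and the rule of one unit per child per day are hypothetical choices, not the design of any cited study. It rotates the observer from child to child and shifts the starting child each day so that no child is always watched at the same time of day.

```python
from datetime import datetime, timedelta


def rotation_schedule(children, start, interval_minutes, days):
    """Return (day, child, begin, end) tuples, one unit per child per day.

    The starting child is advanced by one position each day (an assumed
    counterbalancing rule) so that time of day is spread across children.
    """
    schedule = []
    n = len(children)
    for day in range(days):
        shift = day % n
        order = children[shift:] + children[:shift]
        begin = start + timedelta(days=day)
        for child in order:
            end = begin + timedelta(minutes=interval_minutes)
            schedule.append((day + 1, child, begin, end))
            begin = end
    return schedule


# Hypothetical example: three children, five-minute units, two days.
plan = rotation_schedule(["A", "B", "C"], datetime(1960, 1, 4, 9, 0), 5, 2)
for day, child, begin, end in plan:
    print(day, child, begin.strftime("%H:%M"), end.strftime("%H:%M"))
```

Each child here receives a single short unit per day, which makes concrete how little of the behavior stream any one schedule of this type illuminates.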

Coded observation guides of time 
sampling characteristically prime and 
yet restrain the observer with memo- 
rized symbols, check lists, rating scales, 
or graphic recording devices that can- 
not be reviewed quickly enough to suit 
this space. Some of these tools present 
difficult problems in categorizing and 
scaling that go beyond observational 
methods in psychology, and others 
apply only to target phenomena of par- 


ticular investigations. Narrative speci-
men recording takes the place of cod-
ing on the spot in some applications.
The studies of Bott and of Heathers,
in Table 3-5, give examples. So also do
time samplings of behavior in “artistic
and inartistic” children (Dow, 1933), of
child-with-child conflicts (Jersild and
Markey, 1935; Roff and Roff, 1940),
of children’s imaginative behavior
(Markey, 1935), and of social transac-
tions between teacher and child (Nes-
bitt, 1943). Parten (1932: Table 3-5)
and a few others (Swan, 1938: Table
3-5) have supplemented code recording
by brief summary narration.

Scoring operations of the method are
usually built into its procedures in data
collection. Either the time intervals
themselves or the observed phenomena
can be used as counting integers, and
occurrence time also can serve as a 
measure. Hence one finds among basal 
scores (1) the N of intervals in which 
a target phenomenon occurs, (2) the 
total N of occurrences, (3) the N of 
occurrences per interval, (4) the total 
occurrence time, and (5) the occur- 
rence time per interval. Scale values 
enter into certain measures. Represent- 
ative scoring possibilities are exempli- 
fied with variations and refinements in 
Table 3-5. The tabled findings clarify 
different scores and may also tell 
enough for present purposes about out- 
comes of the method. 
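The five basal scores listed above can be computed from one simple record. The following is a minimal sketch, assuming a hypothetical record format in which each observation interval is entered as a list of the durations, in seconds, of the occurrences seen in it. The function name and the sample figures are illustrative only and come from no cited study.

```python
def basal_scores(intervals):
    """Compute the five basal scores from a list of observation intervals.

    Each interval is a list of occurrence durations in seconds; an empty
    list means the target phenomenon did not occur in that interval.
    """
    if not intervals:
        raise ValueError("at least one observation interval is required")
    n_intervals = len(intervals)
    total_occurrences = sum(len(occ) for occ in intervals)
    total_time = sum(sum(occ) for occ in intervals)
    return {
        # (1) N of intervals in which the phenomenon occurs
        "intervals_with_occurrence": sum(1 for occ in intervals if occ),
        # (2) total N of occurrences
        "total_occurrences": total_occurrences,
        # (3) N of occurrences per interval
        "occurrences_per_interval": total_occurrences / n_intervals,
        # (4) total occurrence time
        "total_occurrence_time": total_time,
        # (5) occurrence time per interval
        "occurrence_time_per_interval": total_time / n_intervals,
    }


# Hypothetical record: four intervals for one child.
record = [[12, 30], [], [45], [5, 5, 20]]
scores = basal_scores(record)
for name, value in scores.items():
    print(name, value)
```

Scores (1) to (3) need only tallies, whereas (4) and (5) require that each occurrence be timed, which is one reason the simpler counts are easier to secure in the field.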

Circumspect and occasionally prodi- 
gious effort (Arrington, 1932a) to test 
reliability of observation is a mark of 
time-sampling studies. Several papers 
(Thomas, 1932; Arrington, 1932b; Bott, 
1933; Robinson and Conrad, 1933; 
Berne and Kelly, 1934) deal primarily 
with this problem. Investigation of re-
liability or accuracy of observation by
measurement of agreement between 
simultaneous and independent observers 
early became a staple phase of this re- 
search, with results that indicate two 
conclusions.


1. The level of interobserver agree- 
ment generally has been “good,” “ac- 
ceptable,” “adequate,” or “satisfactory.” 
Agreement levels are scored by per- 
centages or rank-order coefficients. 
With allowances made for some cate- 
gories in a few studies, the obtained 
values do not fall enough below 90 or
.9 to justify serious reservations about
the reported data. 

2. The agreement percentages and 
coefficients do not differ with any evi- 
dent consistency according to size, 
complexity, sociality-nonsociality, per-
ceivability-with-the-naked-eye, seeming
“subjectivity,” or any other character-
istic of the observed behavior.*
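The two kinds of agreement index mentioned here, percentages and rank-order coefficients, can be illustrated as follows. This is a sketch under stated assumptions: the percentage is taken as the share of intervals in which two simultaneous observers entered the same code, which is only one of several conventions found in the literature, and the rank-order coefficient is computed as Spearman's coefficient with tied scores given their mean rank. The codes and totals are invented for illustration.

```python
from math import sqrt


def percent_agreement(codes_a, codes_b):
    """Percentage of intervals in which two observers entered the same code."""
    if len(codes_a) != len(codes_b) or not codes_a:
        raise ValueError("records must be non-empty and of equal length")
    matches = sum(1 for a, b in zip(codes_a, codes_b) if a == b)
    return 100.0 * matches / len(codes_a)


def average_ranks(values):
    """Rank values from 1 upward, giving tied values their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        stop = start
        while (stop + 1 < len(order)
               and values[order[stop + 1]] == values[order[start]]):
            stop += 1
        mean_rank = (start + stop) / 2 + 1
        for position in range(start, stop + 1):
            ranks[order[position]] = mean_rank
        start = stop + 1
    return ranks


def rank_order_coefficient(scores_a, scores_b):
    """Spearman rank-order coefficient: Pearson r computed on the ranks."""
    ra, rb = average_ranks(scores_a), average_ranks(scores_b)
    mean_a, mean_b = sum(ra) / len(ra), sum(rb) / len(rb)
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(ra, rb))
    var_a = sum((x - mean_a) ** 2 for x in ra)
    var_b = sum((y - mean_b) ** 2 for y in rb)
    if var_a == 0 or var_b == 0:
        raise ValueError("coefficient is undefined when all scores are tied")
    return cov / sqrt(var_a * var_b)


# Hypothetical codes for ten intervals (S, P, C are arbitrary category labels).
observer_a = list("SPCCSPCSPC")
observer_b = list("SPCCSPCSPS")
agreement = percent_agreement(observer_a, observer_b)

# Hypothetical totals per child as scored by each observer.
rho = rank_order_coefficient([3, 7, 5, 9, 1], [2, 8, 5, 9, 1])
print(agreement, rho)
```

A percentage of this kind is sensitive to how many categories are in use and how often each occurs, so values from different coding schemes are not strictly comparable.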

Tests of data reliability or consistency 
over time also fortify most applications 
of time sampling. Correlations of odd 
with even interval scores or of scores 
from different series of intervals gen- 
erally have shown some measurement 
stability. Yet most of the reported 


*See especially for more documentation 
Olson (1929), VanAlstyne (1932), Beaver 
(1932), Arrington (1932a), Olson and Wil-
kinson (1932), Parten (1933a), Dow (1933),
Green (1933a), Hagman (1933), Robinson and
Conrad (1933), Caille (1933), Bott (1934), 
Jersild and Markey (1935), Markey (1935), 
Graves (1937), Swan (1938), White and Wil- 
liams (1939), Nesbitt (1943), Dawe, Ekern, 
and Berger (1949), Lippitt, Polansky, and 
Rosen (1952), Maas (1954), Walters, Pearce, 
and Dahms (1957), and Marshall and Mc-
Candless (1957a).
coefficients are not high. They ordi-
narily stand well below comparable
indices of psychological-test reliability. 
Disparities between the obtained coef- 
ficients and 1.0 are commonly ascribed 
above all to change over time in “situa- 
tional factors.” But steps to find out, 
sample by sample, what these factors 
are generally have not been taken.
Goodenough (1937) and others stress 
the importance in this connection of 
ascertaining “conditions of observa- 
tion.” Time samplers have reacted more 
often to unstable measures, however, by 
building up the N’s of their samples in 
ostensible hope of neutralizing these 
conditions while exhausting the normal 
range of variability in behavior. Agree- 
ment of findings with outside criteria 
has been demonstrated in some applica- 
tions (Hagman, 1933); and Marshall 
and McCandless (1957a: Table 3-5) use 
the method to validate other measures, 
namely, sociometric scores and ratings 
by teachers. 
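The odd-even check of consistency over time described in this paragraph can be sketched as below. The sketch assumes a hypothetical layout in which each child has one score per observation interval, in order of observation. It sums the odd-numbered and even-numbered intervals separately for each child and correlates the two sets of sums across children. The figures are invented, and no correction for halved length is applied.

```python
from math import sqrt


def odd_even_sums(scores_by_child):
    """Sum each child's scores over odd-numbered and even-numbered intervals.

    Intervals are numbered from 1, so list positions 0, 2, 4, ... are "odd".
    """
    odd = [sum(scores[0::2]) for scores in scores_by_child]
    even = [sum(scores[1::2]) for scores in scores_by_child]
    return odd, even


def pearson_r(x, y):
    """Product-moment correlation between two equal-length lists."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two lists of equal length with two or more values")
    mean_x, mean_y = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant scores")
    return cov / sqrt(var_x * var_y)


# Hypothetical tallies: four children, six intervals each.
tallies = [
    [1, 0, 2, 1, 1, 0],
    [3, 2, 2, 3, 4, 2],
    [0, 1, 0, 0, 1, 0],
    [2, 2, 1, 3, 2, 1],
]
odd, even = odd_even_sums(tallies)
r = pearson_r(odd, even)
print(odd, even, round(r, 3))
```

A low coefficient from such a check shows only that the scores shift between halves; it does not say which situational factors changed, which is the point pressed in the text.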

Time sampling has several attractions. 
It limits with exactitude observed con- 
tents as well as temporal lengths of the 
behavior stream. It permits systematic 
control by selection of phenomena to 
be observed and studied. It insures rep- 
resentativeness and reliability by re- 
cording large numbers of commensu- 
rable observations. It is economical of 
research time and effort. Its coding
schemes minimize equivocal judgments 
and prescribe definite ways to quantify 
whatever is observed. It goes far to 
achieve standardization of observer and 
analyst as measuring instruments. Yet 
the method has a number of character- 
istics that narrow its applications. 

First, time sampling is limited to 
problems of incidence. Its results show
that certain things occur with such and 
such absolute or relative frequency un- 
der specified conditions. These results 
show little of what the actions or situa- 
tions of children look like, of how they 
change from beginning to end, or of 
how their components are related. 
Moreover, even when incidence prob- 
lems are concerned, the method is prac- 
ticable only for study of events that 
happen often. Arrington concludes 
from experience of her own and others 
that “if the behavior to be observed 
occurs . . . less than once in 15 min- 
utes on the average .. . some other 
method than that of short time samples 
. . . is indicated” (1943, p. 95). This
excludes many important phenomena. 
Schoggen (1954) has analyzed speci- 
men day records to find an occurrence 
rate of once every 32 minutes for satia- 
tion, every 54 minutes for success, 
every 81 minutes for frustration, and 
every 260 minutes for failure over the 
long run of 156 hours in the lives of 
12 Midwest 2-to-10-year-olds. Murphy 
(1937) rejected time sampling for study 
of sympathy in young children partly 
because sympathetic behavior proved to 
be all too infrequent among likely sub- 
jects. Goodenough (1930) reports sim- 
ilar experience with one-minute samples 
of compliance by young children. But 
satiation, success, frustration, failure,
sympathy, and compliance are central 
in child behavior. Possibly there is 
something of a negative relation be- 
tween the significance of behavioral 
events and the rate of their occurrence. 
Persons are born once, die once, and 
meanwhile take a billion footsteps. 
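Arrington's rule of thumb can be set against Schoggen's occurrence rates in a short calculation. The sketch below is illustrative only. It treats the quoted rule as a cutoff of one occurrence per 15 minutes on the average, and it assumes, for the second function, a constant average rate over the observation period, which real behavior need not follow. The five-minute interval is a hypothetical choice.

```python
# Arrington (1943): behavior occurring less than once in 15 minutes on the
# average calls for some method other than short time samples.
ARRINGTON_LIMIT_MINUTES = 15

# Schoggen (1954): average minutes per occurrence in specimen day records.
schoggen_minutes_per_occurrence = {
    "satiation": 32,
    "success": 54,
    "frustration": 81,
    "failure": 260,
}


def suits_short_time_samples(minutes_per_occurrence, limit=ARRINGTON_LIMIT_MINUTES):
    """True if the event occurs at least once per `limit` minutes on average."""
    return minutes_per_occurrence <= limit


def expected_occurrences(minutes_per_occurrence, interval_minutes):
    """Expected occurrences in one interval, assuming a constant average rate."""
    return interval_minutes / minutes_per_occurrence


verdicts = {
    event: suits_short_time_samples(minutes)
    for event, minutes in schoggen_minutes_per_occurrence.items()
}
for event, minutes in schoggen_minutes_per_occurrence.items():
    per_unit = expected_occurrences(minutes, interval_minutes=5)
    print(event, verdicts[event], round(per_unit, 3))
```

On these figures none of the four events meets the cutoff, and even the most frequent of them would be expected in only a small fraction of five-minute units.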

Time sampling does not honor natu-
ral behavior units. Aggressive acts, con-
versations, quarrels or friendships, and
virtually all other segments of behavior
in the literature of the method can be
ever so long or short, with the outcome
that the sampling intervals have to be
as found above: arbitrary in length. This
loose respect of behavior for time leaves
no alternative. But with what conse-
quences? As others have mentioned
(Jersild and Meigs, 1939; Murphy and
Murphy, 1935), one result here must
often be the observation of action frag-
ments. A tally per fragment clearly is
legitimate enough for incidence prob-
lems as long as each fragment represents
what is really going on (aggression,
affection, cooperation, or the like) in
the complete behavior unit. One can 
seriously question whether it always 
does, however, and urge as well that 
the watching of events through their 
natural courses must generally be good 
policy for most research purposes. 

Time sampling characteristically sev- 
ers behavior from its immediate relevant 
context. It does not often link behavior 
with coexisting situation.

One can find time samplings that re- 
cord situational factors and aspects of 
child behavior. Nesbitt (1943), for ex- 
ample, gets ten-minute samples of con- 
tributions by adults to the social situa- 
tions of children in a nursery school. 
She counts such dominative adult ac- 
tions as refusals, postponements, and 
threats along with such integrative 
adult actions as invitations to activity, 
participations with the child, and ex- 
pressions of approval. She also gets 
measures of how the children be- 
have during the ten-minute intervals. 
The behavior of each child is rated 
as to constructive use of play mate-
rials, adjustment to routines, social ad-
justment, and individual well being.
Findings of the study show among
other things that the adults were more
often integrative than dominative and
that, on the other hand, the children
exhibited more “acceptable and healthy
than unacceptable and unhealthy prac-
tices.” This seems good. But situation
and behavior phenomena are not aligned here in
procedures or results. What did the 
child do in response to threats or to 
bring them on? How did he react to 
expressions of approval? Questions of 
this order are not asked or answered in 
such a way as to reveal the give and 
take of situation and behavior. 


Anderson (1939a, 1939b), who intro- 
duced and developed the useful concepts 
of dominative and integrative actions (see 
also Anderson, 1940), also applied time- 
sampling procedures in measuring the 
incidence of these control tactics in be- 
havior of teachers with preschool children 
while measuring in addition concurrent 
modes of child behavior. His findings in- 
dicate a general relationship over time such 
that the more integrative the adult con- 
tacts the more hygienic the behavior of the 
child. Again, however, the results do not 
demonstrate specific behavior-with-situa- 
tion links. The observed adult actions are 
distributed among 35 categories that range 
roughly from extreme directiveness to 
extreme permissiveness. The reported in- 
teresting data do not bear concretely and 
stepwise on the question of differences 
from one category to the next in elicited 
behavior of the children. 


What happens in this behavior plus 
situation research can be understood as 
a function of the method. Common psy- 
chological theory now says that for 
every action (or response, for that con-
ceptual matter) there is a corresponding
situation (or stimulus) and vice versa. 
It would seem to follow that the way 
to link actions with situations is to di- 
vide the behavior continuum, not into 
arbitrary time units, but into integral 
units of behavior with its context. We 
have seen one way to do this in speci- 
men description. One first records in 
conventional language the different ac- 
tions of a child and their conditions as 
they occur sequentially and then di- 
vides the record into behavior-situation 
units. Or one can single out such units 
when observations are made, as in event 
sampling and field-unit analysis. The 
chances of matching behavior with con- 
ditions must be better in either case 
than when (1) a count is made of be- 
havior items that occur during fixed 
time intervals and (2) a count also is 
made of situational components that 
happen to occur during the same uni- 
form lengths of time. 

Although data on behavior alone or on 
situation alone or on one plus the other 
obviously have value, two points can 
be urged here. One is that the problem 
of revealing connections between phe- 
nomena and their relevant contexts is 
the central problem in every science. 
The second is that the meaning of an 
action often cannot be established with- 
out regard for the situation in which it 
occurs. Consider some examples that 
refer to behavior variables of time- 
sampling studies. Aggression (Green, 
1933a; Walters et al., 1957) in response 
to a blow on the head is one thing, 
whereas aggression in response to in- 
difference or teasing is quite another 
thing. Actions toward persons (Barker, 
1930; Beaver, 1932; Bott, 1934) who
are hindering you and actions toward 
persons who are helping you can be no 
more the same than the hindering and 
the helping. Among social approaches 
(Mallay, 1935b), visual regard for a 
smile usually differs greatly from visual 
regard for a frown. Yet all of these and 
many other kinds of behavior have been 
time-sampled in situational vacuo. A
further limitation is that short and 
widely separated (cf. p. 98) spans of 
time must in every case leave the larger 
continuing situation of a child out of 
account, which means also that the 
method can never do much in the light 
of, or to illumine, the continuity of 
behavior. 


Some time sampling studies have worked 
at the problem of connecting situation with 
behavior. In her study of imaginative be- 
havior among preschool children, for ex- 
ample, Markey (1935) records imaginative 
play of both the child and children who 
come into contact with the child. She 
records adult “contributions” toward im- 
aginative activity (as in suggesting things 
to do) and also examines consequences of 
change in play-group heterogeneity for 
imaginative behavior. Jersild and Markey 
(1935) record “acts of the teacher” and ask 
how these affect outcomes of conflicts be- 
tween children in a nursery school. Three 
procedural features of these two studies 
are noteworthy. 


1. The observation interval lasts fifteen 
minutes. Fifteen minutes could possibly 
be near a safe outside limit for capture, in- 
tact, of children’s conflicts and imagina- 
tive-play episodes. 

2. Specimen narration is the recording 
technique. 

3. Behavior and situation items are 


counted, not per integral unit of any kind, 
but per fifteen-minute interval. 


One nevertheless can wonder whether 
these investigators might have accom-
plished more with the action-context prob-
lem if, in analyzing their narrative speci-
mens, they had first systematically divided
each one into action-context units. 

There are to be considered, also, tests of 
relationships between situational factors 
and behavior in which time sampling and
independent measures of specific environ-
mental properties are coordinated. Take
the study by Lippitt and associates (cf.
p. 78) of behavioral contagion and social
influence attempts as they are related to
measured social power of the child
and his companions. Special measures es-
tablish the relative power of each com-
panion, C. Direct observations during in-
tervals of five, ten, and fifteen minutes
show further that C’s power as a situational
factor and the behavior of S toward C are
related. Here is one indirect way to use
time sampling in research to connect be-
havior with its cotemporaneous condi-
tions.

Body (1955) applies time sampling in a 
study that aims to get at “situational fac- 
tors” in the aggressive behavior of 
children in a “university nursery school” 
and a “community-supported day school” 
for young children. The children of the 
two school settings are found to differ 
in aggressive behavior. Explanations of 
the differences are proffered in revealing 
discussion of dissimilarities between the 
compared settings in “group composition, 
intragroup constellations, child-child con- 
tacts, direction methods, and school pro- 
gram.” But these situational determinants 
are taken into account ex post facto. They 
are not measured directly. Only the ag- 
gressive behavior is time-sampled. The 
situation of the child action by action is 
left out of account. 

This leaves the question of aims in
time sampling. The evident aim of the 
method in its more typical applications 
is normative. See for clear examples 
Bott (1934), Swan (1938), Walters, 
Pearce, and Dahms (1957)—all in Table 
3-5—Arrington (1932a), and Thomas 
(1932a,b). The investigator in these 
and other comparable studies records 
behavior alone for the sake of char- 
acterizing the behaving child. Attention 
may be given to individual differences. 
But these are usually heeded with less 
regard for the child as an individual 
than for measurement of deviations 
from central tendencies. Clearly idio- 
graphic applications of the method 
seem negligible. Most attention is given 
here to the child of a given class, who 
nearly always turns out to be the pre-
school child of a stated age.

Psychological tests are very often 
applied similarly to the normative prob- 
lem. Tests to get norms and direct 
observing of behavior alone to get 
norms seem basically the same in strat- 
egy, inasmuch as both use information 
about behavior to establish common 
traits of classified children. It is none- 
theless true that the two methods differ 
in important tactics. Paradoxically, di- 
rect field observing in this case gives 
situational factors shorter shrift than 
psychological testing. Take any repre- 
sentative abilities test or personality test 
of the inventory type. What every such 
test does in effect is to score behavior 
in response to (1) a large number of 
(2) prototypical situations that are (a) 
known and (b) standardized. What any 
observational study of behavior alone 
does in fact is to score behavior in re-
sponse to (1) a probably smaller num-
ber of (2) local and circumstantial sit-
uations that are (a) unknown (where
unknown means unrecorded) and (b) 
not standardized at all. The observa- 
tional norm seeker generally does iden- 
tify the activity setting, as, for example, 
preschool X, outdoors, morning, and 
free-play period. But this leaves much 
to be said about psychological situa- 
tions. As for outcomes, the results of 
the tests tell us what to expect of clas- 
sified children in situations that present 
intellectual problems or aggressive acts 
or affectionate overtures or dangers or 
whatever at home, at school, on the 
playground, or the like. On the other 
hand, to look back on pertinent studies 
to date, normative observational data 
on behavior out of context tell us 
mainly what to expect of children in 
American preschools. This also is some- 
thing but it is still not so much as could 
be hoped. And almost all of these per- 
tinent studies to date are time sampling 
studies. 


At least two issues, which can only be 
touched on now, are left over here. “Spon- 
taneous behavior” reveals the child in a 
unique way? Behavior that cannot be 
elicited by tests can be observed directly? 
The right answer in each case may well 
be yes, but we would urge no less that it
is more useful to record directly observed 
behavior in context than out of context 
for any purpose. 


Aims in time sampling have nearly 
always been systematic when they have 
not been normative. The study by 
Heathers in Table 3-5 gives a good ex- 
ample. So also does a demonstration by 
Marshall and McCandless (1957b) of a
negative relation between dependence 
on adults and social acceptance by 
peers among preschool children. The
study by Lippitt, Polansky, and Rosen 
(1952) of relationships between power 
and social behavior in groups of chil- 
dren, which has already been cited, 
is another case in point (see p. 78 
and p. 102). Consider also an investiga- 
tion by Maas (1954) which tests a num- 
ber of hypotheses about behavior in 
lower-class and middle-class groups of 
adolescents. These hypotheses include, 
for example, the one that member-to- 
member actions are more aggressive in 
the lower-class groups. One way to test
this hypothesis is to observe intermem- 
ber behavior in suitable teen-age clubs 
for brief and constant intervals of time. 
Maas did as much and thereby gathered 
efficiently data that upheld the stated 
expectation and others. 

Work begins in these studies with 
definite ideas about empirical relation- 
ships between variables of behavior or 
of situation and behavior. Time sam- 
pling may be at its best in research with 
such a beginning. It provides one effi- 
cient way of testing hypotheses or an- 
swering questions about how often chil- 
dren behave in given ways in life set- 
tings under specified conditions. There 
are good incidence problems in child 
development; and the studies at hand 
show that some of these problems can 
be met usefully by sampling naturally 
occurring behavior during fixed inter- 
vals of time. But there are other and 
possibly sounder ways to handle such 
problems, as discussion to follow may 
show.


EVENT SAMPLING 


Event sampling begins with a plan 
to study integral behavioral events of 
a given class. These may be in the
nature of anger outbursts, or of argu-
ments, or games. The investigator sta-
tions himself where children can be
seen and heard, waits for the events to
happen, and then describes them if and
when they do occur. Each event is a
sample of its class in the behavior
streams of classified children in selected 
life settings. 

Here in psychological application is 
the classical method of natural history 
research in biology. How does a beaver 
build a dam? What are the nesting habits
of geese? And how do frogs prepare for
winter? Biological scientists have built 
up vast stores of knowledge about such 
events by the straightforward tactic of 
describing each one as it occurs re- 
peatedly in nature. Child psychologists 
are comparative novices in this long line 
of basic descriptive study, and yet they 
have made some beginnings as event 
samplers. 

Take as a model case a study by 
Helen Dawe (1934). This modest in- 
vestigation describes quarrels of pre- 
school children “as they occur spon- 
taneously” during the free-play hour 
of a nursery school over a period of 
four autumn and winter months. The 
subjects are 19 girls and 21 boys of ages 
2 to 5 years. Dawe “watched the chil- 
dren from some central spot on the 
playground or from a doorway between 
the two main playrooms” of the school. 
“Immediately upon noting evidences of 
a struggle” she “moved quickly to the 
scene of action as unobtrusively as pos- 
sible” and “devoted her whole attention 
to the quarrel” until “the issue was set- 
tled or the teacher interfered.” Each 
“struggle” was timed from its inception 
to the outcome. “Running notes” and
also entries on prepared forms were re- 
corded immediately after the conclu- 
sion of every quarrel. The forms gave 
space for recording (1) the name, age, 
and sex of every participant, (2) the 
duration of the quarrel, (3) what the 
children were doing at the time of on- 
set, (4) what they quarreled about: 
contested ownership, physical violence, 
interference with activity, or other 
social stress, (5) the role of each 
participant: precipitator, main aggres- 
sor, retaliator, objector, sheer emoter, 
or passive recipient, (6) specific motor 
and verbal accompaniments, (7) the 
outcome: yielding to force, voluntary 
yielding, compromise, interference by a 
child spectator, or intervention by a 
teacher, and (8) the aftereffects: good
cheer or resentment. 
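The eight headings of the recording form translate naturally into a structured record. The following is a hypothetical sketch, not Dawe's actual form: the field names, the validation rule, and the two sample quarrels are assumptions made for illustration, and only the category labels are taken from the description above.

```python
from collections import Counter
from dataclasses import dataclass
from typing import List

CAUSES = ("contested ownership", "physical violence",
          "interference with activity", "other social stress")
ROLES = ("precipitator", "main aggressor", "retaliator",
         "objector", "sheer emoter", "passive recipient")
OUTCOMES = ("yielding to force", "voluntary yielding", "compromise",
            "interference by a child spectator", "intervention by a teacher")
AFTEREFFECTS = ("good cheer", "resentment")


@dataclass
class Participant:
    name: str          # heading 1
    age_years: float   # heading 1
    sex: str           # heading 1
    role: str          # heading 5


@dataclass
class QuarrelRecord:
    participants: List[Participant]  # headings 1 and 5
    duration_seconds: float          # heading 2
    activity_at_onset: str           # heading 3
    cause: str                       # heading 4
    accompaniments: List[str]        # heading 6
    outcome: str                     # heading 7
    aftereffect: str                 # heading 8

    def __post_init__(self):
        # Reject entries that fall outside the categories named on the form.
        if self.cause not in CAUSES:
            raise ValueError(f"unknown cause: {self.cause}")
        if self.outcome not in OUTCOMES:
            raise ValueError(f"unknown outcome: {self.outcome}")
        if self.aftereffect not in AFTEREFFECTS:
            raise ValueError(f"unknown aftereffect: {self.aftereffect}")
        for person in self.participants:
            if person.role not in ROLES:
                raise ValueError(f"unknown role: {person.role}")


def summarize(records):
    """Mean duration and outcome tallies over a set of quarrel records."""
    durations = [r.duration_seconds for r in records]
    return {
        "n_quarrels": len(records),
        "mean_duration_seconds": sum(durations) / len(durations),
        "outcomes": Counter(r.outcome for r in records),
    }


# Two invented quarrels, for illustration only.
records = [
    QuarrelRecord([Participant("child 1", 3.5, "M", "precipitator"),
                   Participant("child 2", 4.0, "M", "retaliator")],
                  20, "block building", "contested ownership",
                  ["pushing"], "yielding to force", "good cheer"),
    QuarrelRecord([Participant("child 3", 2.5, "F", "main aggressor"),
                   Participant("child 4", 3.0, "M", "objector")],
                  30, "sandbox play", "interference with activity",
                  ["crying"], "intervention by a teacher", "resentment"),
]
summary = summarize(records)
print(summary)
```

Because every record is a whole quarrel with its conditions attached, tallies such as outcome by cause or duration by setting follow directly from the same entries.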

What did Dawe learn by waiting for 
these quarrels to occur and by then
describing them one by one? Here are 
some sample findings. 




Two hundred quarrels were observed at 
the rate of three to four per hour among 
the 40 children. These quarrels were 
“amazingly short” indeed. Their average 
duration was twenty-four seconds, with 
only 13 lasting more than a minute. The 
time per quarrel was nearly twice as great 
outdoors as indoors. Quarrels stopped by 
teachers were not appreciably shorter than 
those settled by the children! The boys 
quarreled more often than the girls. Quar- 
rels involving boys alone or girls alone 
were the more frequent, but these intrasex 
quarrels ended with some form of com- 
promise about three times as often as 
intersex quarrels! So! Retaliative acts in- 
creased with age although the number 
of quarrels declined with age. Such motor 
activity as pushing or striking was absent 
in only three of the 200 quarrels! Truly ar-
gumentative quarrels increased with age,
as Piaget (1952), for example, would as- 
suredly have expected. Yielding to force 
was the commonest outcome. Quick and 
cheerful recoveries outnumbered resentful 
aftermaths about three to one. (Abstracted 
from Dawe, 1934.)


Here are data gathered by recording 
unitary behavioral events as they hap- 
pen, one by one, in the naturally oc- 
curring behavior streams of children; 
and it is submitted that these data are 
splendidly intelligible and useful. 

Dawe’s application of the method 
does indeed tell something about be-
havior in context. The quarrel as an
integral process with an observable 
course is described in relation to some 
of its conditions, included among which 
there are precipitating states of affairs to 
quarrel about, opposite-sex and same-sex 
peers to quarrel with, teachers as in- 
terveners, and force or compromise as 
quarrel enders. The method has not 
always achieved such description, al- 
though several investigators have 
worked with it in this behavior-in-con- 
text vein, as in event samplings of anger 
or aggressive acts in young children 
(Goodenough, 1931; Ricketts, 1934;
Appel, 1942), children’s fears (Jersild 
and Holmes, 1935) and their “out- 
breaks” of both anger and fear (Felder, 
1932), other “emotional episodes” (in 
infants, Blatz and Millichamp, 1935; in 
the child of school age, Blatz, Chant, 
and Salter, 1937; in Filipino preschool 
and school children, Inselberg, 1958), 
sympathetic child behavior (Murphy, 
1937), and such other events in chil- 
dren’s lives as crying (Landreth, 1941), 
sleeping (Reese, 1932), laughing and
smiling (Ding and Jersild, 1932; Blatz,
Allin, and Millichamp, 1936), war play 
(Bonte and Musgrove, 1943), spon- 
taneous group formations (Chevaleva- 
Janovskaja, 1927), riding to school with 
a stranger (Heathers, 1954), the troubles 
other children have with teachers 
(Kounin and Gump, 1958), and greet- 
ings in pedestrian streams of large and 
small communities (Wright and Dreyer, 
1959). Studies of language functions 
by Fisher (1934), Janus (1943), and 
Burnham (1956) illustrate similar re- 
search on spontaneous verbal behavior. 
Some of Piaget’s observations could 
also be counted here, even though he 
generally oversteps the bounds of ob- 
servational study by interrogating his 
subjects. See for pleasant examples his 
descriptions (1932) of how Swiss chil- 
dren play marbles. 

This leaves applications with less re- 
gard for observed conditions of be- 
havior. Examples include event sam- 
plings of classroom recitations (Horn, 
1914), misdemeanors at school (Blatz 
and Bott, 1927), laterality of limb move- 
ments in infants (Stubbs and Irwin, 
1933), food habits (Vance and Temple, 
1933; Campbell, 1933), sleeping habits 
(Boynton and Goodenough, 1930; 
Scott, 1931; White, 1931), choice of 
play materials (McDowell, 1937), and 
speech elements (Adams, 1932; Stal- 
naker, 1933; Nice, 1932). These in- 
vestigations are rather like normative 
time sampling studies that inquire about 
the frequency of certain behavior items 
among children of particular age levels 
in particular settings. Ames (1949, 1952) 
and others at Yale have also applied a 
kind of event sampling toward developmental
norms. A procedure that might
appropriately be called “progressive
condensation” is followed. The observer 
begins with detailed notations on 
selected behavior targets and ends with 
highly condensed normative summaries. 
Ames, for example, proceeds in this 
way to describe the growth of “inter- 
personal smiling responses” in the pre- 
school years (1949) and to trace 
developmental change among preschool 
children in “the sense of self” as ex- 
pressed or implied by verbal or other 
behavior (1952). 

Event samplers have been like time 
samplers as a rule in preferring record- 
ing techniques that arm them in the 
field with check lists, category sets, or 
the like. Yet some have been content 
just to look, listen, and then describe 
each event in everyday words. Murphy
(1937), Bonte and Musgrove (1943),
Ames (1949, 1952), and Kounin and 
Gump (1958) have done this. A few in- 
vestigators, also, have combined free 
narration with coded observation 
guides. The form used by Jersild and 
Holmes (1935, p. 351) in their research 
on children’s fears is a clear example. 
This form includes check lists of items 
that refer to the child’s activity pro- 
gram and physical condition during the 
day in which each recorded fear oc- 
curs, and it also provides white space 
under each of the subjoined heads. 


Situation in which the child gave 
signs of being afraid: place, time, what 
the child was doing at the time, persons 
present, apparent cause of fear, etc. 

Behavior of the child: words spoken, 
cries, other vocalizations, jumping, 
starting, withdrawing, running away, 
other physical activity, etc.

Perhaps some such directive, yet
open-ended provisions for recording 
both situation and behavior might prof- 
itably be made in every event sampling 
study. 

Event samplers generally have 
worried about the reliability of their 
observations less than time samplers. 
One nonetheless finds only reassuring 
results wherever agreement between 
simultaneous and independent observers 
has been tested. See for examples the 
studies by Dawe (1934), Ding and 
Jersild (1932), Ricketts (1934), and 
Landreth (1941). Landreth (1940) com- 
pared event (“incident”) sampling, time 
sampling, a class record, and teacher 
ratings as ways to study crying by 
preschool children. She reports ob- 
server agreement per cents of 97.5 and 
95.5, respectively, for the first two 
methods. The event sampling scores 
were found to measure this “sporadic” 
type of behavior best, yet they proved 
to be low in consistency over time. 
Dawe and also Ding and Jersild, how- 
ever, show stability in frequency of 
their target phenomena (noted above) 
by getting moderately high correla- 
tions between scores from different ob- 
servation periods. 
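
The observer agreement per cents cited above can be illustrated with a small sketch. The formula assumed here (agreements divided by the number of jointly coded events, times 100) is one common convention and is not necessarily the one Landreth used; the function name and the sample codes are hypothetical.

```python
def percent_agreement(record_a, record_b):
    """Percentage of events that two independent observers coded identically.

    Assumption: both observers coded the same ordered list of events.
    """
    if len(record_a) != len(record_b):
        raise ValueError("records must cover the same events")
    if not record_a:
        raise ValueError("records must not be empty")
    agreements = sum(a == b for a, b in zip(record_a, record_b))
    return 100.0 * agreements / len(record_a)


# Hypothetical codes for eight observed incidents.
observer_1 = ["cry", "cry", "none", "cry", "none", "cry", "cry", "none"]
observer_2 = ["cry", "cry", "none", "cry", "cry", "cry", "cry", "none"]
agreement = percent_agreement(observer_1, observer_2)
```

An index of this kind says nothing about stability over time, which the studies above assess separately by correlating scores from different observation periods.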

Others seem to have been little con- 
cerned with measurement stability in 
applications of this method. It is as if 
most of them had reasoned that (1) 
psychological events are sure to change 
in frequency and character over time as 
their conditions change, in spite of 
which (2) there will be some stability 
when the child stays the same, but (3) 
let us not bother to show this because 
(4) we are interested in the events 
more as functions of their momentary
conditions than as functions of the
transcending child. Students of per- 
sonality, of the individual, of the or- 
ganism, of the child may not easily get 
the feel of this position, which is never- 
theless as comfortable as an old shoe 
to some students of social behavior. 

Perhaps the most distinctive good 
point of event sampling is that it struc- 
tures the field of observation into 
natural units of behavior and situation. 
Important resulting advantages have 
been anticipated, especially in our dis- 
cussion of time sampling. These ad- 
vantages amount essentially to a fair 
chance to study relationships between 
behavior and its coexisting conditions. 
Every behavioral event with enough 
singularity to invite attention is poten- 
tially a systematic psychological study 
in nature. All one has to do to make it 
actually an experiment of sorts is to 
control its conditions in effect by keep- 
ing track of them and of correlative 
changes in behavior. Dawe, Jersild and
Holmes, Goodenough, and others 
whose pertinent research we have mentioned
show that this can be done.

An important practical asset of the 
method is that it can be adapted to study 
of naturally occurring behavioral phe- 
nomena that happen only once in a 
fairly long while. Event sampling is 
most economical in the face of such 
phenomena if lay adults who live on 
the scenes of observation serve as ob- 
servers. Goodenough (1931) and Jersild 
and Holmes (1935) have set a precedent 
here by securing parents and other adult 
family members as observers in their 
studies of anger and fear. It is suggested 
that the possibilities in extensive event 
sampling of child behavior by resident
lay observers in homes, schools, camps,
stores, and other such activity settings 
deserve consideration. There may be no 
more feasible way to get at key spon- 
taneous behavioral phenomena with 
low incidence rates. 

Possibly the greatest limitation of 
event sampling is that it breaks up the 
larger continuity of behavior—as much 
as time sampling. The method compares 
unfavorably in this respect with speci- 
men description. Yet any unitary be- 
havioral event has a continuity of its 
own that can be recorded. Biological 
naturalists have done so much better 
here than child psychologists as event 
samplers. The natural history researcher 
in biology characteristically describes 
things, from births to migrations and 
life cycles, in process as well as in con- 
text. Child psychology needs such 
descriptions. It has only a most pitifully 
meager stock of beginning-to-end records
that describe psychologically significant
events in the lives of children.
Selective event sampling with some 
such recording technique as specimen 
narration could be used to meet this 
need for descriptions that preserve at 
least the smaller continuities of be- 
havior. 

As for research aims, the evident pur- 
pose of the more representative event 
sampling studies now in the literature is 
primarily systematic, not normative. 
The studies by Dawe, Goodenough, and 
many others have contributed toward 
empirical generalizations that relate be- 
havior to its more or less universal de- 
terminants. It may be, however, that 
the method is suited as well to the 
ecological aim. Consider as classes of 
behavioral events in the lives of children,
all of which have been or might
well become targets of event sampling
studies: quarrels, anger episodes, fear
episodes, frustration episodes, success 
episodes, failure episodes, competition 
episodes, cooperation episodes, problem-solving
episodes, actions for others, actions
against others, play with pets, play
with dolls, solitary play episodes, discipline
episodes, illnesses, school i
tions, public performances, chores,
births, nursing episodes, weaning epi- 
sodes, car trips, humor episodes, toilet- 
training episodes, religious-ritual epi- 
sodes, new-situation episodes (such as 
first baths, first school days, introduc- 
tions to strangers), buying things, being 
the baby sat with, getting-up episodes, 
and going-to-bed episodes. Here are 
possible experiments in nature on sys- 
tematic problems. But here also are 
possible natural history studies. How 
do these and other such events in child 
behavior streams differ, as between dif- 
fering communities and cultures, prim- 
itive and civilized, Soviet and American, 
Maine and Kansas cultures, and large 
and small, old and new, central and 
suburban, rich and poor communities 
of these cultures, through scores of like 
variations? Anthropologists and soci- 
ologists have worked productively with 
many facets of this mammoth question, 
and psychological researchers in child 
development with an ecological aim 
have available in event sampling one 
clear way to do likewise. 


FIELD UNIT ANALYSIS 
This method has two phases. 


1. A behavior sequence is divided 
into consecutive units in the field on 
the basis of explicit rules. 


2. Descriptive categories are applied
to the phenomena of each unit. 


Presently ongoing behavior is ana- 
lyzed here much as narratively recorded 
behavior has been unitized and ex- 
amined in the analysis stage of specimen 
description. Great advantages in a 
permanent record that can be saved 
and studied in a hundred ways are lost, 
but economy is gained; and so also is a 
special advantage in time sampling and 
event sampling, namely, close contact 
through the stage of analysis with be- 
havior in progress. 

The method is comparatively new. 
Partly for this reason, and because it 
seems uniquely promising, its applica- 
tions in single studies will be reviewed 
in some detail. 

Consider as one preliminary example 
a procedure in which Wright and as- 
sociates (1957) do on-the-spot episod- 
ing. A child is observed for twenty to 
thirty minutes as he goes about his 
business in a community. His behav- 
ior throughout this time is divided 
into episodes on the basis of criteria 
first developed for discrimination of 
episodes in specimen records (see p. 91, 
Barker and Wright, 1955, pp. 225-273). 
The duration of every episode is re- 
corded. Beyond this, the observer re- 
cords on the spot only a participial 
phrase as a title of each episode. Buying 
Oranges and Discussing Tornado with 
Banker are examples. The record for 
each observation period consists solely 
in the time notations and a series of 
participial phrases that identify different 
episodes and show their structural rela- 
tions: of sequential position and, at 
times, of overlapping. Immediately after
every observation period, however,
categories to describe the behavior and 
the situation in each episode are ap- 
plied by the observer. Each recorded 
participial title serves then to muster 
in the observer-turned-analyst an image 
of what the child has just been observed 
to do and of his situation in the named 
episode unit. Conclusive statements can- 
not now be made about the reliability of 
the substantive outcome of this tech- 
nique, which may nonetheless show 
what we mean by field unit analysis. 
These same investigators are working 
similarly with a unit for study of social 
interaction between children and others 
of their communities. This part of a behavior
sequence is defined only by constancy
in number and identity of the
child’s present associates. Hence it is 
called an association unit. An associa- 
tion unit and a spur-of-the-moment 
social group that includes the child and 
any one or more associates are the 
same. Associates are defined without 
conceptual or operational ado as in- 
dividuals with whom the child becomes 
involved in any way. The subjects, chil- 
dren of 6 through 11 years, are observed 
individually for periods of one hour.
Intervals through which the child is 
alone become zero association units. 
Different kinds of social action are nar- 
ratively recorded as they occur in be- 
havior of the child and of every as- 
sociate. Frequencies of units differing 
in size and in composition (with respect 
to sex, age, and role) are determined. 
Also, different kinds of social action 
(e.g., aggressive or nurturant) in various
sizes of units (e.g., three-person
groups) are analyzed on an event sampling
basis. Tests show satisfactory levels
of observer agreement in identification
of associates and actions.
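
The definition of the association unit lends itself to a short sketch. What follows is only an illustration under stated assumptions: the record format (a list of time-stamped notations of who is present), the function name, and the sample data are hypothetical and are not taken from the investigators' procedure. Unit size is counted here as the number of associates, excluding the child.

```python
from collections import Counter


def association_units(timeline, end_time):
    """Split a record into association units.

    timeline: list of (start_minute, associates) pairs in time order, where
    associates is the set of persons present with the child from that minute on.
    A new unit starts whenever the set changes; an empty set gives a zero unit.
    """
    units = []
    for i, (start, associates) in enumerate(timeline):
        present = frozenset(associates)
        stop = timeline[i + 1][0] if i + 1 < len(timeline) else end_time
        if units and units[-1]["associates"] == present:
            units[-1]["stop"] = stop  # same company: extend the current unit
        else:
            units.append({"start": start, "stop": stop, "associates": present})
    return units


# Hypothetical one-hour observation of a single child.
record = [
    (0, set()),             # alone
    (5, {"Tom"}),
    (12, {"Tom", "Sue"}),
    (22, {"Tom", "Sue"}),   # no change in company, so no new unit
    (30, {"Sue"}),
    (48, set()),
]
units = association_units(record, end_time=60)
size_counts = Counter(len(u["associates"]) for u in units)
```

Frequencies by composition (sex, age, role) could be tallied in the same way once each associate carries those attributes.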


A behavior episode is an integral part 
of the behavior continuum. It has phe- 
nomenal discreteness and some dynamical 
independence (Barker and Wright, 1955, 
pp. 225-230). Can the same be said for 
the association unit? The investigators 
think so. It is phenomenally discrete inso- 
far as a person knows when others come or 
go. It has some dynamical independence in 
that a person’s behavior depends in part 
upon the presence or absence of others. 
An episode is defined essentially by con- 
stancy in direction of behavior. But there 
obviously are congruencies between be- 
havioral direction and the comings, stay- 
ings, and goings of other individuals. Some 
correspondence between episodes and as- 
sociation units can therefore be predicted. 
Episoded specimen day records of the 
Midwest collection (with classroom se- 
quences eliminated) have been used to test 
this prediction. The resulting data show
that 44 per cent of the association units in 
four day records on as many Midwest 
7-to-11-year-olds coincide exactly from be- 
ginning to end with behavior episodes. 
Further, when this correspondence is im- 
perfect, the percentage of association units 
that either start or end where episodes 
start or end is 95. 
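
The two percentages just reported can be computed along the following lines. This is a sketch only: the interval representation, the function name, and the sample data are hypothetical, and "start or end where episodes start or end" is read here as sharing any boundary with any episode, which is an assumption about the original tally.

```python
def correspondence(units, episodes):
    """Return (percent exact, percent of the inexact units sharing a boundary).

    units, episodes: lists of (start, stop) pairs on a common time scale.
    """
    episode_set = set(episodes)
    boundaries = {point for episode in episodes for point in episode}
    exact = [u for u in units if u in episode_set]
    inexact = [u for u in units if u not in episode_set]
    shared = [u for u in inexact if u[0] in boundaries or u[1] in boundaries]
    pct_exact = 100.0 * len(exact) / len(units)
    pct_shared = 100.0 * len(shared) / len(inexact) if inexact else 0.0
    return pct_exact, pct_shared


# Hypothetical association units and episodes, in minutes.
sample_units = [(0, 5), (5, 12), (12, 30), (30, 34),
                (34, 38), (38, 50), (50, 55), (55, 60)]
sample_episodes = [(0, 5), (5, 9), (9, 12), (12, 30),
                   (30, 50), (50, 55), (55, 60)]
pct_exact, pct_shared = correspondence(sample_units, sample_episodes)
```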


The association unit can be seen 
with the almost naked eye. It is in itself 
a basic psychosocial phenomenon. It 
defines precisely as to size and makeup 
the group context of interpersonal relations.
It may have fair integrity for the
individual child. These considerations
and data coming in encourage belief 
that the unit offers a good base for on- 
the-spot study of naturally occurring 
social behavior. Yet it is crude. One 
association unit—lasting, for instance,
ten minutes of continuous time that
Joe, Tom, and Sue spend together— 
can span behavior segments in the life 
of each child that differ greatly as 
unitary events. Psychologically more 
refined units are imperative for many 
research purposes. 

Bishop (1951) has contributed a kind 
of field unit analysis that presents some 
intriguing tactics. Her technique, which 
Moustakas and Schalock (1955) and 
Moustakas, Sigel, and Schalock (1956) 
have modified, looks at first glance like 
time sampling. It employs five-second
lengths of time both as recording intervals
and as counting integers. In
the first place, however, these lengths of
time are consecutive rather than in- 
termittent, and, in the second place, the 
recorded observations account for se- 
quential and complete units of social 
interaction. The target phenomena in 
the research are interactions between 
preschool children and their mothers in 
a play setting.

The observer masters sets of cate- 
gories to describe behavior of the 
mother and behavior of the child. Each
observation period lasts thirty min- 
utes. Symbols of the categories are 
recorded when they apply during the 
five-second intervals. Thus, in effect, 
complete cycles or units of interaction 
go down in the field protocol as they 
occur in each period of observation. 
Further, instigating behavior and re- 
sponsive behavior on each side in tran- 
sactions between mother and child are 
distinguished by the notations. The 


1 Only Bishop’s own applications involve ap- 
preciable experimental arrangements. See text 
note on p. 75.
appropriate symbol is carried over from
interval to interval when either an instigation
or a response lasts longer than
five seconds. As we envisage it, a se- 
quence of the field record, kept by 
devices that cannot be recounted here, 
might be read off as follows: 


Unit 10: instigation: mother suggests; response: child accepts
         (1 interval = 5 seconds)
Unit 11: instigation: mother restricts (3 intervals = 15 seconds);
         response: child resists (1 interval = 5 seconds)
Unit 12: instigation: child seeks help (2 intervals = 10 seconds);
         response: mother helps (4 intervals = 20 seconds)


Tests show high observer agreement in 
application of the categories, and cor- 
relations of frequencies from different 
observation periods show reasonable 
consistency in the adult and child be- 
havior. A frequency score is obtained 
per category by summing the five-sec- 
ond intervals in which the social be- 
havior at issue occurs. Proportion scores 
are also derived to equalize the dif- 
ferences among mother-child pairs in 
absolute amount of social behavior. 
Measures on the adult side are correlated 
with measures on the child side. An r
of .70, for instance, shows a clear posi- 
tive relation between specific control by 
the mother and reluctant cooperation 
by the child. 
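
The frequency and proportion scores described above can be sketched as follows. The protocol format, the function name, and the sample data are hypothetical, and the denominator of the proportion score (the person's total of scored category-intervals) is an assumption, since the text does not state how the proportions were formed.

```python
from collections import Counter


def category_scores(interval_codes):
    """Frequency and proportion scores from a five-second interval protocol.

    interval_codes: one entry per five-second interval, each a list of the
    category symbols recorded in that interval (empty if none applied).
    """
    frequency = Counter()
    for codes in interval_codes:
        for code in set(codes):  # a category counts once per interval
            frequency[code] += 1
    total = sum(frequency.values())
    proportion = {c: n / total for c, n in frequency.items()} if total else {}
    return frequency, proportion


# Hypothetical protocol for the mother over nine consecutive intervals.
mother_protocol = [
    ["suggests"],
    ["restricts"], ["restricts"], ["restricts"],
    [],
    ["helps"], ["helps"], ["helps"], ["helps"],
]
frequency, proportion = category_scores(mother_protocol)
```

Proportion scores of this kind allow mother-child pairs with very different absolute amounts of interaction to be compared on the same footing.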

Each basal frequency count of this 
procedure expresses well the total 
amount of suggesting or restricting or
whatever per child or mother; but
the analysis procedure could be chal- 
lenged on the issue of part-to-whole 
relations in behavior study. These fre- 
quency counts add whole actions, 
specifically, every action that lasts no 
longer than five seconds, and temporally 
equal action fragments. Some psychological
researchers may prefer to add
whole actions only, even when these 
behavior segments differ in temporal 
length. One might be happier, for ex- 
ample, with a sum of 2 that means “You 
ought to be more careful, George” (five 
seconds of criticizing), plus a one- 
minute harangue with the same theme 
(sixty seconds of criticizing), than with 
a sum of 13 that means “You ought to 
be more careful, George,” plus 12 sepa- 
rate twelfths of the one-minute har- 
angue. Time sampling often adds action 
fragments perforce. The present tech- 
nique need not. Perhaps it would do 
better without the five-second intervals, 
in which case each categorized action 
could simply be timed as it comes. 
Bishop’s action-by-action analysis on 
the spot nevertheless yields rich find- 
ings. Also, the stepwise delineations of 
behavior that her records give can be 
mined for many sorts of additional 
findings. What is the mean duration of 
each type of action? What are the
frequency distributions for different 
kinds of instigation? Of response? What 
pairings of instigation by C (the child) 
and O (the other person) are most to 
least common? Of instigation by O and 
response by C? What about sequence 
relations between actions of a series? 
For example, given any chosen number 
of consecutive directing instigations by 
O, what is the probability of an immediately
sequent noncooperating response
by C? Unitary behavior units
laid end to end raise hypotheses. 
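
Several of the questions just raised can be put to an action-by-action record in a few lines. The sketch below is illustrative only; the record formats, function names, and sample data are hypothetical. It contrasts a count of whole actions with a sum of five-second intervals, gives mean duration per category, and estimates the probability of a given response to the last of a run of like instigations.

```python
import math
from collections import defaultdict

INTERVAL_SECONDS = 5


def interval_sum(durations):
    """Count each action once per five-second interval it occupies."""
    return sum(math.ceil(d / INTERVAL_SECONDS) for d in durations)


def mean_duration(actions):
    """Mean duration in seconds per category; actions are (category, seconds)."""
    grouped = defaultdict(list)
    for category, seconds in actions:
        grouped[category].append(seconds)
    return {c: sum(v) / len(v) for c, v in grouped.items()}


def p_response_after_run(units, instigation, response, run_length):
    """Share of runs of run_length like instigations that end in response.

    units: ordered (instigation_category, response_category) pairs.
    """
    hits = trials = 0
    for i in range(run_length - 1, len(units)):
        window = units[i - run_length + 1 : i + 1]
        if all(inst == instigation for inst, _ in window):
            trials += 1
            hits += int(units[i][1] == response)
    return hits / trials if trials else None


# The five-second remark and the one-minute harangue from the text.
criticizing = [5, 60]
whole_actions = len(criticizing)          # 2
fragment_sum = interval_sum(criticizing)  # 13

# Hypothetical sequence of interaction units.
sequence = [
    ("directs", "cooperates"), ("directs", "cooperates"),
    ("directs", "noncooperates"), ("suggests", "cooperates"),
    ("directs", "cooperates"), ("directs", "noncooperates"),
]
p = p_response_after_run(sequence, "directs", "noncooperates", run_length=2)
```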

Here is one way to scan system- 
atically the continuous social behavior 
stream of a child. The technique clearly
must be about as good as its main 
facility: a set of terms for describing 
social and other behavior. See, with this 
in view, the categories Bishop has de- 
veloped and also the categories and re- 
finements in field scoring procedures of 
Moustakas, Sigel, and Schalock (1956). 
The latter investigators use no fewer 
than 97 (memorized) rubrics in analyz- 
ing consecutive five-second intervals of 
therapist-child and parent-child interactions.
This staggers a slow-gaited field
episoder, but tests show good observer
agreement on most of the categories.
Practicability and usefulness of
this way to unitize and describe be- 
havior on the spot have been clearly 
demonstrated. 

Heinicke (1956) has applied in field 
procedures a method of analyzing units 
of action that are also called acts or 
activities. He compares 2-year-old chil- 
dren of day and residential nurseries, 
respectively, as subjects of mild and 
severe separation from parents. His 
analytical scheme is adapted from the 
system of Bales (1950; Heinicke and 
Bales, 1953) for observational study of 
social interaction. It also has a psycho- 
analytic base. Certain features of every 
act are recorded. These are the agent of
action: the child or any other person; 
the object of action: any present or 
absent recipient of an act—a body part, 
for example, a toy, or a person; the 
agent-to-object relation: how the agent 
relates himself to the object, exemplified
in such kinds of behavior as nurturance,
hostility, and avoidance; the mode of 
expression: how the act is mediated, as 
by biting or crying; and the intensity of 
the act: its quantitative force (rated 
only on hostility as a relation and crying
as a mode of expression). A unit
of action is defined by constancy in all
of these variables. Thus a new unit
begins with every change in agent,
object, relation, mode of expression, or
intensity, but for the one qualification
that a unit is assigned to every thirty
seconds of a continuous act “because
on the average about 2 units were
scored every minute.” Proportions of
all action units in which categorized
phenomena occur are determined in the
final analysis stage. It is found, for example,
that the percentage of acts in
which the child as agent relates him- 
self with others in a severely hostile 
way is greater among the residential 
nursery children. 
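
The unitizing rule and the derived proportions can be sketched as below. This is not Heinicke's scoring procedure: the record format, the names, and the sample data are hypothetical, and the treatment of a remainder shorter than thirty seconds (rounded up to a further unit) is an assumption, since the rule as quoted does not settle it.

```python
import math
from dataclasses import dataclass

MAX_UNIT_SECONDS = 30


@dataclass(frozen=True)
class Act:
    agent: str
    obj: str
    relation: str
    mode: str
    intensity: str
    seconds: float  # how long all five features stayed constant


def unit_count(act):
    """Units scored for one stretch of constant features (assumed: round up)."""
    return max(1, math.ceil(act.seconds / MAX_UNIT_SECONDS))


def proportion_of_units(acts, predicate):
    """Share of all action units that fall in acts satisfying predicate."""
    total = sum(unit_count(a) for a in acts)
    matching = sum(unit_count(a) for a in acts if predicate(a))
    return matching / total if total else 0.0


# Hypothetical record; each entry ends where any one feature changes.
acts = [
    Act("child", "toy", "nurturance", "holding", "unrated", 20),
    Act("child", "nurse", "hostility", "biting", "severe", 45),
    Act("nurse", "child", "nurturance", "talking", "unrated", 60),
    Act("child", "nurse", "avoidance", "crying", "mild", 70),
]
total_units = sum(unit_count(a) for a in acts)
hostile_share = proportion_of_units(
    acts, lambda a: a.agent == "child" and a.relation == "hostility"
)
```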

The unitizing principle here begets 
a new unit for each of several important 
changes in behavior or situation. This
principle guarantees division of be- 
havior into parts with a certain amount 
of psychological integrity. The change 
criteria are so many and varied, how- 
ever, that the meaning expressed by any 
sum of these parts is equivocal. We 
do not find in Heinicke’s fascinating 
report either numerator or denominator 
for any of his percentages. This is 
understandable. An absolute N of happenings
so multiplexly defined as these
acts is not interesting. It would be 
otherwise if, for example, the sole 
criterion of change in act had been a 
change in object, perhaps, in some 
fundamental kind of agent-to-object
relation, or in direction of behavior.
One would then want to know the
number of acts per interval of time
under specified conditions. Although
variably defined behavior segments
must be more legitimate as counting
integers than arbitrary time intervals,
segments with more nearly univocal
meaning can be summed to greater
advantage and would seem to give
a sounder base for derived measures.

Observational child study may need 
nothing more than a basic unit of de- 
scription that (1) can be used in the 
field, (2) takes in a diversity of behavior 
and situation variables, (3) has psy- 
chological integrity, (4) has clear mean- 
ing in terms of some central criterion, 
and (5) permits study of behavior in 
context. All of the foregoing descriptive 
units meet some or all of these specifi- 
cations. As for application aims, we 
suggest only that a unit which satisfies 
each of the five specifications could be 
made to serve many normative, systematic,
ecological, and idiographic purposes.
There seems to be much room
here for discovery and invention. 


A highly specialized analytical scheme, 
which might be considered a kind of field 
unit analysis, has been developed by 
Birdwhistell (1955) for the “systematic 
study of how human beings communicate 
through body movement and gesture.” 
Called kinesics, this arresting method 
provides for sequential analysis of “visually 
perceptible body shifts” and for inter- 
pretation of their significance in “nonver- 
bal interpersonal communication.” Only 
pilot applications in research with children 
(id.) have thus far been reported.


TRAIT RATING


Essentials of trait rating as an ob- 
servational method can be seen in its 
widely familiar routines. 

The observer O goes to a school, 
home, or other setting in which there is 
at least one child; or perhaps O lives in 
the setting or works in it, as in the case 
of a teacher. O has before him or in 
mind a battery of scales that describe 
various dimensions of behavior; or, if he 
is a resident observer, these scales may 
be unknown to him. The time comes in
any event when, after observing the 
child—possibly on a single occasion but 
probably on many occasions over days, 
weeks, or months—O uses the scales to 
sum up what he has seen in a series of 
trait ratings. He may do this after con- 
secutive periods of time, so that tests 
can be made of intrarater consistency 
or, if the interim periods are very long, 
so that changes in behavior can be 
measured. Often, too, someone inter- 
correlates the ratings to test for be- 
havior syndromes or correlates them 
with external measures to investigate 
their validity and psychological bear- 
ings. The rated variables have included 
among others, in great variety, cheerfulness,
jealousy, apprehensiveness,
curiosity, planfulness, fancifulness,
conformity, friendliness, competitive- 
ness, respect for authority, sense of 
humor, and adjustment to new situa- 
tions. 

One finds here many skillfully de- 
veloped instruments. Prominent early 
and later examples include the behavior 
scales of Haggerty, Olson, and Wickman
(1930), Berne (1930), Conrad
(1933), Van Alstyne (1936), Joël
(1936), Roberts and Ball (1938), and 
Richards and Simons (1941). These and 
comparable schedules generally have 
scored well on tests of reliability and on 
cross validations with other measures. 
They have been used almost invariably 
for normative or idiographic study of 
the child. Quite another possibility is the 
one of using them to compare effects 
upon behavior of different settings or 
conditions of life. This possibility has 
received little attention. What we mean 
by it is clearly demonstrated, however, 
in engaging research by Koch (1955a,b
and 1956a,b) on consequences for child
behavior of sibling position as a prop- 
erty of the family setting. 

Each of our other methods deals with 
behavior in progress. Present actions 
or conditions are described and ob- 
servations are recorded when they are 
made. In applications of trait rating, on 
the other hand, stable ways of behaving 
are described and observations are re- 
corded after they are made. These ob- 
servations are essentially direct in that 
they are not screened by interference 
techniques or by passage of enough 
time to make them fade out con- 
siderably. Each rating amounts to a 
statement that summarizes cumulative 
direct observation, as indicated in Table 
3-2. Yet what the observer records is 
more an assessment of personality than 
a description of behavior or its con- 
ditions; behavior rating scales are in- 
deed commonly grouped with and even 
called personality inventories. Accord- 
ingly, to save problems in appraisal of 
personality characteristics for Chapter 
17, discussion of the method is limited 
here to these identifying comments.


ADDENDUM: ON THE LARGER
CONTEXT OF CHILD BEHAVIOR 


Methods now at hand are merely 
added to the foregoing survey of “principal”
ways to investigate naturally occurring
child behavior only because
they exceed the common limits of ob- 
servational child study. Their im- 
portance is another question. 

We refer here to methods that bring 
into focus, not the behavior continuum 
of the individual child as represented 
in Table 3-2, but life settings in the 
larger context of that comparatively 
slender stream. Thirty years ago 
Baldwin and others (1930) observed
and described one-room schools, country
churches, homes, public libraries,
and other settings in the larger environ- 
ment of Iowa farm children. Dennis and 
Dennis (1940a,b), working in the tradi- 
tion of cultural anthropology, have used 
field techniques to depict and relate to 
child-rearing practices the life settings 
of the Hopi child. Such examples bring 
to mind at once a long line of pertinent 
anthropological studies and also socio- 
logical studies of delinquency areas, 
habitat factors in social class, and the 
like. We shall review here only some 
convenient illustrative research in the 
psychological literature. 

A recent concerted attempt to map 
the environmental penumbra of in- 
dividual child behavior has been made 
in a survey of behavior settings in the 
town of Midwest (Barker and Wright, 
1955, pp. 44-176). A behavior setting
amounts to a special case of the general 
everyday concept of life setting. This 
community unit can be defined as a
stable part of the physical and social
milieu together with an attached stand- 
ing pattern of human behavior. The 
milieu part always has its own distinc- 
tive characteristics of time, place, things, 
and personnel that accommodate the 
behavior pattern. This pattern, too, is 
always extraindividual in that it stays 
essentially the same over extended 
periods of time while different in- 
dividuals come and go. Examples of be- 
havior settings in Midwest are the 
Second-Grade Classroom, Hooker's 
Tavern, the Weekly Meeting of Scout 
Troop 96, and the Methodist Regular 
Worship Service. Raising hands, shoot- 
ing pool, saluting the flag, and singing 
the Doxology, respectively, are com- 
ponents of the standing behavior pat- 
terns attached to these settings. 
Behavior settings are common enough 
as empirical entities that lay persons and 
social scientists identify them and many 
of their features and functions with 
close agreement. This holds, indeed, for 
the behavior patterns anchored to them. 
These behavioral wave phenomena in 
Midwest have been described in terms 
of such functions as aesthetics, earning 
a living, education, recreation, religion, 
and social contact. Relative weights of 
these different action patterns in settings 
entered by infants, preschool children, 
school children, and others of various 
age, sex, class, and color groups are 
described. Variations from setting to 
setting in the incidence of gross motor 
activity, listening, talking, and com- 
parable behavior mechanisms are also 
determined for different population 
subgroups. Measurements are made as 
well of the territorial ranges, occupancy 
times, degrees of segregation, and penetration
levels of children and others in
Midwest’s behavior settings. The sur- 
vey data were gathered by participant 
observers whose sources of information 
included the local press, public records, 
and informants. Tests of agreement be- 
tween independent observers and 
analysts indicate adequate reliability of 
the obtained measures. 

Watch a child move from home to 
school to a park to the corner drug- 
store and so on through a day. His be- 
havior will be seen to change with re- 
markable speed as he goes from setting 
to setting. One discovers, then, that 
behavior settings contain forces that 
coerce individual behavior; and one can 
easily believe that these forces must be 
of critical importance in the processes 
of socialization and personality growth. 
Procedures of a behavior setting survey 
give one way to begin to diagnose and 
interpret these forces. Communities 
have behavior settings, indeed, but so 
have the separate institutions of com- 
munities. Koppe (1952) found 71 set- 
tings in the grade school of an American 
town with a population of only 390. 
A modern hospital has many different 
behavior settings. So has the average 
American church. Solution of theoreti- 
cal problems in child development and 
of practical problems in child guidance 
might be helped by application and con- 
tinuous improvement of ways to iden- 
tify, catalogue, describe, and assess these 
stages on which children play their 
parts in communities and their subordi- 
nate institutions. 

The persisting larger and more stable 
context of individual child behavior has 
a vast reach. It ranges from clothes, 
family customs and manners, school
desks, and school rules to the affairs of 
nations. Research in this range, which 
stands nearer to the child than the Mid- 
west setting survey, is found in work 
with the Fels Parent Behavior Rating 
Scales. These scales comprise an instru- 
ment for appraisal of the child’s home 
environment as a function of enduring 
parental behavior patterns. First de- 
vised by Champney (1941), this instru- 
ment has been used extensively in the 
Fels Home Visit Program. It has been 
developed and applied in research by 
Baldwin, Kalhorn, and Breese (1945, 
1949). Special studies have also been 
made of its applications by Roff (1949), 
Lorr and Jenkins (1953), Lasko (1954),
and Crandall and Preston (1955). Only 
supplementing many secondary writ- 
ings, we want to recount features and 
applications of this well-known tool 
with significance from the standpoint of 
observational method. 

Thirty scales make up the rating 
battery. These scales cover dimensions 
of the home setting that include such 
variables as child-centeredness, pro- 
tectiveness, readiness of criticism, re- 
strictiveness of regulations, clarity of 
policy, discord in the home, and severity 
of penalties. The ratings are based 
mainly upon information obtained dur- 
ing repeated home visits in the course 
of which informal interviews or other 
means of getting pertinent evidence 
may be added to direct observation. 
These estimates are not left to stand 
alone in the Fels applications. They are 
supplemented at the time of each visit 
by a clinical summary that records gen- 
eral impressions and cites specific in- 
cidents to epitomize important features 
of the home situation. Satisfactory reliability
of the scales is demonstrated by
measures of both intrarater consistency 
and interrater agreement. 
Baldwin and associates have prepared
a manual (1949, Part II) for use
of the Fels Parent Behavior Rating
Scales by home visitors. The manual
treats the visit, rating techniques, scoring
and standardization, administration
of a home rating program, and the
scales as such. Each scale variable is
defined and cue points of every dimension
are phrased with abundant references
to concrete grounds for high to
low ratings.


[A] rating is usually an estimate of the 
impact of the parent’s behavior on the 
child. This is clearly different from the 
parent behavior as perceived by the visitor 
(p. 32). 

The Fels ratings and the Midwest 
setting survey differ here. The Midwest 
survey does not weigh impacts of 
standing behavior patterns upon in- 
dividual children. This is not intended 
even in ratings by Simpson (1956) on 
approval, affectionateness, and other 
such variables of social weather as a 
property of a setting. It is here es- 
pecially that the Fels instrument gets 
nearer to the behaving child. Yet it does 
not record discrete parental actions as 
elements of the child’s momentarily 
changing situations. It does not count 
or interrelate or connect with ongoing 
behavior of the child specific rewards, 
punishments, denials, entreaties, or com- 
mands. Instead, it measures conduct re- 
sultants in the form of disciplinary 
policies, warmth of the home, family 
adjustment mechanisms, and other behavior
configurations that have considerable
stability as attributes of the
enduring home environment. 

In one application (Baldwin et al., 
1945) ratings of 124 homes are intercorrelated
by a method similar to Sanford’s
technique of syndrome analysis
(1943). Resulting groups of related 
variables permit classification of these 
homes into such types as the rejectant- 
autocratic, the casual-indulgent, and the 
acceptant-democratic. Differences in 
consequences for child development 
over a period of three years are found 
among the several types of homes. 
Clusters of variables that define warmth, 
adjustment, restrictiveness, clarity, and 
interference as home dimensions are set 
forth (in Baldwin et al., 1949, pp. 4-17) 
to guide clinical interpretation of the 
ratings. Graphic profiles that show 
the thirty ratings per home in sigma 
scores also serve this purpose. Crandall 
and Preston (1955) analyze ratings 
made with the scales of homes other 
than and yet comparable in locale and 
socioeconomic level to those studied by 
Baldwin and associates (1945). They 
report intercorrelations that demon- 
strate considerable similarity in parental 
behavior patterns between the two 
samples of homes. Hence their findings 
suggest that McClelland (1951) and 
others have been safe in basing theoreti- 
cal bets on measures secured by the 
method. 
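The sigma-score profiles and intercorrelations mentioned above can be illustrated with a minimal sketch. It assumes that a "sigma score" means an ordinary standard score (deviation from the sample mean in units of the sample standard deviation) and that intercorrelation means the Pearson product-moment coefficient; the source does not spell out either computation. The scale names and ratings are hypothetical.

```python
from statistics import mean, pstdev


def sigma_scores(ratings):
    """Convert raw ratings on one scale to standard (sigma) scores."""
    m = mean(ratings)
    s = pstdev(ratings)  # population SD over the sample of homes
    if s == 0:
        raise ValueError("all ratings identical; sigma scores undefined")
    return [(r - m) / s for r in ratings]


def pearson_r(x, y):
    """Pearson correlation between two scales rated on the same homes."""
    if len(x) != len(y):
        raise ValueError("both scales must be rated on the same homes")
    zx, zy = sigma_scores(x), sigma_scores(y)
    return sum(a * b for a, b in zip(zx, zy)) / len(x)


# Hypothetical ratings of four homes on two scales (not data from the source).
child_centeredness = [2, 4, 6, 8]
protectiveness = [1, 2, 3, 4]

profile = sigma_scores(child_centeredness)
r = pearson_r(child_centeredness, protectiveness)
```

A full application would repeat the correlation for every pair of the thirty scales before any grouping of related variables is attempted.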

There used to be a wide strip of no- 
science-at-all land between extremely 
proximal conditions of human behavior 
and such distal conditions as economy, 
political organization, and geography. 
Sociologists and anthropologists long 
ago colonized that middle ground to 
study social classes, primary groups, 
mores, customs, and even the roles of
persons. The studies just sampled sug- 
gest that child psychologists can work 
productively with their preferred tac- 
tics of stepwise measurement in the 
same area of comparatively stable and 
lasting environmental conditions. 


Common Problems 


Some problems that are common to 
all methods of observational child study 
need special consideration. 


OBSERVER INFLUENCE 


How natural or unnatural is the be- 
havior of a child and his associates in 
the presence of an observer? What can 
be done by an observer to minimize his 
influence, such as it may be, upon the 
behavior he records? Observational re- 
search with children has been up against 
questions in this vein from the first. 

One thinks here at once of observing 
from behind a one-way vision screen as 
a way out of the problem. But this is 
not often practicable if one wants to 
describe behavior and situations in life 
settings. So, instead of trying to hide, 
the observer can take a nearly opposite 
tack. He can first get acquainted with 
his subjects and others who are likely to 
be on the scene and then play his part 
as a nonevaluating, impartial, non- 
participating, nondirective, and friendly 
person with interest in children. 
Jersild and Meigs (1939, p. 9) write 
apropos of experience with this tactic 
that “much-observed children, as well as 
adults, seem quite readily to become 
habituated to the presence of an outsider.”
They add that time has “a
tranquilizing effect,” and this seems
understandable. We suppose that a child 
or even an adult cannot entirely stop 
being himself for long because he is 
being watched. Forces of life settings 
may always be stronger than induced 
forces of a detached onlooker. 

The observer can do much to dampen 
his influence upon the child. His part is 
to fade into the background as much 
as possible. This obviously requires the 
set of a neutral bystander, yet it does 
not call for wooden unresponsiveness. 
An observer is sure to stand out at times 
as a stationary figure against a moving 
ground if he tries to be only a listening 
post. He had better answer questions, 
for example, or certainly become a cen- 
ter of uneasy attention. Peripheral vision 
can sometimes help against the certainty 
that no one likes to be watched beadily, 
and some of the advantages in con- 
cealed observation often can be gained 
from skillful “preoccupation with other 
matters” (Jersild and Meigs, 1939, p. 
9). More may be gained by openness 
and candor. The child and everybody 
in town or at school, say, is going to 
know anyhow what is going on. 

Such effects of the observer as there 
may be need not go entirely unmeasured.
Teachers, parents, or other adults
can be asked to rate deviancies in the 
child’s behavior during periods of ob- 
servation. Behavior of the child in rela- 
tions with the observer and with other 
adults can be compared. At least one 
attempt has been made to do this quite 
systematically (Barker and Wright, 
1955). Social interactions of eleven 
Midwest children with a cumulative 
total of 14 observers during a day in the 
life of each child are described. The
results show, among other things, rel- 
atively low frequency of observer-child 
interplay, less frequent dominance, 
nurturance, resistance, aggression, and 
avoidance by the observers than by
other adults, and relationships between
observer and child that were even-tempered
and generally free of extremes
in emotive behavior of any kind. Con- 
trary results would of course be 
startling. 

Age of the subject clearly is a factor 
in sensitivity to an observer. One
guesses that observational adult study 
or even observational adolescent study 
would call for very special methods. 
Probably, the younger the child the
fewer the complications, although children
up to eleven years of age have
served willingly and without symptoms 
of great stress or self-consciousness as 
subjects of full day studies. 

Human field observers are sure to 
miss much in child behavior that percip- 
ient ghosts would see. Probably their 
presence attenuates especially the bad 
extremes of behavior from cuss words 
on down. But this leaves much that ex- 
perimenters are never going to see in 
laboratories at their inestimable best. 


Problems in public relations are added 
to problems in relations with subject 
children when studies are made outside 
preschools and other institutions with 
built-in arrangements for research. This 
holds especially for community projects. 
Shall the investigator live where he works? 
Shall he, in any case, work at least a part 
of the time as a participant observer? Or 
shall he keep his distance and look in from 
the outside? Should he ever pass himself 
off as John Jones, fellow layman? (One can 
doubt it, with appreciation for arguments
to the contrary.) Whose consent to do 
this or that ought the investigator to 
secure? Ought he to keep mothers, fathers, 
and others who are in on the research in- 
formed of his progress? How much and in 
what ways should lay adults be asked to 
help? Without pay? With pay? Ought 
final results to be shared with cooperating 
lay people? How about press relations? 
Psychological field projects can stand or 
fall on answers to these and related ques- 
tions. But this is a matter in social-science 
management that can only be raised here. 
Whyte (1951), among others, has dis- 
cussed many of the problems helpfully. 


RELIABILITY OF OBSERVATION 


Reliability of observation has gen- 
erally been defined empirically in ob- 
servational child studies by agreement 
between simultaneous and independent 
observers. Tests based upon this defini- 
tion are included in representative ap- 
plications of every method in our sur- 
vey but diary description. These tests 
have turned out passably well, with few 
exceptions.


Reported measures of observer agree- 
ment in different studies do not clearly 
favor any principal method over any 
other. Nor do we find clear differences 
in trustworthiness of observation as be- 
tween different sorts of observed phe- 
nomena. Yet one clear indication stands 
out; namely, that many of the bigger 
and putatively vaguer phenomena pass 
the standard reliability test of observer 
agreement. Take from Bishop (field 
unit analysis: 1951) the example of 
noncooperation: ignoring or refusing 
to comply with or to accept a social 
approach of another person. This variable
subsumes a multitude of alternative
mediating behaviors; and observing it 
requires some inference. Consider then 
that Bishop reports an over-all mean 
agreement percentage of 89 on nonco- 
operation. Similar examples include 
agreement percentages of 85 to 97 on 
friendliness and quarreling (time sam- 
pling: Green, 1933a), anger-evoking 
situations (event sampling: Ricketts, 
1934), episodes of imaginative behavior 
(time sampling: Markey, 1935), verbal 
or physical parent-child communication 
(specimen description: Lafore, 1945), 
social conflicts (time sampling: Jersild 
and Markey, 1935), and affection (field 
unit analysis: Moustakas, Sigel, and 
Schalock, 1956). 
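The agreement percentages quoted above can be made concrete with a small sketch. The studies cited do not all state how the figure was computed, so the form used here (matching judgments divided by all paired judgments, times 100) is an assumption, and the category labels and records are hypothetical.

```python
def percent_agreement(observer_a, observer_b):
    """Percentage of paired judgments on which two observers agree."""
    if len(observer_a) != len(observer_b):
        raise ValueError("records must cover the same observation units")
    if not observer_a:
        raise ValueError("records are empty")
    matches = sum(a == b for a, b in zip(observer_a, observer_b))
    return 100.0 * matches / len(observer_a)


# Hypothetical simultaneous records, one judgment per observation unit.
observer_a = ["coop", "noncoop", "coop", "coop", "noncoop",
              "coop", "coop", "coop", "noncoop", "coop"]
observer_b = ["coop", "noncoop", "coop", "noncoop", "noncoop",
              "coop", "coop", "coop", "noncoop", "coop"]

agreement = percent_agreement(observer_a, observer_b)  # 9 of 10 units match
```

A figure of this kind takes no account of agreement expected by chance, which is one reason values from different studies and different category systems are hard to compare directly.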

Arrington reported in 1932 that her 
efforts to develop accurate techniques 
for recording children’s behavior with 
things and persons had led her to single 
out “finer and finer units” and to work 
toward elimination of “every element 
capable of different interpretation that 
might reduce reliability of the data re- 
corded” (Arrington, 1932a, p. 27). Yet 
her accuracy indices are not remarkably 
high in present perspective. She does report
a median agreement per cent of
99 on physical contacts with such 
material things as paper towels, cups, 
chairs, walls, and iron gratings. But she 
also reports comparable values as low 
as 72 and no higher than 91 on gross
physical activity not involving material 
things, no overt activity, talking, laugh- 
ing, crying and physical contacts with 
persons. Scores from independent ob- 
servers of Olson’s tics (see p. 93) cor- 
relate .75. A provocative fact is that, 
in general, the smaller and more literally 
objective behaviors have not fared 
prominently well over the long run in
tests of observer agreement.

Some might suppose that the larger 
behavioral wholes on which independ- 
ent observers agree acceptably are al- 
ways defined reductively in terms of 
subordinate and more directly visible 
parts. Such is not the case. Instead, with 
exceptions to be noted, the core in- 
tended meaning of the target phenome- 
non is stated without either elaborate 
conceptual formulation or detailed 
specifying of “criterion” behavior 
items. Operational pointing is often 
done with only a minimum of examples. 
Moustakas and associates (1956), for 
instance, define seeking recognition, on 
which they get an agreement per cent 
of 84, by the following statement, in 
which A and C, respectively, are an 
adult and the child. 


Seeking recognition: Attention to ac- 
tivities, productions, or statements is 
sought by A or C. Ex: See what I’ve made? 
Ex: Look what I’m doing (p. 113). 


This must leave a thousand sub- 
ordinate behaviors that could be used 
as criteria for seeking recognition. But 
these are not even grouped in major 
classes or so much as sampled studiously. 
Seeking recognition is bidding for 
attention, as this social act is commonly 
and confidently known and perceived 
by socialized man. The target is left 
to stand right about there for the ob- 
server; and this example could be 
multiplied many times. 

Molar phenomena of child behavior 
evidently can be observed dependably 
on the basis of definitions that identify 
these phenomena as objects of common 
—even if partly inferential—perception. 
Nor can this be very surprising. Common
perception of the same phenomena
works well in everyday life. It normally
enables us to adapt most of our
actions with marvelous efficiency to 
friendliness, anger, fear, affection, 
seeking recognition, and the like, as 
these occur all around us hour by hour. 
So it turns out, moreover, despite the 
biasing effects of subliminal interests 
upon ability to see things as they are in 
real life. Then why should the same
ability be unequal to the disinterested
observations of science? 

Reduction statements here seem gen- 
erally unpromising. Think again of 
seeking recognition. Listing all of the 
visible acts and audible words that
can mediate this way of behaving obviously
is not a serious possibility. One
can choose plainly visible items from
among these behaviors and, for positivistic
surety, make them stand for the
bigger and supposedly vaguer target.
But an investigator who does this must 
end up with something less than and dif- 
ferent from bidding for attention as he 
would like to understand, predict, and 
control it in life settings! Such com- 
promising has been going on for a long 
while in psychology, and it does seem 
inescapable for some measurement purposes
in experimental research. We may
have shown, however, that reliability 
strictures do not demand it in observa- 
tional studies. Perhaps these studies 
should build on common perception as 
a homely base. This would leave plenty 
of room for conventional scientific 
operations in the overlapping stages of 
theory construction, comparative de- 
sign, data analysis, and integration of 
findings. Possibly direct observers of
children ought to stop being scientists
and try to be ordinary citizens while, 
in the stage of field work, they see as 
best they can the larger actions and 
situations of children. Many seem to 
have done this, and welcoming the prac- 
tice could be a good thing for work on 
many problems. 


A few child observers have worked 
hard at reductive definitions of molar be- 
havior variables. Hagman, for example, 
develops an extended definition of a com- 
panion, which refers in part to criterion 
molecular behaviors. Her statement builds 
on Webster’s by asserting that a com- 
panion is one who “(1) Comes into physi- 
cal contact with another ... (2) talks to 
another ... (3) listens to another ... (4) 
looks at another ... (5) approaches an- 
other within ten seconds ... (6) withdraws 
from another ... (7) plays with or touches 
the same unit of apparatus as another ... 
(8) participates in the same game or proj- 
ect as another ... (9) looks at the unit of 
apparatus another is playing with ... (10) 
gets something and immediately gives or 
shows it to another” (Hagman, 1933, pp.
16-17). We suggest that this statement has
two limitations that are likely to hold for 
others like it. First, it leaves out many 
behaviors that can mediate companionship. 
Second, it includes several behaviors that 
are not necessarily diagnostic of this rela- 
tion, as witness items (1), (4), (5), (6), 
(7), and (9). In any case, Hagman reports 
an observer agreement percentage of 83 
for identification of companions. This 
value is based on 688 judgments of be- 
havior of 39 2-to-4-year-olds. 

The research on association units, which 
is represented above (pp. 109-110), deals 
in a different way with essentially the 
same phenomenon. An associate of a child, 
C, is entirely defined in this research as an 
individual, O, that C gets involved with 
in any direct way under either of two
conditions. (1) C and O get together as 
“companions” and stay together for a 
while. (2) O does something or says some- 
thing to C that C “notices,” or vice versa. 
Some examples are added, but the defini- 
tion is not limited by specific, mediating, 
subordinate, molecular, literally objective, 
immediately visible, criterion particulars. 
Then consider that 30 observer agreement 
percentages, with a median of 96, are ob- 
tained. These percentages are based on 
thirty hours of observation by 12 lay ob- 
servers of 30 6-to-11-year-olds. 


A consideration that meets concern 
here for the necessity of somehow 
defining behavior phenomena to be 
observed is that formal definition can 
specify conceptual properties without 
being positivistically reductive. A com- 
plex behavior variable often presents to 
an observer alternative connotations 
that are sure to leave him confused 
until a choice is made as to what it shall 
mean. One is therefore not surprised 
to find that in a study by King, Ehrmann,
and Johnson (1952) joint discussion
by different observers of the meanings
in cooperation, dominance, and
hostility improved their agreement on 
ratings along these dimensions of the 
behavior of preschool children. We 
suppose, too, that it would be impossible 
in many instances to get along without 
systematic and carefully disciplined 
sorting and threshing of alternative 
ways to construe complex behavior 
variables, as, for example, in the elab- 
orate definitions by Baldwin and as- 
sociates (1949) of variables in parent 
behavior. It does not follow, however, 
that new meanings are invented in 
these or other similar cases. A great 
many definitions of reliably observed
behavior would seem to express elec- 
tions of meanings that rule common 
perception. 

Investigators in this line of research 
have been characteristically conserva- 
tive about reliability of observation 
since the years of diary records. It is as 
if they felt vulnerable outside the lab- 
oratory. This conservatism led in some 
early instances to an apparent policy 
of reliable observation, first, and sig- 
nificant observation, second. Perhaps 
the time has come for the reverse policy 
of significant observation, first, and 
reliable observation, second. Investiga- 
tors may prefer to go on getting direct 
evidence that their observations are 
dependable and repeatable as best they 
can. All may want to whenever the 
services of lay observers or other un- 
usual circumstances would otherwise 
leave their data in unknown jeopardy. 
Yet there are circumstances, as in some 
applications of specimen description, 
in which putting reliability first could 
greatly hamper effort to learn some- 
thing new. 

A respectable recourse for the in- 
vestigator without enough (or any) 
direct evidence that his observations are 
reliable remains in any case. He can 
look at his findings and ask a number 
of questions about them. Are they internally
consistent? Are they in line
with tenable theories? May they even 
fulfill derivations from theory? Do they 
show relationships that outdo chance? 
Doubts of the reliability of observation 
would seem to be legitimately di- 
minished by a yes answer to all or some 
of these questions. It is hard to see 
especially how one could start with 
seriously unreliable data and end with
extra-chance relationships. 


INSTRUMENTAL AIDS 


The problem of instrumenting observational
child study in a material way
has been met chiefly with paper and
pencil, a timepiece, and the inevitable
writing board. Yet considerable work
has been done with cinematography
and other recording techniques by observers
of naturally occurring child behavior.
This work is so versatile and
so loaded with still new methodological
issues and difficult engineering prob- 
lems that we can comment only briefly 
upon its accomplishments and promise. 

Motion pictures of child behavior in 
essentially uncontrolled life settings 
include films that document behavior 
of the newborn (Spitz, 1947), conse- 
quences of social deprivation in early 
childhood (Spitz, 1949b), changes in 
behavior over years in the normal 
growth of individual children (Behrens, 
1946-1948; Langmuir, Stone, Buchner, 
and Bohmer, 1941-1952), neurotic or 
other pathological trends in develop- 
ment (Fries, 1950; Allen, 1940), the be- 
ginnings of emotional life in babies 
(Spitz, 1951), reactions to the medical 
visit as a life setting (Fries and Woolf, 
1944), and behavior during hospitaliza- 
tion after separation of parent and child 
(Robertson, 1952). These examples can 
be grouped with anthropological films 
that exhibit, among many other behav- 
ior samples from the lives of children, 
sibling rivalry in Bali and New Guinea 
(Mead, 1951a), bathing of babies in different
cultures (Mead, 1951b), family
life among the Navajo (Fries, Kluckhohn,
and Woolf, 1944), and the first
days of a New Guinea infant (Bateson 
and Mead, 1942). Here are primary and 
permanent behavior records. These 
films are reliable, and their visual lan- 
guage has many advantages over the 
best of other symbols. 

Behavior films per se are not finished 
scientific products. They are similar 
to specimen records in that each only 
preserves behavioral raw material. Both 
document, but, left unanalyzed, neither 
quite discovers. We suppose that films 
as well as specimen records become 
tools of scientific discovery in research 
on behavior only when (1) they re- 
cord representative samples of behavior 
and (2) analytical procedures draw 
from these samples uniform data which 
are generalized in the verbal and quan- 
titative language of science. 

Beginnings have been made in ap- 
plications of cinematography that go 
beyond documentation to inquiry in 
observational research with children. As 
early as 1937, for example, Bernhardt, 
Millichamp, Charles, and McFarland 
(1937) used motion pictures in a study 
of social contacts by children with their 
peers in a nursery school. They define 
a social contact as “any relationship 
with another child or overt behavior in- 
volving other children” (p. 11). A 
silent-movie camera recorded the action 
of each subject while paired observers 
recorded the target phenomena by a 
time sampling technique. The same ob- 
servers then used precisely the same 
technique to record the same phe- 
nomena as pictured when the film ob- 
tained was projected repeatedly at 
normal speed until differences between 
the observers were resolved. Perhaps the
most noteworthy finding of this study
now is that the number of observed 
contacts was increased approximately 
Jo per cent by addition of what was 
seen on the screen to what was seen in 
the field!¹ So it turned out, even though
the per cent of agreement between 
the two observers was 82. Note also 
that the film was silent. One can choose 
to be discouraged by the indication that 
observers ordinarily miss a great deal 
or to be encouraged by the evidence 
that a motion picture adds much to 
observations made without its help. 
Birdwhistell has made frame analyses 
of films in describing body movement 
and gesture by child subjects in his 
kinesiological research (Birdwhistell, 
1955; see also p. 113). Gilmer (1933) 
has described spontaneous responses of 
newborn infants on the basis of frame 
and slow-motion study of films. Mead 
(1951a, 1951b) and Bateson and Mead 
(1952) have extended their analyses of 
cultural factors in personality formation 
by studying motion pictures (and still 
photographs) of children in critical 
life situations of various primitive so- 
cieties. Bowlby, Robertson, and Rosen- 
bluth (1952) elaborate hypotheses about 
outcomes for children of parent-child 
separation by applying psychoanalytic 
concepts to Robertson’s movie of a 
hospitalized child. Kerkman and Wright 
(1959) are experimenting with frame 
analyses of pictures taken by a time- 
lapse movie camera (with a r1o-mm. 


¹ It must be considered that the time samples
were taken “when the social situation
became complex and difficult to record.” 
Otherwise, as the investigators mention, the 
field observing and the repetitive picture ob- 
serving might have differed less. 
lens) as a way of describing the behavior
of children en masse in classrooms
and other community behavior
settings. Gesell (1934) has long since 
demonstrated rewarding techniques of 
cinemanalysis as applied to films of child
behavior in partially controlled situa- 
tions. Observational film inquiry into 
child behavior is in a fairly advanced 
experimental stage. 

Research (and instructional) films 
with children as subjects usually have 
been produced without live sound. This 
leaves silver where there could be 
gold because vocalizations are lost to a 
film without a sound track. A sound
motion picture of a child in a life 
setting may be the best of all possible 
records for observational study. It can 
capture and preserve indefinitely about 
all there is to see and hear of a behavior 
sequence and its immediate conditions. 
The recorded actions and situations can 
be observed and analyzed from every 
conceivable point of view for as long 
and often as patience lasts. Behavior is 
hard to study partly because it happens 
so fast; it comes and goes before one 
can get a good look at it from even a 
single point of view. Sound motion 
pictures can make everyday behavior 
open to prolonged and intensive ob- 
servation and study. Good sound movies 
of child behavior in a wide variety of 
untrammeled life settings would pro- 
vide invaluable psychological experi- 
ments in nature, and getting them re- 
quires neither more nor less than large 
amounts of research money and techni- 
cal advancements that are now within 
reach.


It would be easy to understate present
technical difficulties in the filming of
children’s naturally occurring behavior.
See Stone (1952) and Michaelis (1955).
Stone finds many obstacles in the way of
filming with live sound “observer documentaries”
of “real events” in the lives of
children. Evidently this is not feasible for
the time being if one wants films that are
suitable aesthetically for audience presentation.
Michaelis indicates, however, that
behavior sound films of quality good
enough for many inquiry purposes can
even now be made under the conditions of
field studies. His discussion (pp. 188 )
of anthropological filming is especially interesting
in this regard.


Dreger (1955) reports an interesting
application of sound recording alone in
a study of spontaneous conversation and
story-telling by children in an “utterly
unarranged” setting. Possibilities in
electronic tracking of children’s move- 
ments in a community via automatic 
recording of signals delivered from a 
transistor radio-broadcasting unit (the 
size of a cigarette pack) borne by the 
child have been considered at the Uni- 
versity of Kansas. The addition of 
sound to such a system apparently is 
electronically possible if not otherwise 
practicable or permissible. Soskin 
(1951) earlier demonstrated the feasi- 
bility of conversation pick-up from 
radio-broadcasting units on the persons 
of adults. 

Electronic observational child study 
could boom. It is clear, if one puts 
engineering and money problems aside, 
that its progress will depend upon de- 
velopment of analysis procedures to suit 
the new refinement and fidelity of its 
records. Consider motion pictures, preferably
with sound. Some kind of sequential
unit analysis could be most
adaptable to their potentialities. Films
can be episoded or divided into consecutive
integral behavior units of any
conceivable sort with the technical help 
of reruns or slow-motion projection or 
frame analysis. Adaptations of these 
procedures to problems of sequence re- 
lations in interpersonal action, for ex- 
ample, or to base problems of action- 
to-situation relationships present an ex- 
citing prospect. 


DESCRIPTIVE CATEGORIES 


Students of child development who 
use observational methods are embar- 
rassed by riches. The nearly infinite 
range and variety of naturally occurring 
child behavior give almost unlimited 
chances for categories of description. 
What categories are to be used? This 
is a problem in choice of subject matter 
and conceptual formulation. Its solution 
for every study depends upon practical 
and theoretical interests that are out of 
bounds here. The problem can be 
brought into account, however, by con- 
sideration of three prominent dimen- 
sions along which descriptive categories 
of observational child studies differ. 
These dimensions are literal objectivity, 
psychological specificity, and theoreti- 
cal integration. 

The first two of these may be iden- 
tified well enough by their names and 
examples to follow. Categories at one 
extreme of the third, theoretical inte- 
gration, are taken directly from the 
unplanned network of everyday notions
about behavior. Categories at the
opposite extreme on this dimension are 
tied to statements of developed con- 
ceptual schemes, although they gener- 
ally are not in themselves conceptually 
refined. Take seeking affection from the
study by Heinicke (1956; see also pp.
112—113) of children and their associates 
during separation of the child from his 
parents. This category refers to a com- 
monly perceived mode of action; and, 
for purposes of observing and recording,
Heinicke does not alter its everyday
core meaning or reduce it to subordinate,
mediating behavior items.
Neither does he leave seeking affection 
in the lay network. On the contrary, he 
subsumes it under the general relations 
category of succorance, which, in turn, 
is derived from a still more general psy- 
choanalytic model. Similar cases will be 
mentioned. 

One can rate the categories of any 
observational child study on each of our 
three dimensions. This has been done 
for illustration in Table 3-6 with the 
descriptive terms of two studies, the one 
just considered, by Heinicke, and the 
study by Arrington (1932a) of play 
behavior in preschool children. The 
following seem representative of the 
categories that observers used in these 
investigations. 

From Arrington: contacts with material things; verbal contacts with persons
(i.e., “talking”); physical contacts with persons.

From Heinicke: hostility—agent deprives; hostility—agent retaliates;
avoidance; seeking attention; seeking affection.

TABLE 3-6. RATINGS ON STATED DIMENSIONS OF DESCRIPTIVE
CATEGORIES USED BY ARRINGTON (1932a) AND HEINICKE (1956)

                             Arrington     Heinicke
                             Categories    Categories

Literal objectivity          Very high     Low
Psychological specificity    Very low      High
Theoretical integration      Low           Very high

The ratings, which obviously can be
only suggestive, are relative to the gen- 
eral run of observational research, not 
all research, in child development. 
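
The comparison in Table 3-6 can be restated as a small data structure. The
sketch below is only illustrative: the numeric codes attached to the verbal
ratings, and the names SCALE, RATINGS, and higher_on, are assumptions
introduced here and are not part of either study.

```python
# Ordinal codes for the verbal ratings in Table 3-6.
# The numbers are an assumption made only so the ratings can be compared.
SCALE = {"Very low": 1, "Low": 2, "High": 3, "Very high": 4}

# Ratings copied from Table 3-6.
RATINGS = {
    "Arrington (1932a)": {
        "literal objectivity": "Very high",
        "psychological specificity": "Very low",
        "theoretical integration": "Low",
    },
    "Heinicke (1956)": {
        "literal objectivity": "Low",
        "psychological specificity": "High",
        "theoretical integration": "Very high",
    },
}


def higher_on(dimension):
    """Return the study whose categories rate higher on one dimension."""
    return max(RATINGS, key=lambda study: SCALE[RATINGS[study][dimension]])


for dim in ("literal objectivity",
            "psychological specificity",
            "theoretical integration"):
    print(f"{dim}: {higher_on(dim)}")
```

Like the table itself, the codes express rank order only; distances between
them carry no meaning.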

Several differences and relationships 
are expressed by these illustrative rat- 
ings. Note first that high literal objec- 
tivity is combined with low psycho- 
logical specificity in Arrington’s cate- 
gories. Her physical contacts with per- 
sons, for example, are as palpable as can 
be and yet so lacking in psychological 
discriminativeness, so very general and 
vague in behavior content, as to make 
taps on the shoulder, hostile blows, and 
love pats all one and the same (cf. 
Murphy and Murphy, 1935). Heinicke’s 
categories are oppositely directed. See- 
ing either seeking affection or seeking 
attention, for example, requires at least 
“perceptual inference” (Heider, 1944), 
and yet each of these is a singular kind 
of social action. Both might involve the 
child in physical contact with another 
person but each stands as a particular 
and unique sort of interpersonal be- 
havior. We have already placed Hein- 
icke’s categories on the scale for the- 
oretical integration. Arrington’s, by 
contrast, are linked at most with im- 
plicit behavior theories. 

Positions on these scales of certain 
observational methods are fairly clear, 
as the following spot checks indicate. 
The categories of specimen description 
are, in the recording stage, those of 
common speech. That is to say, they 
are quite freely inferential, highly spe- 
cific in psychological content, and in 
no way bound to explicit constructs. 
Descriptive terms in the analysis stage
of the same method have run the gamut
on each scale. This includes the scale of
theoretical integration, on which categories
with moorings in psychological
field theory have been applied in studies
by Gump and associates (see p. 92) and
also in the Midwest research (see p. 91).
Possibly the remaining clearest examples
of theoretical integration are in applications
of field unit analysis. Heinicke’s
research is representative here, and so is
the study by Bishop (see pp. 110-112),
in which categories to describe mother-child
interaction are geared throughout
with stimulus-response formulations.
The profile in Table 3-6 of Arrington’s 
study seems to fit well many early ven- 
tures in time and event sampling. Time 
sampling especially has moved down on 
the scale of literal objectivity and up on 
the scale of specificity, as in going 
from verbal contacts with persons 
(Arrington, 1932a) to verbal aggression 
(Walters et al., 1957) and friendly ap- 
proaches (Marshall and McCandless, 
1957). 

Advantages in work with the more 
specific categories seem clear. To take 
an extreme example for what it implies 
in less extreme cases, building a useful 
theory about physical contacts with 
persons must be nearly impossible, es- 
pecially when behavior so classified is 
recorded out of context, as it has been. 
Such a concept is so broad and equi- 
vocal as to mean almost “nothing in 
particular.” It lifts one phenotypical 
aspect from the whole of behavior and 
draws into it a multitude of things 
which, genotypically, belong far apart. 
On the other hand, one can start to 
build a theory about hostility (agent
deprives) or hostility (agent retaliates)
or avoidance or seeking affection or 
seeking attention, with each singled out 
as one kind of contact with persons. 
One can indeed begin to build a theory 
that relates these and other such psy- 
chologically specific categories with 
one another. So one finds on referring 
to Heinicke’s research and to other ven- 
tures in current child psychology. 

Our discussion of reliability in ob- 
servation has anticipated a need for 
partly inferential description in research 
on molar phenomena of child behavior. 
Objectivity that rules out everything 
but palpable facts is unequal to the requirements
of observing in such research.
Behavioral categories that call
for some degree of inference can never- 
theless be objective in an important 
sense if the following is correct. 


On the negative side, objective inquiry 
means correcting against biases and special 
viewpoints that limit and misrepresent 
reality. On the positive side, it means 
describing conditions in the context of 
whatever one is trying to be objective 
about as they really are in that context. 
It follows that, if one is to be objective 
in describing actions and psychological 
situations, one must break through a wall 
of directly observed conditions and events 
by inference, and must see things as the 
other person sees them from his view- 
point, and as they really are for that 
person. If, instead, the student of molar 
behavior sticks to the facts as he sees 
them from his own viewpoint, and as 
they are for him, he will not be objective, 
but subjective in a bad sense, biased—and 
wrong (Barker and Wright, 1955, p. 205). 


Anyone left with concern about this 
issue might reconsider the evidence for
reliable observing of behavior that no
one can see directly. 

Blindness of data collection must vary 
inversely with the degree in which de- 
scriptive categories are bound to con- 
ceptual models. This is not to propose 
that good research designs in observa- 
tional child studies have to be cast in 
postulate-theorem-demonstration form. 
Precedent for anything quite like this 
is meager, which is understandable. The
facts of naturally occurring behavior 
are so numerous, various, widely dis- 
persed, and complex that it is much 
easier to ask questions about them than 
to predict them; hypotheses in the true 
meaning of logical derivations from 
theory are not easily come by in this 
jungle area. At the same time, the prob- 
ability of finding significant relation- 
ships between these facts seems sure to 
be increased by articulating the cate- 
gories for gathering them with explicit 
constructs of one sort or another. It 
may go without saying that these con- 
structs need not belong necessarily to 
particular psychological “schools,” as 
in our examples. 


On Uses of Observational 
Methods 


Different methods of observational 
child study have been examined here 
one by one. Viewing these methods
collectively now, I should like to con- 
clude by considering their possibilities 
in the larger research program in child 
development. Possible gains of further 
ecological, normative, idiographic, and 
systematic applications will be stated in
an effort to summarize and develop earlier
points and findings. Some proposals
apply better to certain methods than to 
others, but readers can note where this 
is true. 


1. Continued ecological research with 
these methods can lead to a natural his- 
tory of child behavior. Like its pre- 
decessors in biological science, such a 
natural history would store plain infor- 
mation. Where do children spend their 
time away from laboratories, testing 
rooms, and clinics in different localities 
and cultures? What life settings do they 
inhabit and what are the characteristics 
of these settings? What situations-of- 
the-moment arise in particular episodes 
and what behavior of children do these 
situations engender? Theoretically and 
practically important problems of mat- 
uration and learning, of social behavior, 
and of personality formation need answers
to these questions.

Observational methods do not offer 
the only approach to natural history 
problems in child development. Every- 
day child behavior and its conditions 
as well can be inquired about in inter- 
views—with children or informants. 
Both can be similarly explored by ques- 
tionnaires or reconstructed in part from 
responses to suitable tests and experi- 
mentally contrived settings (Wiest, 
1957). Yet direct observation remains 
the least roundabout way to get data 
on the natural habitats and behavior of 
children.

2. Normative applications can pro- 
duce norms that describe child behavior 
authentically and also specify its environmental
conditions. Norms from psychological
tests and laboratory measures
describe potentialities
or dispositional characteristics of classified
children. These potentiality norms
are invaluable. But we need as well nor- 
mative descriptions that make the actual 
behavior of classified children a matter 
of scientific record. It is one thing, for 
example, to look at a table of norms and
see that the average 2-year-old is able
to walk upstairs alone, to mark time,
to turn the leaves of a book, and to
open boxes. It is quite another thing to
study word pictures or other records
that tell concretely how, how often,
and in what situations representative
2-year-olds actually walk upstairs alone
or the like. Descriptions of this kind can
at least round out and vivify the potentiality
norms; and observational methods
can indeed supply them.

The predictive value of a behavior 
norm obviously goes up with increase 
in knowledge about the conditions un- 
der which it holds. Hence norms with 
built-in qualifications that point to en- 
vironmental conditions of behavior are 
much to be desired. Here, again, ob- 
servational methods seem uniquely 
promising; for parts and properties of 
the environment are spread out before 
direct observers of behavior. Not that
observational child studies with a nor- 
mative aim always seize the opportunity 
in this circumstance! The fact is that 
some of them short-circuit the environ- 
ment and connect behavior directly 
with the child. One could almost con- 
sider this an error of omission. Clear 
chances to describe behavior regulari- 
ties of 2-year-olds, 10-year-olds, boys, 
girls, or children of other such groupings
with attention to the environmental
contexts of these regularities are
wide open to research with observational
methods. Such description, moreover,
can accomplish still more by supplying
norms of psychological environments
as research products of value
in their own right. 

3. Idiographic applications stand to 
test and to supplement other procedures 
in clinical studies of individual children. 
The various diagnostic instruments of 
child clinics are often cross-validated in 
something of a circle within the clinic 
setting. Observational methods can 
breach this circle by validating play 
sessions, projective tests, and other such 
devices against descriptions of everyday 
behavior and its conditions. Research to 
this end has made only small beginnings. 
More of it could help clinicians with 
their difficult problem of seeing chil- 
dren in clinics as children are in real 
life. 

A related possibility is that, beyond 
testing other means of diagnosis, proven 
observational methods could be used to 
advantage in clinical field routines. The 
main idea here is embodied now in the 
case worker’s diagnostic role, which has 
been defined traditionally by sensitive 
interpretation of circumstances in the 
life settings of children. Good impres- 
sionistic and interpretive case reports 
may always be needed for understand- 
ing children’s troubles and adjustment 
patterns. Yet more case records of nat- 
urally occurring child behavior and its 
conditions might sharpen greatly and 
enlarge such understanding. More re- 
search to try out, adapt, and improve 
currently available ways to keep and 
analyze records of this kind might rea- 
sonably be expected to give clinical 
practice with children a stronger base.

4. Systematic applications stand to fill
gaps that would otherwise be left in our 
knowledge of the laws of child be- 
havior. Experimental child psycholo- 
gists have shown great inventiveness in 
profitable tests of hypotheses, and in- 
creasingly hard problems of interde- 
pendence keep yielding to their ma- 
nipulative procedures. Direct observers 
of child behavior have nevertheless 
worked with independent psychological 
variables that may overtax the best of 
Wundtian skill for a long while to 
come. This holds especially for complex 
variables of social context that are cen- 
tral in current theories of child behavior 
and development. Examples include ag- 
gressive, nurturant, and restrictive 
modes of action by associates, sponta- 
neously changing group size and com- 
position, rated power of bona fide com- 
panions, and enduring parental accept- 
ance or rejection. Such variables have 
been subjected to control by selection 
in observational studies, with the result 
that relationships between them and sig- 
nificant dimensions of child behavior 
have been demonstrated. These rela- 
tionships are usually of a low order. 
They are generally formulated without 
elegance, and they seldom have been 
assimilated explicitly to larger systems. 
Yet they have good points, which in- 
clude the important one that lines of 
connection between their parameters 
and conditions in the natural milieu of 
the child have never been broken. This 
advantage must help to insure validity 
of implication for the everyday lives of 
children. 

These considerations seem consistent 
with the larger experience of science. 
One may suppose on principle that science
can ultimately harness experimentally
(to its enormous advantage in precision
of measurement and control of
variables) almost any event that it 
chooses to investigate. Yet every science 
has learned that this cannot always be 
managed in practice. It is impossible 
now to make experimental arrangements 
that allow for the full size and complex- 
ity of many events, from whirlwinds to 
wars. Laws of rivers and of ocean 
currents can be demonstrated in labora- 
tories, but scientists have to follow in- 
land streams or go to sea to get new 
leads and to learn about important com- 
plications of these laws. The strategy in 
every such case is to capitalize upon 
experiments in nature by substituting 
selection for manipulation of variables. 
This has been done in observational 
child study, and it can be done better 
with continuing improvement in com- 
parative designs and in techniques of 
observing, recording, and analysis.


The purposes of this chapter are four
in number. The first is to define what 
is meant by an experimental technique 
of study. The second is to point out the 
special advantages and limitations of this 
technique when applied to the problem 
of child behavior. The third is to de- 
scribe representative problems and pro- 
cedures in the experimental investiga- 
tion of child behavior. It may be that 
such an analysis will serve to indicate 
problem areas in which experimental 
techniques may be applied or re-applied
with fruitfulness. If so, the fourth goal
of this chapter is to stimulate further
experimental study.


Meaning of the Experimental
Technique


The experimental technique is one
of several methods of obtaining information
for a science. (Most alternative
methods can be collected under the label
of correlational techniques.) The
technique is most commonly applied
when the investigator wishes to study
a suspected causative relationship holding
between a phenomenon of some
interest (dependent variable) and one or
more other factors (independent variables).
A situation is constructed in
such a way that the dependent variable
is readily observable, the independent
variable is introduced and varied in a
systematic manner, and all other variables
are controlled to prevent them
from influencing the dependent variable.
Changes in the dependent variable
are then stated as a function of the preceding
changes in the independent variable.
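
The logic of this definition can be sketched in a short simulation. Everything
in it is hypothetical: the function child_response, its coefficients, and the
names stimulus_level and room_noise are assumptions invented for illustration
and describe no actual study. The sketch shows only the form of the procedure:
one independent variable is varied systematically, a second factor is held at
a fixed value, and the dependent variable is then stated as a function of the
independent variable.

```python
import random


def child_response(stimulus_level, room_noise, rng):
    """Hypothetical behavior model: the response depends on both factors."""
    return 2.0 * stimulus_level + 5.0 * room_noise + rng.gauss(0, 0.1)


def run_experiment(levels, trials_per_level=200, seed=0):
    """Vary one factor systematically while holding the other constant."""
    rng = random.Random(seed)
    fixed_noise = 1.0  # every other variable is kept at a fixed value
    results = {}
    for level in levels:  # the independent variable, varied systematically
        scores = [child_response(level, fixed_noise, rng)
                  for _ in range(trials_per_level)]
        results[level] = sum(scores) / len(scores)
    return results


# The dependent variable stated as a function of the independent variable.
means = run_experiment([0, 1, 2, 3])
for level, mean in means.items():
    print(level, round(mean, 2))
```

Because the second factor never changes, any change in the mean response from
one level to the next can be referred to the manipulated factor alone.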

The essential concept involved in the 
definition of experimental technique is 
that of control. In experimental logic 
phenomena do not change without 
cause. A statement of cause and effect 
can be made only when a variation in 
the supposed effect is coincidental to a 
preceding variation in the supposed 
cause, if no other factor has changed. 
The best guarantee that no other factors 
have changed is to gain control of them 
and keep them at fixed values. 

This essential need for control over 
all possible causal factors has conferred 
upon experimental techniques a secondary
characteristic of being laboratory
procedures. A laboratory is simply
a place in which causal factors may be
brought under the investigator’s control
more easily than elsewhere; it has
no other essential characteristics. For
the physicist, this may mean a place in
which temperature, pressure, or radiation
are controllable; for the psychologist,
a place in which stimulation to an
organism is controllable.

Another secondary characteristic of
the experimental technique is that measurement
of changes in both the independent
and dependent variables is
done, whenever possible, by objective
(i.e., mechanical) means. Ultimately, of
course, observation is done by the hu- 
man experimenter. The traditional me- 
diation of this process through mechani- 
cal instruments is an explicit recogni- 
tion that (1) the human observer is 
often (not always) less sensitive than 
a mechanical device, (2) the human 
observer is usually less reliable than a 
mechanical device, and (3) identical 
mechanical instruments are more readily 
produced in larger numbers than trained 
human observers (an important feature 
if science is to be considered a body of 
public knowledge). 

The use of a laboratory and of ob- 
jective means of measuring add accu- 
racy and reliability to the logic of the 
experimental technique. Thus experi- 
mentation, well executed, generates rel- 
atively precise statements of cause and 
effect which are remarkably unambigu- 
ous. This is its essential attribute. Hence 
Underwood (1957), speaking of experimental
designs, says: “. . . there is only
one basic principle, namely, design the
experiment so that effects of the independent
variable can be evaluated unambiguously.”
However, it must be
noted that even an accurate, reliable, 
and unambiguous statement of cause 
and effect is not immune to any broader 
(or incorrect) meaning that the experi- 
menter may choose to place upon it. 
Interpretation is a problem outside the 
present one; but it suggests that experi- 
mental technique per se is not a final 
definition of science. Instead, it is con- 
sidered here as a technique that is usu- 
ally efficient in producing unambiguous 
statements and will frequently prove in- 
dispensable for that purpose. 


Application of Experimental 
Technique 


The formula for experimentation in 
child behavior is simply translated from 
the general formula. A situation is con- 
structed that allows for the control of 
all factors affecting a child’s behavior; 
subsequently, one of these factors is 
introduced as a variable, and objective 
observations are made of any conse- 
quent changes in the child’s behavior. 
The detailed application of this formula 
involves a recognition of the behaviors 
that a child exhibits and of the factors 
that can influence those behaviors. The 
traditional conceptual scheme for enu- 
merating the sources of all such factors 
lists three major areas; the limitations 
of an experimental technique in the 
study of child behavior become appar- 
ent when the experimenter’s ability to 
control factors from these areas is eval- 
uated. The three areas are (1) the 
child’s present environment, (2) the 
child’s history of interaction with his 
past environments, and (3) the child’s 
genetic endowment.


THE CHILD'S
PRESENT ENVIRONMENT 


Clearly, the child’s present environ- 
ment will prove the simplest to control. 
Simply by choosing a well-placed room 
as a laboratory, the experimenter pre- 
vents a large amount of uncontrolled 
stimulation to the child. Within a room 
it is practical to wield stimuli in a syste- 
matic manner and to observe and record 
behavior accurately and reliably. The 
thoroughness with which current en- 
vironment can be controlled is readily 
demonstrated on the animal level, where 
a selected response of an individual ani- 
mal in a properly engineered experi- 
mental situation can easily be shown to 
vary only in response to the experi- 
menter’s manipulation of the stimulus 
environment. The same demonstration 
with a young child has been achieved, 
although perhaps not quite so easily. 
Figure 4-1 represents the bar-pressing 
behavior of a 4-year-old girl, who, 
when presented with an amber light, 
responds in a very uniform manner for 
reinforcements consisting of trinkets, 
candy, and cookies delivered on a vari- 
able ratio schedule; when presented 
with a blue light marking a period of 
no reinforcement, the subject immedi- 
ately stops responding but just as im- 
mediately resumes at her characteristic 
rate when the amber light returns. 
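
The form of such a record can be imitated with a short simulation. This is an
idealized sketch, not a reconstruction of the study: the response rate of 100
per minute, the session length, the complete absence of responding under the
blue light, and the function name simulate_cumulative_record are all
assumptions. Only the two-minute alternation of lights and the variable ratio
of 50 come from the description of the figure.

```python
import random


def simulate_cumulative_record(minutes=12, rate_per_min=100,
                               mean_ratio=50, seed=1):
    """Return (cumulative responses at the end of each minute,
    number of reinforcements delivered)."""
    rng = random.Random(seed)
    total = 0
    reinforcements = 0
    record = []
    # Variable ratio: the number of responses required for the next
    # reinforcement is drawn at random, averaging mean_ratio.
    next_payoff = rng.randint(1, 2 * mean_ratio - 1)
    for minute in range(minutes):
        amber = (minute // 2) % 2 == 0  # two-minute periods, amber first
        if amber:  # idealization: responding occurs only under amber
            for _ in range(rate_per_min):
                total += 1
                next_payoff -= 1
                if next_payoff == 0:
                    reinforcements += 1
                    next_payoff = rng.randint(1, 2 * mean_ratio - 1)
        record.append(total)
    return record, reinforcements


record, n_reinforcements = simulate_cumulative_record()
print(record)
print(n_reinforcements)
```

The cumulative curve climbs at a steady rate during amber periods and runs
flat during blue periods, which is the pattern the text attributes to the
subject.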

Fig. 4-1. Cumulative response curve of discrimination
learning on the ninth experimental day. The subject is a
3-year-old girl. Depressions on the time line indicate two
minutes of amber light; elevations, two minutes of blue
light. Reinforcements, indicated by pips on the response
curve, were delivered on a variable ratio 50 schedule only
during the amber-light periods.

The behavior under study here is a
discrimination process. The stability of
the behavior and its immediate responsiveness
to the manipulated variables
show that the experimenter has in hand
virtually all factors involved in this
situation. Discrimination certainly is one
of the most important classes of child
behavior. These statements in combination
lend a great deal of significance to
the study described. It is important, 
nevertheless, to show the limitations of 
this significance. It would be easy to 
point to the artificiality of such studies 
and suggest that the relationships found 
need not be typical of child behavior 
in vivo. Certainly it is correct to say 
that few if any of a child’s everyday
discriminations will be formed under
conditions like those established in the
laboratory. Real life discriminations are
not made to such regular alternations of
stimuli, the cues and the responses are
more variable and more complex, the
reinforcers differ from occasion to occasion,
the reinforcement schedule is
more complex, and the entire process
is probably not pursued systematically.

A laboratory study of discrimination
is ideally suited to a study of any of
these variables acting in a pure and simple
form. Despite the fact that a child
will rarely encounter a pure and simple
case of any such variable, this information
is of value. The history of science
suggests that the complexities of the
child’s real world are probably only
multiple combinations of pure and simple
processes. Hence the complexity of
child behavior in vivo may result only
from the large number of factors in- 
volved; yet each factor by itself may 
operate in a simple manner, readily and 
completely understandable in a labora- 
tory setting. 

Since this supposition has served other 
sciences well in the past, no other jus- 
tification will be offered. Its acceptance 
means that “artificial” laboratory studies 
of the child’s response to his present 
environment become not only valuable 
but in fact essential. But by no means 
are such studies exhaustive. A science 
of child behavior will require a great 
deal of additional information that labo- 
ratory techniques will prove inadequate 
to supply. Yet any alternative assump- 
tion that the whole child in his whole 
world may prove to be more than the 
sum of his and its parts, although conceivably
meaningful, is best considered
only after a thorough study of the parts
has been made. 

Terrell reflects this point of view 
forcefully: 


There is some evidence ... for con- 
tending that the likelihood of variables 
which are known to apply at a simple 
behavioral level to be relevant also at a 
more complex level is greater than the 
probability that variables suggested as 
relevant in complicated, poorly controlled 
studies are in fact relevant at that level. 
The evidence in support of this statement 
falls into two categories: (a) the applica- 
tion of principles discovered in animal re- 
search to more complex human behavior, 
and (b) the highly inconsistent findings of 
research in human behavior concerned 
with such complex phenomena as child 
rearing variables and their effects on 
personality development (1958, p. 308). 

Whatever the merits of Terrell’s po- 
sition, the attitude expressed appears to 
be a growing one and may well account 
for much of the present and future 
character of experimental studies in 
child behavior. 

Ideally, the highly controlled, arti- 
ficial laboratory situation is not the final 
goal of an experimental technique but 
rather a starting point. When the ex- 
perimenter can show that the behavior 
under study varies only in response to 
his own manipulations of simple factors, 
then he is assured that he is in a good
position, not to generalize about the
child’s real world, but to investigate 
more complex factors. These may be 
added to his highly controlled situation, 
one at a time, and the effects upon the 
behavior noted. The culmination of this 
process may be a very complex variable 
indeed, with the experimenter in possession
of data showing the workings of
its components. “The laboratory experiment
is a technique for basic and theoretical
research and not a goal of an
empirical science” (Festinger and Katz,
1953, p. 170).


THE CHILD'S 
PAST ENVIRONMENT 


An investigator observing a child’s 
behavior in the present may believe that 
the significant factors affecting this be- 
havior are located in the child’s distant 
past. In this case the possibility of con- 
trol of the variables is lost, the factors 
gone forever. Although they cannot be
controlled by the experimenter, it may 
be possible for him to assess and relate 
them to the child’s present behavior. 
But so doing will rarely qualify as an 
experimental technique. Any supposed 
causes of a child’s behavior, located in 
his past history, may themselves be con- 
sidered effects of some prior cause. 
Hence it is not clear where in this suc- 
cession of causes the child’s present be- 
havior should be referred. Indeed, there 
exists the possibility that the child’s 
present behavior is only a manifestation 
of a long-standing interaction, which is 
also responsible for the child’s history. 
For example: one may relate current 
“adjustment” to the feeding schedule 
adopted by the mother during infancy 
(Holway, 1949). It is possible that a 
properly chosen feeding schedule pro- 
motes subsequent adjustment; it is pos- 
sible that the mother’s basic attitudes 
may lead simultaneously to a particular 
feeding routine and “good” adjustment 
in the child but that feeding routine it- 
self does not promote adjustment; and 
it is at least conceivable that early manifestations
of a child’s subsequent adjustment
may influence the mother in her
choice of a feeding schedule rather than 
vice versa. This difficulty is avoided in 
an experimental approach, basically because
it is the experimenter who is the
cause of variations in particular factors,
if he is careful to avoid bias in his own
manipulation of these factors. Yet it
cannot be denied that an experimenter
will find it difficult to establish experimental
control over a variable such as
feeding schedule for an appreciable part
of an infant's history.
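
The ambiguity in the feeding-schedule example can be illustrated with a small simulation. The sketch below is written in Python and rests on assumptions made purely for illustration: maternal attitude alone determines the child's adjustment, the feeding schedule has no effect of its own, and all names and numerical values are hypothetical. Under these assumptions a field study still finds schedule and adjustment correlated, whereas random assignment of the schedule by the experimenter removes the spurious relationship.

```python
import random
import statistics


def correlation(xs, ys):
    """Pearson correlation of two equal-length sequences."""
    mx, my = statistics.fmean(xs), statistics.fmean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    var_x = sum((x - mx) ** 2 for x in xs)
    var_y = sum((y - my) ** 2 for y in ys)
    return cov / (var_x * var_y) ** 0.5


def simulate(n, randomized, seed=0):
    """Return (schedule, adjustment) scores for n hypothetical children.

    Assumption: maternal attitude alone drives adjustment; the feeding
    schedule has no causal effect of its own.
    """
    rng = random.Random(seed)
    schedule, adjustment = [], []
    for _ in range(n):
        attitude = rng.gauss(0, 1)
        if randomized:
            # The experimenter assigns the schedule at random.
            s = rng.choice([0.0, 1.0])
        else:
            # The mother chooses the schedule in line with her attitude.
            s = 1.0 if attitude + rng.gauss(0, 0.5) > 0 else 0.0
        schedule.append(s)
        adjustment.append(attitude + rng.gauss(0, 0.5))
    return schedule, adjustment


r_field = correlation(*simulate(2000, randomized=False))
r_experiment = correlation(*simulate(2000, randomized=True))
print(f"field study r = {r_field:.2f}, experiment r = {r_experiment:.2f}")
```

The simulation does not show which of the three interpretations is correct for real children; it shows only that a correlation obtained in the field cannot distinguish among them, while experimental assignment can.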


Strategically, the experimenter is in 
a better position if he can manipulate 
a “historical” variable in the present,
with the intent of observing its effects
at some future point. This certainly can 
qualify as an experimental technique, 
with a corresponding lack of ambiguity. 
However, it will often prove difficult 
to accomplish. Simply the time involved 
in establishing the variables, waiting the 
appropriate period, and then making 
observations of their effects on the 
child’s behavior will prove an expensive 
deterrent. More important, the experi- 
menter may not be competent to estab- 
lish many variables in an experimental 
manner for a reasonable period of time. 
Children are not cared for by mechani- 
cal devices, which apply experimental 
variables only in the manner they were 
constructed to do. The human who is 
the agent responsible for applying many 
historical variables does so according to 
his own personality, which is not a 
product of the experimenter’s construc- 
tion. Hence it will be difficult to set up 
his behavior according to the experi- 
menter’s random number table. Finally, 
the ethical consequences of manipulating
what may prove to be a significant
variable in a child’s history provide a
powerful barrier against such procedures.
If a historical variable is under
study, it can only be because the ex- 
perimenter suspects that it will have an 
effect upon a child’s behavior. If the 
effect is thought to be beneficial, many 
caretakers will consider it immoral to 
have their children left in a control 
group; if the effect is thought to be 
detrimental, no one, including the ex- 
perimenter, will care to fill the experi- 
mental group with his charges. 

Certainly these are severe limitations 
upon an experimental approach to his- 
torical variables. In large part, it is these 
considerations that have too frequently 
made experimental studies of child be- 
havior short-term affairs of only mild 
developmental significance. Yet these 
limitations, although real, are often 
magnified beyond their actual effect 
(Anderson, 1956). Many times, an ap- 
parent weakness of experimental tech- 
niques in attacking some historical vari- 
able represents a practical rather than 
an inherent difficulty: a shortage of 
time, personnel, ingenuity, or tech- 
nology. 

The history of experimental studies 
of children will lend some support to 
this view. Clearly, the experimental ap- 
proach has never been the most popular 
method of study, testifying to the real 
limitations of experimental techniques 
in dealing with historical variables. Yet 
the child’s interaction with his past en- 
vironment traditionally and logically 
has been the central focus of study, and 
there have been periods when experi- 
mental techniques were attempted with 
some frequency. The Dennises’ (1941)
study of twins reared for a year in relative
social isolation, McGraw’s six-year
pursuit of Jimmy and Johnny (1935),
the Kelloggs’ willingness to raise Donald 
with Gua (1933), Dorothy Marquis’ 
discrimination of infants’ feeding occa- 
sions with a buzzer (1931), and Watson 
and Rayner's conditioning of Albert to
fear the rabbit (1920) are examples of 
a time when the difficulties of an ex- 
perimental approach often proved to be 
more apparent than real. 

The most frequent applications of an
experimental approach, however, have
been to the child’s present environment.
In the 1920’s and 1930's a
considerable number of experimental 
studies appeared, which dealt with the 
sensory and motor capacities of the 
neonate. Still, interest in postneonatal 
development for many years centered 
about topographical and normative de- 
velopment, and although experimental 
techniques saw some application (Gesell 
and Thompson, 1929), the basic approach
was a field or psychometric one. That
is, the investigator essentially restricted 
himself to observing or asking the child, 
in a great variety of ways, what he 
could do. 

The concept of manipulating vari- 
ables to account for the observed vari- 
ance in what children do saw only slight 
use. In part, this was because relation- 
ships were too often handled only by 
the use of gross categorization: nature- 
nurture, reflex-instinct-learning, prac- 
tice, etc. This approach certainly did 
not provoke a large number of ques- 
tions that require an experimental an- 
swer. If, for example, a behavior ap- 
peared at a characteristic age in average 
children and if it could not be accelerated
in its appearance by direct practice
of it, the oversimplified equation of 
“learning” and “experience” to “prac- 
tice” led immediately to the conclusion 
that this behavior was maturational, and 
“age” was translated immediately into 
something “neural.” It was not until 
more recent times that a theory such 
as Hebb’s (1949) raised complex (and 
experimental) questions about the 
varied nature of “experience” and an 
experimental study such as Jeffrey’s 
(1958) showed that a behavior might 
not be accelerated in making its appear- 
ance by direct practice but might by 
practice of a different but related behavior.
Indeed, it was the early 1950's
that brought a shift in interest to a more 
realistic analysis of these antecedent 
categories. The “why” of child devel- 
opment became increasingly pressing; 
demand for both a practical and theo- 
retical understanding of development 
grew. The convenience of field and 
clinical methods brought them into frequent
use. The correlations produced
by these methods, although subject to 
a certain ambiguity, were undoubtedly 
more efficient than an experimental approach
in at least producing data about
the role of historical variables in the 
development of personality and social 
behavior. For example, Mussen and 
Conger (1956) in their textbook in child 
development cite approximately six 
times as many field or clinical studies 
as experimental studies when dealing 
with preschool social development. 
However, the present demand for a 
“why” to child behavior apparently has 
accentuated the need for unambiguous 
statements, too. At any rate, there is
a growing renaissance in the use of experimental
techniques. The majority of
recent experimental studies center upon 
the child’s present rather than past environment.
The subjects typically are
preschool children, not neonates. The
questions commonly considered deal
with relationships suggested by some
learning or behavior theory (McCandless
and Spiker, 1956; Stevenson, 1954;
Gewirtz and Baer, 1958; Bijou, 1957).
There is still much less use of experimental
techniques in dealing with the
child’s history than in the early part of
this century.


THE GENETIC ENDOWMENT
OF THE CHILD

The child’s inheritance is the most 
difficult source of variables to bring
under adequate experimental control.
Current genetic theory ascribes to the 
genes the power to stimulate develop- 
ment along certain dimensions, includ- 
ing behavioral dimensions. The genes 
are not directly manipulable.¹ The
chromosomes upon which genes are
said to be arrayed are not, for practical
purposes, directly manipulable. They
are manipulable indirectly by the use
of selective breeding procedures, but
this is a technique explicitly denied in
the study of human development.

¹ Indeed, the existence of genes seems to be
established more firmly on logical than on
empirical grounds.

The child psychologist, then, is left
with the possibility of doing something
with the children who occasionally
come in sets of identical and fraternal
twins. A certain experimental logic has
grown out of this fact aimed at identifying
the degree to which any behavior
is influenced by inherited factors. Identical
twins have identical inheritance;
hence differences between them may be 
attributed to environmental action. But 
similarities between identical twins are 
much more difficult to refer to genetic 
endowment. Hebb (1958) argues that 
this may be done only if the environ- 
ment of the twins has been systemati- 
cally contrasted in terms of all relevant 
environmental factors. This immediately
returns the experimenter to an
exhaustive study of environmental, not
genetic, factors. Kallman (1946) has
argued from the assumption that identical
twins and fraternal twins both
share very similar environments; when 
identical twins are more homogeneous 
in behavior than fraternal, very likely 
that difference reflects inherited rather 
than environmental factors. In response, 
of course, it can be suggested that fra- 
ternal twins do not have an environ- 
ment so similar as identicals; half of all 
fraternal twin pairs have different-sexed 
members, whereas identicals are always 
same-sexed. Furthermore, fraternals frequently
look different; identicals rarely
do. Evaluation of this argument again 
returns the experimenter to a thorough 
study of the relative similarity of en- 
vironmental factors, not inherited ones. 
Both Kallman (1946) and Newman, 
Freeman, and Holzinger (1937) have 
suggested the comparison of identical 
twins reared apart with fraternal twins 
reared together. The assumption is
made that identicals reared apart must 
have less similar environments than fra- 
ternals reared together. Hence greater 
similarity between identicals than be- 
tween fraternals must reflect inherited 
factors. But, again, it can be pointed
out that twins reared apart typically
share an environment involving early 
separation from the mother, a variable 
considered of great significance but not 
thoroughly understood. 
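
The comparison on which these twin arguments rest can be written out as a computation. The following Python sketch correlates the scores of the two members of each pair, separately for identical and fraternal twins. The scores are invented for illustration and are not drawn from any study cited here, and a simple Pearson correlation over arbitrarily ordered pairs is used as a simplification. As the preceding discussion indicates, a higher correlation among identicals does not by itself show that the difference reflects inheritance, since the two kinds of pairs may also differ in the similarity of their environments.

```python
import statistics


def pair_correlation(pairs):
    """Pearson correlation between first and second members of each pair."""
    firsts = [a for a, _ in pairs]
    seconds = [b for _, b in pairs]
    m1, m2 = statistics.fmean(firsts), statistics.fmean(seconds)
    cov = sum((a - m1) * (b - m2) for a, b in pairs)
    v1 = sum((a - m1) ** 2 for a in firsts)
    v2 = sum((b - m2) ** 2 for b in seconds)
    return cov / (v1 * v2) ** 0.5


# Hypothetical scores on some behavioral measure; not data from any study.
identical_pairs = [(10, 11), (14, 13), (8, 9), (12, 12), (16, 15)]
fraternal_pairs = [(10, 12), (14, 11), (8, 10), (12, 15), (16, 13)]

r_identical = pair_correlation(identical_pairs)
r_fraternal = pair_correlation(fraternal_pairs)
print(f"identical r = {r_identical:.2f}, fraternal r = {r_fraternal:.2f}")
```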

Both Beach (1955) and Verplanck 
(1955) have argued that little can be 
done to study the effects of inherited 
factors. They maintain that in order to 
refer a behavior to inheritance it is nec- 
essary to show that there is variation in 
the behavior that cannot be explained 
by any environmental factor. Clearly, 
this would require an exhaustive knowl- 
edge of environmental factors as a prior 
condition to making any study of a 
child’s inheritance. Acceptance of this 
position removes the problem of genetic 
endowment from the field of child 
study. This attitude is not without op- 
position, but the extent to which it is 
accepted will explain to a great degree 
the paucity of experimental attack upon 
the child’s inheritance. 

By way of summary, then, it appears 
that the virtue of any experimental 
technique is that it yields a relatively 
unambiguous conclusion about causal 
relationships. In an area as complex as 
child development this virtue is of ex- 
treme value: when relationships are 
ambiguous, too many alternative con- 
clusions of too variegated a nature are 
possible. However, experimental tech- 
niques are most readily applied to con- 
trolling the child’s present environment; 
extensive control of the child’s interac- 
tion with his past environment, al- 
though often possible, is more often 
difficult for both practical and ethical 
reasons. Since the child’s past environment
traditionally and logically is a central
source of variables in child development,
this difficulty constitutes a
limitation on the use of experimental 
techniques in this area. Furthermore, an 
experimental approach will frequently 
aim at abstracting a fraction of the 
child’s total environment for study in a 
controlled setting. Consequently, many 
of the relationships produced appear de- 
void of immediate “practical” or “real- 
life” application. 

Current use of experimental tech- 
niques, then, seems aimed at three even- 
tualities. 


1. The quantification of precise, re- 
liable, and unambiguous laws of child 
behavior in relation to the present and 
short-term past environments. 

2. The gathering together of such 
laws into a general system of behavior 
theory, so that generalized statements 
about these laws have organizing and 
unifying value and, perhaps, practical 
application. 

3. The extension of such a body of 
fact and theory, through the use of 
other research techniques, to include 
more variables from the child’s long- 
term past environment and so produce 
a more comprehensive theory of be- 
havior and its development. 


PLAN OF TREATMENT 


What follows is a survey of promis- 
ing experimental techniques in the area 
of child behavior. These techniques 
have no single characteristic apart from 
their logic; they are almost as varied as 
the techniques of all psychological ex- 
perimentation. For present purposes, the 
logical factor to bring order into this 
variety is the age (i.e., the development)
of the child under study. As the
child’s equipment changes, techniques
are modified accordingly. Hence this
survey is organized under three main
categories of developmental stage:

1. Infancy: the first two years of life, 
a period of relative physical dependence 
and a lack of motor competence.

2. Early childhood: from 2 to 5 years
of age, a period of decreasing physical
dependence, increasing social dependence
within a family unit, and rapidly
increasing motor and verbal compe- 
tence. 

3. Later childhood and preadolescence:
from 6 to 13 years of age, a period
of socially structured education in
groups, increasing social competence,
emotional control, and independence.


In each of these stages the following 
three common experimental problems 
will serve as the basis for organizing 
discussion: 


1. Obtaining subjects: Since children 
are neither purchased nor considered 
competent to volunteer their services to 
the experimenter, the problem of ob- 
taining children to observe is a primary 
one. A number of paths to sources of 
subjects have been developed by past 
experience; these are described in terms
of the age level of the subjects to be 
studied. 

2. Sampling problems: Since children 
are not obtained for study in the rela- 
tively simple manner that animals are, 
many problems of biased sampling arise. 
Sometimes these are unavoidable and 
can only be recognized; more often, 
steps can be taken to detect and correct 
such biases, if important. Again, this
problem varies largely in terms of age.

3. Investigative procedures: This is 
the actual core of the chapter, a de- 
scription of the methods of controlling 
variables and recording responses. A 
large part of this problem consists in 
instituting proper controls and devel- 
oping mechanical devices and instru- 
ments. Again, age will explain part of 
the variety of techniques to be consid- 
ered, but only a small part. More often, 
the development of techniques in this 
field represents a fascinating interaction 
between the ingenuity of the investiga- 
tor, the traditional tools of psychology, 
and current technological development. 
In general, it will often be found that
procedures applicable successfully at 
one stage of development may be used 
extensively beyond that stage, and repe- 
tition of them will be avoided in the 
discussion of later stages. 


The preceding three problems are 
common to any experimental undertak- 
ing and serve as useful points of refer- 
ence in organization. However, two 
other points may be considered as 
equally common to experimentation but 
are best dealt with once at the outset. 
The first of these is the problem of 
translating a question about child de- 
velopment into an experimental ques- 
tion; the second is the problem of ex- 
perimental design. 

It is common to hear that a particu- 
lar hypothesis of development (indeed, 
of behavior) is not adequately repre- 
sented by the experimental operation in 
a specific investigation. (This is an es- 
pecially common statement when the 
results are negative.) Clearly, no helpful
formula can be offered here to resolve
these occurrences. It can only be
noted that experimentation does require 
a specific and practicable operation that 
varies some factor of probable relevance 
to the behavior under study. The mean- 
ing of this operation, in broader terms 
than its own, has nothing to do with 
laboratory-experimental techniques per
se. If experimental techniques seem to 
promise precision and a lack of am- 
biguity, it is only if conclusions are 
stated in terms of the experimental op- 
erations used. Statements broader than 
that proceed at their own risk. 

The second problem, that of the sta- 
tistical design according to which ex- 
perimental variables are manipulated 
and responses recorded, is more readily 
dealt with. To some extent, the ages of 
the children under study play a role in 
the choice of an experimental design; 
to a perhaps larger extent, tradition and 
training lead the experimenter to cer- 
tain choices. Yet it may be possible to 
state particular logical advantages and 
limitations of the experimental designs 
available to an investigator, which 
might in some cases lead to different 
choices. Since this is one of the objec- 
tives of this chapter, a general discus- 
sion of experimental designs should 
prove valuable at this point. 

The response of children to experi- 
mental variables is commonly studied 
under any of three general designs: (1) 
as reflected in randomly composed 
groups, each group receiving a differ- 
ent degree or level of the experimental 
variable; (2) as reflected in groups 
composed of children who are matched, 
between groups, in terms of relevant 
variables; and (3) as reflected in the 
changed behavior of an individual
child, relative to a stable baseline of response
characteristic of him.

Random Groups. The experimenter 
may establish a degree of control over 
a child’s present environment, yet find 
that the child’s response to an experi- 
mental variable in that situation is un- 
stable and/or that children in general 
show widely varying kinds of response 
to it. This variance is attributable, in 
large part, to incomplete control of the 
present environment and/or the vari- 
ables the child brings with him to the 
laboratory (from his history of inter- 
action with past environments and his 
genetic endowment). The experimenter 
may not be primarily interested in any 
but the experimental variable. In this 
case he will frequently resort to an ex- 
tensive replication of pairs of observa- 
tion: one child exposed to the experi- 
mental variable, and a control child ex- 
posed to a different degree or level of 
that variable. If enough children are 
randomly assigned to these groups, the 
effects of the experimenter’s imperfect 
control of the present situation, and of 
the variables the child brings with him, 
will be canceled out. In effect, the ex- 
perimenter establishes control not over 
the behavior of a child but over the 
average behavior of a group of children. 
In general, the more unstable the be- 
havior of a single child, and/or the 
more variable the behavior of different 
children, the larger the required groups. 

Furthermore, when large groups are 
required, the experimenter may, in the 
end, conclude that he now understands 
the nature of his experimental variable 
but may also conclude that he will 
rarely find a situation in which this
variable alone will play a major explanatory
role.¹
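
The random-groups logic can be sketched briefly in Python. The example below is only an illustration: the child labels, group sizes, and standard deviations are hypothetical. It forms two groups without bias by shuffling, and it uses the usual formula for the standard error of a difference between two independent means to show why more variable behavior calls for larger groups.

```python
import random


def assign_randomly(children, seed=0):
    """Shuffle the children and split them into two equal-sized groups."""
    rng = random.Random(seed)
    shuffled = list(children)
    rng.shuffle(shuffled)
    half = len(shuffled) // 2
    return shuffled[:half], shuffled[half:2 * half]


def standard_error_of_difference(sd, n_per_group):
    """Standard error of the difference between two independent group
    means, assuming a common standard deviation in both groups."""
    return sd * (2 / n_per_group) ** 0.5


children = [f"child_{i:02d}" for i in range(20)]
experimental, control = assign_randomly(children)

for sd in (1.0, 3.0):
    for n in (10, 40):
        se = standard_error_of_difference(sd, n)
        print(f"sd={sd}, n per group={n}: SE of mean difference = {se:.2f}")
```

Tripling the variability of the response triples the standard error at any fixed group size, so that roughly nine times as many children are needed to recover the same precision.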

Matched Groups. Again, the investigator
may find that the behavior of the
child under study is unstable or too dif- 
ferent from that of other children. He 
may believe that he recognizes and can 
measure the significant variables respon- 
sible for this variation, which the child 
brings with him. In this case he may 
again resort to replicating pairs of ob- 
servations (pairs of children) but may 
match the children in each pair so that
they are highly similar in terms of these 
variables responsible for the (irrelevant) 
variation. If the experimenter is correct 
in his recognition and measurement of 
these variables, then he will be able to 
study the effect of the experimental 
variable with relatively small groups of 
children. Furthermore, he will be better 
able to state in what situations the ex- 
perimental variable plays a major role 
(i.e., in situations closely describable by 
the matching variables). If the investi- 
gator is incorrect in his use of the 
matching variables, he will find that he 
has to use larger groups to study the 
experimental variable, or (following a 
possibly unexplored confidence in his 
matching technique) he may conclude 
(sometimes wrongly) that the experi- 
mental variable has no effect by accept- 
ing the null hypothesis. But when 
matching variables may be used with
relative confidence, they undoubtedly
contribute to the statistical efficiency of
the design. Part of this efficiency is lost,
however, since a relatively large number
of subjects is usually required to
find in them a pair of small matched
samples. Still, if matching can be accomplished,
the experimenter may use fewer
subjects, an undoubted saving in experimenter-hours.

¹ “Randomly composed groups” does not
necessarily imply representative groups. All
that is meant by this term is that two or more
groups of subjects have been formed without
bias. Results may be unambiguous as far as
the independent variable is concerned, yet not
generalizable to all children.
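
One way of forming matched groups can be sketched in Python as follows. The procedure shown is an assumption made for illustration, not a method prescribed in this chapter: children are ranked on a single matching variable, neighbors whose scores differ by no more than a chosen gap are paired, and one member of each pair is assigned at random to each condition. The pool of children, their scores, and the value of `max_gap` are hypothetical.

```python
import random


def matched_pairs(children, score, max_gap, seed=0):
    """Pair children with neighboring scores on the matching variable and
    assign one member of each pair at random to each condition."""
    rng = random.Random(seed)
    ranked = sorted(children, key=score)
    experimental, control = [], []
    i = 0
    while i < len(ranked) - 1:
        first, second = ranked[i], ranked[i + 1]
        if score(second) - score(first) <= max_gap:
            pair = [first, second]
            rng.shuffle(pair)
            experimental.append(pair[0])
            control.append(pair[1])
            i += 2
        else:
            i += 1  # no close match: this child is left unused
    return experimental, control


# Hypothetical pool: (name, score on the matching variable).
pool = [("A", 52), ("B", 61), ("C", 49), ("D", 60),
        ("E", 70), ("F", 68), ("G", 55)]
experimental, control = matched_pairs(
    pool, score=lambda child: child[1], max_gap=3
)
print("experimental:", experimental)
print("control:", control)
```

With these values one of the seven children finds no partner and is discarded, a small instance of the loss of subjects that matching usually entails.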

Baseline Techniques. Adequate con- 
trol of the child’s present environment 
in the laboratory will often allow use 
of the child as his own control. It is 
necessary that the response under study 
be brought to (or exist at) a stable level
that is capable of variation when con- 
ditions are changed. If the investigator 
can rely upon the stability of the re- 
sponse, he may present or remove ex- 
perimental variables at will, recording 
the consequent variation in the response. 
This technique is extremely efficient in 
many ways. It yields a law of behavior 
applicable to a particular child, not to 
a group mean that may correspond to 
no actual child. If it is easily replicated 
in a number of individual children, it 
is a general law. If not, then the changes 
in experimental technique, which the 
experimenter must use in order to repli- 
cate it in other children, may often give 
a meaningful clue to the nature of each 
child’s past history. Undoubtedly, this 
method is economical of subjects. Very 
often it may not be economical of the 
experimenter’s time: it may take a good 
deal of pilot work to produce a labora- 
tory situation in which a given response 
will be suitable for baseline studies. 
However, this work, once done, should 
often prove a remarkably gainful investment.
The current progress of
studies of operant learning in animals, 
using baseline techniques, bears this out. 
In general, when child subjects are 
scarce, an attempt at baseline proce- 
dures should often be worth the gam- 
ble. 
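
The baseline logic can be sketched in Python with a record for a single child. Everything in the example is hypothetical: the response rates, the phase labels, and in particular the stability criterion, which is merely one simple rule chosen for illustration. The sketch first checks that the baseline is stable and then compares the mean rate when the experimental variable is present and after it is removed.

```python
import statistics


def is_stable(rates, tolerance):
    """Treat a baseline as stable when every session lies within
    `tolerance` of the phase mean (an illustrative criterion only)."""
    mean = statistics.fmean(rates)
    return all(abs(r - mean) <= tolerance for r in rates)


def phase_means(sessions):
    """Average response rate per phase, in order of first appearance."""
    by_phase = {}
    for phase, rate in sessions:
        by_phase.setdefault(phase, []).append(rate)
    return {phase: statistics.fmean(rates) for phase, rates in by_phase.items()}


# Hypothetical responses per minute for one child across sessions.
sessions = [
    ("baseline", 10), ("baseline", 11), ("baseline", 9), ("baseline", 10),
    ("variable present", 18), ("variable present", 20), ("variable present", 19),
    ("variable removed", 11), ("variable removed", 10), ("variable removed", 9),
]

baseline = [rate for phase, rate in sessions if phase == "baseline"]
if is_stable(baseline, tolerance=2):
    for phase, mean in phase_means(sessions).items():
        print(f"{phase}: {mean:.1f} responses per minute")
```

A rise in rate while the variable is present, followed by a return toward the baseline level on its removal, is the pattern that allows the child to serve as his own control.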


Infancy 
OBTAINING SUBJECTS 


Many avenues for gaining access to 
infants for study are open to the re- 
search psychologist. Babies have been 
available in child-study clinics, mater- 
nity wards, institutions for children, 
and in the homes of their parents. Each 
approach presents certain unique ad- 
vantages and problems. 

“Outpatient” research clinics have 
been the setting for both experimental 
work and extensive longitudinal 
studies. The work of Gesell and his 
colleagues at the former Clinic of Child 
Development, Yale University, is prob- 
ably the best-known example. Organi- 
zations of this kind are usually part of 
a medical school and frequently operate 
in conjunction with a child-care or 
well-baby clinic. Mothers bring their 
infants in for observation at regular in- 
tervals. In exchange for their coopera- 
tion in a study, mothers and babies of- 
ten receive professional services. The 
advantage of this approach is that it 
enables the research staff to design a 
laboratory with the permanent equip- 
ment essential for most experimental 
studies (e.g., one-way viewing glasses, 
listening and recording devices, and 
motion picture cameras) as well as a
sufficient number of more flexible devices
for specific investigations (e.g.,
cribs with stablimeter attachments, cribs 
with adjustable trays, special floor 
mats, and play pens). Furthermore, by 
careful scheduling, several investigations 
can be carried on simultaneously, thus 
utilizing the facilities to maximum ca- 
pacity. The obvious limitation is the 
high cost of establishing such a labora- 
tory, if one does not already exist, and 
the need for continuing operating 
funds, if the clinic is an established one. 

Most research on the neonate and 
young infant has been conducted in 
laboratories in maternity wards. For ex- 
ample, some of the pioneer studies in 
the 1920’s and 1930’s on the behavioral 
equipment of the neonate were done in 
hospitals at Johns Hopkins University, 
Ohio State University, and the State 
University of Iowa. Although these 
laboratories are not so costly to estab- 
lish as child-study clinics, they present 
other problems. Arranging for adequate 
space, working out relationships with 
the medical and nursing staffs, and coordinating
the investigation with hospital
routines are the most important.
Another point worthy of mention is 
that investigations in hospital labora- 
tories must necessarily be limited to 
study of infants from the time of birth 
until mothers are discharged, a period 
that is continually shrinking. In the past 
mothers were kept in the hospital for 
two or three weeks after delivery; cur- 
rently they remain for only three or 
four days. 

Experimental studies in children’s in- 
stitutions, such as foundling homes, 
children’s homes, and orphanages, have 
been conducted with or without the 


benefit of a psychological laboratory. 
Constructing and operating a laboratory 
in a children’s institution pose problems 
similar to those encountered when 
working in a maternity ward. Arrang- 
ing to conduct a single study, however, 
presents different problems. First, the 
investigator must obtain permission 
from the superintendent or the admin- 
istrative board. Such a request usually 
takes the form of a letter stating the 
purpose and nature of the study and 
assuring the administration that the procedures
will not injure the infants. Requests
will have to be written in nontechnical
language and include information
on the amount of floor space
required, any encroachment on personnel
time, the duration of the study, and
its effect on institutional routines. A 
second problem concerns establishing a
working relationship with the members 
of the staff directly concerned with 
child care. One way of accomplishing 
this is to proceed with the experimen- 
tal program in small steps to give the 
staff opportunity to understand and 
gain confidence in the experimenter and 
his procedures. Third, in many situa- 
tions of this kind the investigator is ex- 
pected to reciprocate by giving profes- 
sional services. The kind and amount 
rendered are, of course, dependent 
upon the investigator’s competence to 
give the services requested and his other 
obligations. 

One outstanding advantage of institutional
laboratories over hospital laboratories
is the opportunity to see infants
for longer periods and in later
developmental stages. Another is that 
many studies can be carried out with 
little disruption of the infant’s routines.
The major limitation is that experimental
plans may be hampered by the loss
of subjects at any time because of adoption.
Since current practice is to release
infants for adoption at the earliest age
possible, infants from this source are becoming
increasingly scarce and older
subjects quite rare.

Investigations in the homes of parents 
have usually consisted of intensive 
studies on one or two infants (Blau and 
Blau, 1955; Dennis and Dennis, 1935), 
or relatively brief observations on a 
larger number of infants, each in a dif- 
ferent home (Brackbill, 1958). In 
studies of the first type making arrange- 
ments for performing the study has 
been obviated by the fact that the in- 
vestigators have, for the most part, 
been the parents or guardians of the 
subjects. In studies of the second type 
obtaining subjects may produce some 
difficulties. First, mothers with infants
within the desired age range, who are 
willing and committed to cooperate to 
the point of “seeing the study through” 
will have to be found. The work of 
finding such mothers may often begin 
profitably through contacts with the 
community’s preschools, schools, and 
clinics, with mothers’ groups or clubs,
and with local pediatricians. One ad- 
vantage of working in the child’s home 
is that studies can be conducted with 
a minimal amount of disturbance to the 
infant and his routines. Another is that 
limitations on the number and age range 
of subjects that can be obtained are less
severe. On the other hand, considerable 
time and money are required for travel 
and extra research activities. For example,
Brackbill (1958), in a study on infant
conditioning, reported utilizing
both individual and group discussions
with mothers to introduce her problem 
and secure cooperation. During the 
course of the study she gave them sim- 
ple journal articles on infants, showed 
graphs, etc. Another limitation of this 
mode of operation is that it is relatively 
easy to lose subjects because of changes 
in the mother’s attitudes or circumstances.
Finally, conducting studies in
private homes precludes the use of any 
kind of heavy or complicated apparatus. 

A procedure used with preschoolers 
(Bijou, 1958) should also be of value 
with infants. That is, infant research 
could be conducted in a mobile labora- 
tory complete with viewing, listening, 
and response recording devices. The 
trailer-laboratory could be taken by 
truck or car to institutions or private 
homes and remain in a location for an 
hour, a week, or several months, de- 
pending on the study. It would be en- 
tirely possible to conduct studies under 
the constant conditions enjoyed in a 
traditional laboratory, yet such a unit would be less costly
to establish and operate. Investigative
procedures conducted in a trailer would 
be more disruptive to the baby’s rou- 
tines than in an institution or private 
home but probably less so than in a 
child-study clinic. 

A particular problem relevant here is 
that of terminating a study. The rela- 
tionships uncovered in a particular 
study are not affected by the proce- 
dures that terminate the study, of 
course; but the probability that these 
and other subjects in the same area will 
be available for future study definitely 
is. This problem is not restricted to the 
study of infants in their own homes; 
rather it is common to almost all child
research. Probably the most important
single factor is to explain to the care- 
taker (beforehand) the intent, meaning, 
and procedures of the experiment in 
understandable terms. Repetition of 
this, with “progress reports” during the 
study, is good strategy. Then, a simple 
announcement of termination, with 
thanks, should prove adequate, espe- 
cially if the experimenter can say that 
the study was a “success” and show 
reprints and articles. A group address, 
or a typed letter of thanks and explana- 
tion, may be particularly convenient. 


SAMPLING 


Most methods of obtaining infants for 
study yield biased samples. For exam- 
ple, when a research clinic opens its 
doors to mothers who are interested and 
able to give the required time, energy, 
and cooperation, it is probable that they 
would constitute a group that would be 
above average in education and socio- 
economic status. The same thing would 
probably be true of mothers who would 
consent to cooperate in a study in their
homes. Likewise, samples of mothers 
and infants drawn from a hospital ward 
or institution would be biased along 
other dimensions. This is recognized by 
most investigators, yet few attempts 
have been made to obtain random sam- 
ples. This is partly because of the diffi- 
culties of obtaining subjects and partly 
because many investigations require intensive
individual analysis and can handle
only a few cases efficiently. If, however,
a study requires a representative
sample, the research clinic and the mo- 
bile laboratory offer the most promising 
approaches to an approximation. The 
clinic, because of the large number of
cases it can handle efficiently, can obtain
enough cases to extract from them a
near-representative sample. The mobile
laboratory may be used to approach the
many cases whose unwillingness to cooperate
is based largely on the distance
or expense involved in traveling to the
laboratory, the number of other children
in the family who then would require
a sitter, etc. Whether or not a
sampling procedure has been used, it is
essential that some account be given of 
how the subjects were obtained. Too 
often such information may be critical 
in resolving otherwise contradictory 
results. 
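
As a rough illustration of how a clinic with a large pool of cases might extract a near-representative sample, the following Python sketch draws cases by quota so that each stratum's share matches a reference population. The strata labels, the proportions, the case identifiers, and the function name `draw_quota_sample` are all hypothetical; none is taken from the studies cited here.

```python
import random
from collections import defaultdict


def draw_quota_sample(pool, target_share, n, seed=0):
    """Draw about n cases so each stratum's share approximates target_share.

    pool: list of (case_id, stratum) tuples (hypothetical clinic records).
    target_share: dict mapping stratum -> proportion in the reference population.
    Returns a dict mapping stratum -> list of sampled case ids.
    """
    rng = random.Random(seed)
    by_stratum = defaultdict(list)
    for case_id, stratum in pool:
        by_stratum[stratum].append(case_id)

    sample = {}
    for stratum, share in target_share.items():
        quota = round(n * share)
        available = by_stratum.get(stratum, [])
        if quota > len(available):
            # The pool is too biased to supply this stratum; more cases are needed.
            raise ValueError(f"pool has too few cases in stratum {stratum!r}")
        sample[stratum] = rng.sample(available, quota)
    return sample


# Hypothetical pool: volunteers skew toward higher socioeconomic status.
pool = (
    [(f"H{i}", "high") for i in range(60)]
    + [(f"M{i}", "middle") for i in range(30)]
    + [(f"L{i}", "low") for i in range(10)]
)
target = {"high": 0.2, "middle": 0.5, "low": 0.3}  # assumed population shares
sample = draw_quota_sample(pool, target, n=20)
print({stratum: len(ids) for stratum, ids in sample.items()})
```

The error raised for an undersupplied stratum reflects the point made above: a biased pool yields a representative sample only if it is large enough to contain the scarcer kinds of cases.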

Not only should there be a descrip- 
tion of how the subjects were obtained 
but also of the basis upon which they were
eliminated. Some of the criteria are 
questionable physical status, recent 
changes in caretaking relationships, and 
poor health or illness of the mother 
during pregnancy. For some studies, it 
may be necessary to eliminate those 
who do not stay awake for long enough 
periods to permit study and whose 
sleeping cycles are markedly irregular. 
When working in private homes, it may 
be expedient to eliminate first borns, 
since new mothers may be tense and 
overly concerned with the effects of 
any sort of investigative procedure. 
When working in institutions, it is 
recommended that infants over 5 or 6 
months of age be excluded, unless the 
problem concerns behavior that may be 
related to institutional living. 


INVESTIGATIVE PROCEDURES 
IN EARLY INFANCY 


Because of the tremendous equipmental
differences between the neonate and
the 2-year-old infant, investigative procedures
vary widely. It is advantageous,
therefore, to discuss this aspect of re- 
search in two parts. The first section 
deals with methodology for full-term 
babies from the time of birth to the 
time when the subject can begin to sit 
for longer periods and can reach and 
grasp efficiently. In terms of chrono- 
logical age, this would be about 9 
months. The second section extends 
from the end of early infancy to the 
age at which basic skills in manipula- 
tion, locomotion, and simple verbal be- 
havior are developed. This would be 
about age 2. 

Investigative procedures are viewed 
as consisting of two parts: (1) manipu- 
lating independent variables and meas- 
uring responses and (2) controlling the 
setting conditions affecting the behavior 
under observation. Since problems of
manipulating independent variables and 
measuring responses are closely tied in 
with the general experimental area, the 
specific variables involved, and the age 
of the subject, they are discussed by the 
case method, i.e., by describing the re- 
lationship between problems and proce- 
dures in selected studies. The problems 
of controlling the setting conditions are 
discussed in terms of both general considerations
and examples. They are also
emphasized in some of the studies pre- 
sented to illustrate manipulative and re- 
cording procedures. 

The problem of controlling experi- 
mental conditions other than the one 
to be varied is a matter of holding 
constant the conditions preceding and 
during sessions. With respect to con- 
trolling in-between-sessions conditions, 
the young infant presents fewer problems
than older subjects, since verbal
communication is not a factor. The conditions
before and during sessions requiring
vigilant supervision pertain to
the health and behavioral state of the 
infant. These include colds, sieges of 
colic, sudden separation from the 
mother or caretaker, radical changes in 
living conditions, accidents, emotional 
experiences, and the like. Such events 
have to be evaluated to determine 
whether the consequences would alter 
the behavior of the infant in the ex- 
perimental situation. 

Reference to controlling conditions 
during sessions involves the exercise 
of traditional experimental safeguards. 
Typically they include conducting ses- 
sions at the same hour of the day 
(thereby avoiding the effects of varia- 
tions in food elimination, sleep, and 
fatigue cycles), holding sessions for the 
same length of time, and keeping physi- 
cal and social conditions constant. 
Physical factors of some significance 
would include constancy in the con- 
traption in which the infant lies and 
the illumination and temperature of the 
room. The social conditions would in- 
volve standardization of the behavior 
of the mother (or caretaker) and in- 
vestigator, if either or both have direct 
contact with subjects. Control of both 
sets of these conditions may be facili- 
tated by the use of panels, cabinets, 
booths, or special experimental rooms. 
Examples of the specific use of these 
devices are given in a later section. 

Studies illustrating techniques for 
manipulating variables and measuring 
responses were selected on the basis of 
two factors. First, they involve an effective
way of presenting and manipulating
stimuli or conditions and/or
methods for recording and measuring 
responses, and, second, they are con- 
cerned with one of the major problem 
areas in a stage of development. The 
categories for major problem areas are 
as follows: 


1. Conditioning.

2. Discriminative processes.

3. Motivation.

4. Frustrating, sudden, or strong stimulation.

5. Changes in complex behavior.


It is recognized that these categories 
do not include mutually exclusive 
processes. To a certain extent, any clas- 
sification system is arbitrary. This chap- 
ter must examine many experimental 
procedures, the labels for which are 
matters of considerable controversy. 
It does not seem appropriate to deal 
with these issues here; consequently, 
the classifications used are meant simply 
to allow adequate description and a 
wide sampling of problems. 

Conditioning. Because of the dif- 
ficulty of obtaining stable measures of 
behavior, experimental work in early 
infancy poses many problems. One 
method that has been used to advantage 
is the classical conditioning technique. 
Studies have been conducted to deter- 
mine whether the phenomena observed 
in animals may be observed in the 
human infant, to further the analysis 
of the conditioning process itself, and 
to evaluate other variables such as 
ability to discriminate two levels of 
auditory stimulation. To illustrate
some of the investigative procedures for 
studying the neonate and some of the 
methodological problems involved, two
carefully controlled studies, concerned
with the age at which and the stimuli to
which the infant can be conditioned, are
considered.

Wenger's study (1936), conducted at 
the Infant Laboratory at the State 
University of Iowa, was performed in 
a Pratt stablimeter (Pratt, Nelson, and 
Sun, 1930) to control for extraneous 
noises and variations in temperature and 
illumination. (See Fig. 4-2.) Onset, 
duration, and termination of independent
variables were controlled by
mechanical and electrical devices, and
responses were recorded by observational,
polygraphic, or photographic techniques.
Five experiments were conducted.
The first was on three infants
twenty-four hours after birth. The 
unconditioned stimulus was a light from 
a 100- or 200-watt lamp, eliciting eye- 
lid closure. The conditioned stimulus 
was a tactual vibrator attached to the 
plantar surface of the foot. The
conditioned stimulus lasted 3.4
seconds and came 3 seconds after the
light flashed. One experimental session
was held a day. Since attempts were
made to obtain 30 to 50 pairings per
session, the duration of sessions varied 
from two to three hours. Eye closure 
responses were recorded by two ob- 
servers and an interobserver reliability 
check was made. Experiment I included 
two control studies, one to determine 
whether maturation might account for 
increase in frequency of responses, the 
other, whether light stimulation might 
account for it. The first was done by
comparing the frequency of reactions to
the conditioned stimulus in the experimental
subjects with that in six control subjects
9 days of age. The controls did not have
any pairings.

Fig. 4-2. A stablimeter, for measuring reactions of the infant to stimulation. (a) shows the
experimental cabinet with the platform on which the baby lies. (b) shows how motions of the
platform are translated into a written record. The two lower lines are signal lines that show
when stimuli were presented and provide a time marker (Pratt, Nelson, and Sun, 1930).

Wenger concluded that
conditioning appeared on the fifth day 
after 124 pairings. Experiment II was a 
six-months’ continuation study of one 
of the subjects in Experiment I. Experiments
III, IV, and V used electrotactual
stimulation, delivered by an
electrode fastened to the plantar surface
of the left great toe, as the unconditioned
stimulus and explored respiratory and
electrodermal responses. Experiment
III, with one subject, was an
exploratory study of a vibratory stimulus
as a conditioned stimulus. In Experiment
IV a pure tone (frequency of
1084 cycles at 50 decibels above adult 
human threshold) served as a condi- 
tioned stimulus on five subjects. Two 
control experiments were included, one 
on the unconditioned responses to the 
conditioned stimulus and the other on 
the effects of repeated electrotactual 
stimulation. The first was on seven infants,
who were given only auditory
stimulation on their ninth postnatal day;
the second, on four subjects who were
given 30 shocks per day for seven days 
and on the ninth day were tested with 
the tone. Responses were recorded by 
observational and photographic means. 
Wenger claimed that three of the five 
subjects showed conditioning. In Ex- 
periment V the conditioned stimulus 
was a flash of light (3.4 seconds) from 
a 100-watt lamp. There were four subjects,
but complete information was obtained
on only two. Three infants
served as subjects in a controlled experiment
designed to evaluate unconditioned
responses to the light flash. Both
of the experimental infants showed con- 
ditioning of respiratory and withdrawal 
responses on the ninth day. Experiment 
VI involved the feeding behavior of two 
infants. A buzzer, 80-90 decibels above 
adult human threshold, started about one
second before nursing began and lasted
ten minutes. It sounded for 0.55 second 
at intervals of one second. Responses 
were observed by the investigator and 
by the parents. No evidence of condi- 
tioning was found during eight weeks 
of study. 
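
The interobserver reliability check mentioned for Experiment I can be made concrete with a small sketch. The source does not say how Wenger scored agreement, so the two indices below (simple percent agreement and Cohen's kappa, a chance-corrected index introduced well after Wenger's study) are offered only as assumptions about how such a check might be computed today. The trial records are invented for illustration.

```python
import math


def percent_agreement(obs_a, obs_b):
    """Proportion of trials on which two observers made the same judgment."""
    if len(obs_a) != len(obs_b) or not obs_a:
        raise ValueError("records must be non-empty and of equal length")
    agreements = sum(a == b for a, b in zip(obs_a, obs_b))
    return agreements / len(obs_a)


def cohens_kappa(obs_a, obs_b):
    """Chance-corrected agreement for binary (0/1) trial-by-trial records."""
    n = len(obs_a)
    p_observed = percent_agreement(obs_a, obs_b)
    p_a = sum(obs_a) / n  # share of trials observer A scored as a response
    p_b = sum(obs_b) / n  # share of trials observer B scored as a response
    p_chance = p_a * p_b + (1 - p_a) * (1 - p_b)
    if p_chance == 1.0:
        return math.nan  # kappa is undefined when both records are constant
    return (p_observed - p_chance) / (1 - p_chance)


# Hypothetical records: 1 = eyelid closure judged present, 0 = judged absent.
observer_a = [1, 1, 0, 0, 1, 0, 1, 1, 0, 0]
observer_b = [1, 1, 0, 1, 1, 0, 1, 0, 0, 0]
print(percent_agreement(observer_a, observer_b))
print(cohens_kappa(observer_a, observer_b))
```

Percent agreement alone can look high when a response is very rare or very common, which is why a chance-corrected index is shown alongside it.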

Wickens and Wickens (1940) fol- 
lowed up Wenger's study to give special 
attention to controlling motivation and 
the effects of the unconditioned stimu- 
lus. Their study was conducted in the 
Ohio State Infant Research Laboratory 
on 36 infants under 10 days of age, born 
in the Ohio State University Hospital. 
Setting conditions were controlled
much as in the Wenger study (Dock- 
ery and Valentine, 1939). A buzzer 
served as a conditioned stimulus, and a 
shock, delivered through an electrode 
attached to the sole of the right foot, 
causing foot withdrawal, served as the 
unconditioned stimulus. Thirty-six in- 
fants were divided into three groups. 
The experimental group was given 12 
buzzer-shock pairings a day for three 
days. Control Group I received 12 
shocks a day for three days and never 
received pairings of buzzer-shock. This 
group was designed to determine 
whether repeated presentation of shock 
would increase response during the test 
period. Control Group II received the 
buzzer alone on the first and third days. 
This was a check on whether any ef- 
fects observed could be accounted for 
by maturation. On the third and fourth 
days all were given the buzzer alone, 
until three consecutive failures to re- 
spond. Tests on the third day were to 
evaluate conditioning and extinction; 
on the fourth, spontaneous recovery 
and extinction. The Experimental
Group showed conditioning, extinction,
and spontaneous recovery. So did Con- 
trol Group I. The authors present two 
theoretical possibilities, one in terms of 
sensitization and the other in terms of 
conditioning to a sudden environmental 
change. The findings have several implications
(Underwood, 1949); at any
rate, they demonstrate the need for a
careful consideration of the experi- 
mental conditions used, through control 
groups representing analyzable com- 
ponents of these conditions. For in- 
stance, it follows from Wickens and 
Wickens’ discussion that a study might 
be done with a control group designed 
to test the effect of a stimulus other 
than a buzzer after repeated shock 
presentations. 
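
The test criterion used by Wickens and Wickens, presenting the buzzer alone until three consecutive failures to respond, is simple to state as a procedure. The sketch below is one hedged reading of that rule: the function name `trials_to_criterion` and the sample response record are hypothetical, and the original report may have scored trials differently.

```python
def trials_to_criterion(responses, criterion=3):
    """Return the number of test trials given when the criterion is met.

    responses: sequence of booleans, True if the infant responded on that
        buzzer-alone trial (hypothetical trial-by-trial record).
    criterion: number of consecutive failures that ends testing.
    Returns None if the record ends before the criterion is reached.
    """
    consecutive_failures = 0
    for trial_number, responded in enumerate(responses, start=1):
        if responded:
            consecutive_failures = 0  # any response resets the run of failures
        else:
            consecutive_failures += 1
            if consecutive_failures == criterion:
                return trial_number
    return None


# Hypothetical record for one infant on a test day.
record = [True, True, False, True, False, False, False, True]
print(trials_to_criterion(record))
```

Applying the same rule to the experimental group and to both control groups is what allows the groups to be compared on a common measure.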

The distinction between classical 
(respondent) conditioning and instru- 
mental (operant) conditioning rests 
upon identifying the response being 
conditioned as one that is controlled by 
eliciting or by consequent stimulation, 
respectively. Occasionally, a response 
may seem to fall in both categories, and 
this may occur more frequently in 
young infants than in older subjects. In 
this case it may be a matter of con- 
troversy, whether the investigator is 
studying one kind of conditioning or 
another. Previously cited studies have
treated milk and nipples as eliciting stimuli
for the sucking response, and their 
procedures have been identified as clas- 
sical conditioning. It is also possible to 
consider milk as a reinforcing stimulus 
and the nipple as a cue or discriminative 
stimulus marking occasions when suck- 
ing will be reinforced. 

Kantrow’s study (1937) of conditioned
sucking is an example. Her
method was to present a nursing bottle
to the infant, having sounded a buzzer 
5 seconds before, and to maintain the 
buzzer during the first 15 seconds of 
sucking. Then the buzzer stopped, but 
the child was allowed to continue sucking
for 15 to 20 seconds. Preceding the
buzzer was a baseline period, during 
which the infant’s sucking was ob- 
served, and which varied between 25 
and 75 seconds in length. (Initial base- 
line periods for a session lasted as long 
as 120 seconds to promote adaptation 
to the apparatus.) Sucking was recorded 
on a kymograph by a linen harness 
placed under the infant’s chin and 
drawn up over his ears by weighted 
strings running over pulleys. The char- 
acteristic chin motions involved in 
swallowing were transmitted through 
the strings to the kymograph pen. The 
subjects were orphanage infants, rang- 
ing in age from 1 to 3 months. Observa- 
tions were made at the 10 A.M. and 23:65 
and 10 P.M. feedings. The conditioning
observed may point to classical condi- 
tioning of the sucking response to a 
buzzer, through pairings with the 
eliciting stimulus of the nursing bottle’s 
nipple, or it may indicate instrumental 
conditioning of the sucking response to 
the discriminative stimulus (cue) of 
the buzzer, through reinforcement of 
this response by milk on occasions 
marked by the buzzer. An experimental
test will require at least control of milk
as a consequence of the conditioned
response.

Davis, Sears, Miller, and Brodbeck 
(1948) exemplify an instrumental con- 
ditioning logic in a study of the de- 
velopment of extranutritive sucking in 
newborn infants. Working in a maternity
hospital, they made up three groups
of infants, one group being fed at the 
mother’s breast, another group by 
bottle, and a third by cup. This proce- 
dure established a dimension of the 
frequency of reinforced sucking re- 
sponses required to satiate the infant 
for milk (breast-feeding requiring most
responses, bottle-feeding an inter- 
mediate amount, and cup-feeding least). 
The experiment lasted ten days, the 
tenure of the infants in the hospital. 
Daily tests of extranutritive sucking 
were made by inserting a rubber- 
covered finger tip into the infant's 
mouth at intervals after feeding and 
measuring the amount of sucking on 
the finger tip within a fixed time. Dif- 
ferentially increasing responsiveness to 
this test between groups supported the 
authors’ claim that instrumental con- 
ditioning principles could be applied 
to the sucking response. 

Blau and Blau (1955) illustrate an 
interesting variation in procedures for 
attacking the same kind of problem. 
They worked with a sample of one 
infant (their own) in their home. 
Observation of behavior and control of 
feeding conditions were unsurprisingly 
extensive. The infant was fed from 
either a two-hole or eight-hole nipple, 
in a regular alternation of feedings, 
from age 3 to 7 weeks. Observations 
were made of extranutritive sucking, 
crying, restless activity, and general 
fussiness. The sample necessarily in- 
volved a baseline design, which pro- 
duced reliable data quite efficiently in 
their single case. Clearly, the method is 
suitable to the investigator who is also 
a new parent. 

Discriminative Processes. One of the
research problems that has received a
good deal of attention is the question 
of the sensory capacities of the neonate. 
The major task has been to isolate a 
response class that will yield definite 
and reliable records. For the most part, 
general activity (motility) has served 
as a measure obtained by a stablimeter 
and polygraph. Responses have also 
been recorded by motion pictures and, 
of course, by observers. Four studies, 
involving different sense modalities, are 
described to give a sampling of meth- 
odologies employed. 

The first one, which was conducted 
by Stubbs (1934), is concerned with 
auditory stimulation. Two features of 
special interest here are that three di- 
mensions of auditory stimuli—pitch, 
intensity, and duration—were studied 
over wide ranges and that objective 
records were obtained by a stablimeter, 
pneumograph, and polygraph. The 
study was conducted in an experimental 
cabinet; stimuli emanated from an os- 
cillator. 

Weiss (1934) performed a parallel 
study involving visual stimulation. 
Methods were similar, but only the 
effects of three levels of light intensity 
were studied (minimal, dim, and mod- 
erate). Again, general motility (meas- 
ured by a stablimeter) was the index of 
response. 

Visual stimulation is very complex. 
It has, for the adult eye, discriminable 
components of intensity, color, satura- 
tion, and distribution (flux). In a verbal 
animal these components are readily 
controllable, but in lower organisms, 
and in the young child, questions about 
visual ability become complex and require
special experimental procedures.
Some have seen application in infant
studies.

For example, sensitivity to color
(wavelength) in infants between 2 and
24 months of age was studied by Staples
(1932) with a preference method. For
younger infants, Staples’ method was
to have the infant lying on his back in
his crib, looking at a framed piece of
cardboard mounted about one foot
from his face. On the cardboard were 
pasted two circles, one colored, the 
other either colored or gray. The ex- 
perimenter stood at the foot of the 
infant’s crib, her head and face hidden
behind the cardboard, looking at the
infant through a small slit in the card- 
board. The colored circles were pre- 
sented for standard lengths of time, 
and the experimenter recorded on stop- 
watches held in each hand the duration 
of the infant’s visual fixation on each 
color. For infants older than 5½ months,
a larger cardboard background was 
used; the infant was placed in a sitting 
position, with the cardboard held 
vertically in front of him, and the re- 
corded response was the infant’s reach- 
ing for or pointing to one of the two 
disks. (See Fig. 4-3.) 
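
The fixation-time measure in the younger-infant procedure lends itself to a simple preference index. The sketch below is an assumption about how the two stopwatch totals might be summarized, not Staples' reported scoring; the function name `preference_share` and the durations are hypothetical.

```python
def preference_share(trials):
    """Share of total fixation time spent on the colored disk.

    trials: list of (color_seconds, other_seconds) pairs, one per presentation,
        where "other" is the paired gray or second color (hypothetical data).
    Returns None if the infant fixated neither disk on any trial.
    """
    color_total = sum(color for color, _ in trials)
    other_total = sum(other for _, other in trials)
    total = color_total + other_total
    if total <= 0:
        return None
    return color_total / total


# Hypothetical stopwatch readings, in seconds, for two presentations.
readings = [(6.0, 2.0), (3.0, 5.0)]
print(preference_share(readings))
```

A share near 0.5 would indicate no preference between the paired disks; values well above 0.5 would indicate longer fixation on the colored disk.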

Staples’ method included trials on 
which the brightness of a preferred 
color was varied or the brightness of its 
paired gray was varied. The color 
preferences Staples found were stable 
for various levels of brightness of the 
colors and the grays, which is some 
indication that it was wavelength and 
not brightness that was preferred by the
infant. There always remains the pos- 
sibility that brightness was not varied 
over a wide enough range. Assumptions
that the infant's eye is similar to the
adult's in matching colors for brightness
are probably gratuitous.

Fig. 4-3. Staples’ method for studying responses to color involves a large cardboard, to
which one of several colored disks has been affixed, and the reaching response of the infant
(Staples, 1932).

Crudden (1937) has made an investigation
of the infant's sensitivity to
thermal stimulation. His method was to
wrap a small hollow cylinder to the
infant's calf with a single layer of bandage
and to maintain a steady flow of
water through this cylinder. Temperatures
between 16 and 45°C. were used,
always in terms of a temperature
change, either from body temperature
(33-34°C.) to a higher or lower temperature
or from a higher or lower one
to body temperature. The response recorded
was any change in activity. The
recording was accomplished by taking 
a motion picture of the infant and also 
a protocol of all observed activities of 
the infant as they occurred. The ob- 
vious advantages of a photographic 
record are that it can be reviewed many 
times and also submitted to a panel of 
judges to determine the reliability of 
observations. 

Thermal stimulation is not so complex 
in nature as light or sound stimulation.
However, the duration, frequency, and
extent of the stimulation are still notable
parameters, as in sound and light stimulation.
Conceivably, parameters, such
as the rate of change of the stimulation 
(the specific heat of the stimulating ob- 
ject) and the modality of stimulation 
(contact with solids, liquids, or gases 
by radiation or by induction) may pre- 
sent problems. 

Motivation. Studies of motivation 
usually present points of strong con- 
troversy, since this seems to be the 
nub of dissension between typical be- 
havior theories. Studies of infant 
motivation do not escape this aura, but 
here these questions are not of great 
consequence. The experimenter is often 
concerned simply with the question: 
Does a particular stimulus serve as a 
reinforcer, a reward, or a drive for an 
infant? Questions concerning why the 
stimulus is effective or what its proper 
label should be tend to retreat from 
central interest; before one can ask why 
a stimulus is effective in promoting 
labeling, it must first be identified, and
this has proven a difficult enough problem
in itself, for the young infant.

A study by Jensen (1932) apparently 
avoids some of these difficulties and 
is useful in demonstrating the effective- 
ness of milk as a stimulus. He con- 
structed a special feeding bottle, from 
which the infant sucked milk; the pres- 
sure changes within the bottle, arising 
from each sucking response, were 
transmitted to a kymograph. Milk of 
the correct temperature (40°C.) gives 
rise to a steady rate of sucking; varia- 
tions in the temperature of the milk, 
or substitution of fluids of differing 
taste or texture, are reflected in a falling 
off of the rate or magnitude of response 
in the same infant. Infants as young as
2 days old were studied with this method.

Jensen’s technique is noteworthy in 
that it measures a response resulting 
directly from the stimulus under study. 
Furthermore, the sucking response is
one of the few well-organized and pre- 
cise responses the newborn has avail- 
able. This response is easily elicited by 
all sorts of stimulation, especially in the 
oral region. Despite this difficulty, the 
use of a baseline technique, in which the 
infant serves as his own control, allows 
clear conclusions in single cases. The
technique would seem worthy of far 
wider application than it has seen, es- 
pecially when the psychologist must 
adapt his procedures to the requirements
of specialists in infant health.

A similar use of baseline technique 
is made in two other studies of infant 
motivation. In both the central prob- 
lem is to identify a stimulus complex 
that will act as a reinforcing stimulus 
in operant conditioning. A problem 
common to both is finding a response
to strengthen that is not too thoroughly
tied to eliciting stimuli. Brackbill (1958)
chose the smiling response in 4-month-old
infants. The reinforcing stimulus
was composed of auditory, tactual,
movement, and social components. The
experimenter, immediately upon observing
a smile from the infant lying
supine in his crib, smiled in return, began
to speak softly to the infant, picked
it up, held, jostled, and patted it for 
thirty seconds (continuing to talk),
and then returned it to the crib. The
infants were studied in their own homes,
with the experimenter serving as a
substitute mother during the experimental
sessions.

Rheingold, Gewirtz, and Ross (1959) 
chose vocalization as a response to be 
conditioned. Again, the reinforcing 
stimulus was a complex one: the ex- 
perimenter responded to the infant's 
vocalization by smiling, clucking, and 
lightly squeezing the infant’s abdomen. 
The infants were about 3 months old, 
and during the experimental sessions 
lay supine in their cribs, the experi- 
menter standing over them and looking 
down. The study required six daily ses- 
sions: the first two constituted a baseline 
condition, followed by two daily ses- 
sions of reinforcement of vocalization; 
the final two sessions were an extinc- 
tion period similar to the baseline 
period. 
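
The six-session design just described (two baseline days, two reinforcement days, two extinction days) can be summarized phase by phase. The sketch below shows one way to do so; the daily vocalization counts are invented, and the function name `phase_means` is hypothetical rather than anything used by Rheingold, Gewirtz, and Ross.

```python
from statistics import mean

# Phase label for each of the six daily sessions, in order.
PHASES = (
    "baseline", "baseline",
    "conditioning", "conditioning",
    "extinction", "extinction",
)


def phase_means(daily_counts):
    """Average the daily response counts within each phase of the design."""
    if len(daily_counts) != len(PHASES):
        raise ValueError("expected one count for each of the six sessions")
    grouped = {}
    for phase, count in zip(PHASES, daily_counts):
        grouped.setdefault(phase, []).append(count)
    return {phase: mean(counts) for phase, counts in grouped.items()}


# Hypothetical vocalization counts for one infant across the six days.
counts = [10, 12, 18, 22, 14, 12]
print(phase_means(counts))
```

A rise from baseline during reinforcement followed by a return toward baseline in extinction is the pattern such a summary is meant to expose, with the infant serving as his own control.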

An essential assumption in both of 
these studies is that the responses are 
modifiable by certain kinds of stimulus 
consequences. However, even casual 
observation of infants can suggest that 
a good deal of their behavior is con- 
trolled by eliciting (as compared to 
consequent) stimulation. An operant
conditioning session, in which stimuli
are offered as consequences of responses
in increasingly close succession, may
also be viewed as a period of increasing 
elicitation of behavior, if indeed the 
stimuli being used have any eliciting 
power. The use of an experimental con- 
dition in which these complex stimuli 
are offered at the same rate, but 
randomly rather than as consequences 
of a response, would seem a useful safe- 
guard when conclusions are drawn. 
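
The safeguard proposed here, offering the same stimuli at the same rate but at random, can be sketched as a schedule generator. This is a minimal illustration under stated assumptions: the session length, the number of presentations, and the function name `noncontingent_schedule` are hypothetical, and the sketch ignores the time each presentation itself occupies.

```python
import random


def noncontingent_schedule(n_presentations, session_seconds, seed=None):
    """Random onset times for stimuli delivered independently of behavior.

    n_presentations: how many stimuli to deliver, set equal to the number
        delivered in a comparable contingent session so the rates match.
    session_seconds: length of the control session in seconds.
    Returns a sorted list of onset times, in seconds from session start.
    """
    rng = random.Random(seed)
    onsets = [rng.uniform(0, session_seconds) for _ in range(n_presentations)]
    return sorted(onsets)


# Hypothetical case: 25 reinforcements were given in a 9-minute contingent session.
schedule = noncontingent_schedule(n_presentations=25, session_seconds=540, seed=1)
print(len(schedule), round(schedule[0], 1), round(schedule[-1], 1))
```

If responding rises as much under this schedule as under the contingent one, the increase is more plausibly attributed to elicitation than to reinforcement.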

Frustrating, Sudden, or Strong Stim- 
ulation. Studies considered here are 
those which ordinarily fall into cate- 
gories of emotion and emotional be- 
havior. Rather than attempt to give 
illustrations of the diversified tech- 
niques that would have to be included, 
consideration will be limited to meth- 
ods of frustration and of presenting 
sudden and/or strong stimulation. 
Frustration may consist of the actual 
prevention of a response by mechanical 
means, the nonreinforcement of a re- 
sponse that is ordinarily reinforced, or 
the introduction of conflict by stimulat- 
ing competitive behavior. Almost all of 
these categories have been applied to 
children of some age level, and a con- 
siderable number is exemplified in in- 
fant studies. 

The prevention of responses has 
been the subject of a surprising number
of studies. Some of these were probably
stimulated by Watson’s early study
of infant rage as a response to restraint 
(1919). Watson’s methods were to hold 
the infant’s head steady in his hands, 
to pin its arms to its sides, or to hold 
its legs tightly together. Response was 
observed and passed on as generalized 
statements of Watson’s impressions. 


Sherman and Sherman (1925) applied 
pressure (by hand) to the chins of 
infants; they recorded (by eye) generalized
arm motions. Sherman’s later
studies (1927) utilized motion-picture 
recordings of the results of various 
forms of restraint. Apart from its un- 
doubted superiority in assessing just 
what infant response is to restraint, this 
technique also allowed the well-known 
study in which adults are shown the in- 
fant’s response but not the stimulus 
condition giving rise to it; their labeling 
of the response is less accurate in this 
condition, a fact that has been given 
various significance by theorists. Studies 
by Pratt, Nelson, and Sun (1930) 
utilized the additional control condition 
of observing infants only when “asleep 
and dry.” 

Nonreinforcement of responses ordi- 
narily being reinforced is the subject of 
a study by Dorothy P. Marquis (1943). 
The subjects were newborn infants in 
a maternity hospital. Marquis’ pro- 
cedure was to interrupt their bottle- 
feeding sessions for ninety seconds, at 
points when the infant had consumed 
one quarter, one half, and three quar- 
ters of his ration. Infants were observed 
twice a day for the first ten days of 
life. (Some were tested again at the age 
of 3 months to determine whether in- 
dividual differences noted at birth had 
persisted. Mothers brought their chil- 
dren back to the hospital for these ob- 
servations.) The responses observed 
were general body activity, recorded by 
mounting the bassinet as a stablimeter, 
and also the latency and duration of 
crying and of mouth activity. 

Sudden and/or strong stimulation of 
an infant has been studied sparingly.
Watson and Rayner (1920) performed
a classic experiment by teaching 9- 
month-old Albert to fear a rat by pair- 
ing it with the loud sound produced by 
striking a steel bar vigorously with a 
hammer. Recording was done primarily 
by visual impression. The generalization 
of Albert’s responses to a rabbit, a dog, 
a sealskin coat, and a mass of white 
cotton was judged to vary quantita-
tively, but again, by eye. Lack of re-
sponse to wooden blocks bespoke a
discrimination of these from the rat. An 
accurate quantitative description of the 
gradient, however, is not likely by such 
methods. 

A detailed study of the immediate re- 
sponse of infants to the sound of a 
revolver shot has been made by Landis 
and Hunt (1937). Their technique 
utilized motion-picture records of the 
infant’s responses. By employing a 
camera capable of recording 1500 
frames per second, they were able to 
record the startle response in infants, 
even in cases in which the Moro reflex 
effectively concealed the startle re- 
sponse from the naked eye and from 
conventional camera records (i.e., those 
taken at 64 frames per second and 
slowed to quarter speed for slow-mo- 
tion studies). Crying was also recorded, 
more readily, of course, but with less 
useful results. That objective techniques 
of recording can far surpass the unaided 
experimenter probably need not be 
labored in this example. 
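The advantage of the high-speed camera can be put as simple arithmetic. The sketch below is illustrative only: the two frame rates are those given above, whereas the 100-millisecond event duration is a hypothetical figure chosen for the example and is not a value reported by Landis and Hunt.

```python
def frame_interval_ms(frames_per_second):
    """Time between successive exposures, in milliseconds."""
    return 1000.0 / frames_per_second


def frames_during_event(event_ms, frames_per_second):
    """Number of frame intervals that fit within an event of the given duration."""
    return event_ms * frames_per_second / 1000.0


# Frame rates are taken from the text; the event duration is an assumed value.
HYPOTHETICAL_EVENT_MS = 100.0

for label, fps in (("high-speed", 1500), ("conventional", 64)):
    print(label,
          round(frame_interval_ms(fps), 3), "ms per frame;",
          round(frames_during_event(HYPOTHETICAL_EVENT_MS, fps), 1),
          "frames during the assumed event")
```

Slowing the conventional film to quarter speed lengthens viewing time but adds no frames; the sampling interval is fixed at the moment of exposure.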

Changes in Complex Behavior. 
Studies of sequences of responses in the 
younger infant consist primarily of in- 
vestigations of the development of 
reaching, sitting, and similar motor de-
velopment. This research is devoted to
examining the influence of presumably
biological variables upon the ability of
the infant to perform such instrumental
learning. The variable, although labeled
“biological,” in experimental terms is
defined simply as chronological age, and
the experimental question, as asked in a
study by Dennis (1941), is simply,
“Does a restriction of certain types of
environmental interaction (with adults)
hinder such learning?” The procedure
was to rear a pair of fraternal twins
from the second month of life through
the fourteenth, under conditions of
minimal social interaction with each
other or with adults and with restricted
opportunity for practice of various
motor behaviors. Dennis compiled
norms from the records of 40 baby
biographies and checked the twins’ de-
velopment against these and others.

It should be noted that the labeling 
of this development as requiring 
“biological” or “maturational” variables 
contrasted to simply “learned” is an 
extension beyond the logic of the ex- 
perimental design. Certainly it can be 
concluded that the particular modes of 
learning restricted for these infants are 
not the single essential ingredient of 
development studied (if they were re- 
stricted enough); but there is no guar- 
antee resulting from the experimental 
data that these are the only modes of 
learning responsible for development. 

An example of social learning in 6-to- 
8-month-old infants is afforded by 
Rheingold’s study (1956). Rheingold 
served as mother for four infants in an 
orphanage: she alone was their care- 
taker for seven and a half hours a day, 
five days a week, over a two-month 
period. Otherwise, these infants, and
a matched control group, were cared
for by many, constantly changing care- 
takers. Social discrimination or gen- 
eralization to the experimenter, to a 
relatively unknown second person, and 
to a completely unknown third person 
were tested weekly. The test involved 
observation of the infant’s response to 
engineered social situations with each 
of these three persons. These test situa- 
tions were designed to approximate 
“natural” social events, such as an adult’s 
approach to the crib, holding the baby, 
speaking to him (“Hello, baby, how are 
you?” “You naughty baby, what did 
you do?”), and returning the baby to 
the crib. 

The experimental techniques devised 
by Rheingold in this study present 
some contrast to those of the studies 
previously cited. One difference is that 
Rheingold’s study of social learning 
requires an experimental history to be 
set up, and this history is eminently a 
social one. Her study goes a long way 
in demonstrating that experimental 
techniques can be applied to these prob-
lems in the human infant with good re-
sults.


Later Infancy 


By the time an infant is 9 months of 
age, much significant development has 
taken place. Largely this is a matter of 
motor competence: the 9-month-old
can stand by holding furniture; he has
been able to sit unassisted for some
weeks; and his hands are becoming in-
creasingly free and adept for manipula-
tion of any loose parts of his
environment. Very shortly he will
creep. By the end of later infancy (the
end of the second year) this motor de- 
velopment will be far advanced. The 
child will be able to approach and 
manipulate a great deal of his environ- 
ment. This will allow the use of many 
of the experimental techniques de- 
veloped for adult organisms of various 
species. However, during this same 
period, the child’s verbal development 
will increase from a near-zero level to 
one which will be of considerable use 
in affecting the child’s behavior. From 
the experimenter’s point of view, this will
prove a mixed blessing as well as an im- 
portant phenomenon for study. 


INVESTIGATIVE PROCEDURES 


The control of pre-experimental con- 
ditions presents a few problems differ- 
ing from those considered for the 
younger infant. In this age range the 
child must often enter into the experi- 
ment rather than simply be placed in 
it. Therefore, the use of inducements 
becomes a factor. Typically, the parent 
or caretaker will bring the child, after 
which the experimenter can frequently 
rely upon curiosity for a continuance 
of the child in the experiment. 
Arsenian’s study (1943) documents the 
importance of this problem quite 
clearly. 

Instructions to the subject are a prob- 
lem even in this age range. Because 
more complex experimental situations 
are possible here, some kind of instruc- 
tion may be necessary, if only to save 
the experimenter time. The child in his 
second year frequently can be told 
what to do, if simple enough words are 
used in short enough sentences. If suc-
cessful, this is undoubtedly an efficient
technique. Too often, however, the 
experimenter may find that he promotes 
misunderstanding in the child by the
use of verbal instructions alone. Sometimes
this can be corrected by simpler in- 
structions. The experimenter may find 
that an experienced teacher can sug- 
gest an optimum wording (Brackbill, 
1958). An alternative is simply to 
demonstrate to the child what he is to 
do. Imitative behavior may be expected 
with good probability of success. The 
child should be given a chance to 
demonstrate that he understands his in-
structions before the experiment starts.

Keeping the child free from acci-
dental upsets before the experiment re-
mains a critical problem at this age.
Emotional responses are easily elicited,
and their consequences may be strong
enough to compete successfully with
many of the behaviors under study.

The problems of maintaining control
over the general setting of the experi- 
ment are somewhat different from those 
described for early infancy. Time of 
day, in terms of deprivation-satiation 
cycles, is still important; however, 
longer cycles between feedings and 
naps can be expected. Similarly, ex- 
perimental sessions can often be longer 
than for the younger infant. The prob- 
lem of keeping physical stimulation 
under control is still present. As in the 
case of the younger infant, the ad- 
vantages of delimiting the experimental 
environment by the use of panels, 
screens, and specially designed rooms 
can be of critical value. The very 
definitely increased sensitivity of the 
child to social stimuli introduces a com-
plex problem, centering about the be-
havior of the mother (when present)
and the experimenter (when observ-
able). These problems are best illus-
trated by the specific instances in which
they arise; some of them are dealt with
in the examples that follow. Too often,
though, this source of variation is neg-
lected. Merrill’s well-known study
(1946) demonstrates some of the pos- 
sibilities involved; even though it deals 
with older children, its implication is 
readily applied. 

The examples that follow are note- 
worthy in that they reflect some of 
these new problems as well as the wider 
range of stimuli and responses now
available to experimental study. Selec- 
tion has been made essentially on this 
basis. 

Conditioning. Studies of conditioning 
in later infancy frequently exemplify 
the same lack of distinction between 
classical and instrumental conditioning 
noted earlier. This is, of course, largely
a historical accident: the law of effect 
apparently was not widely recognized 
until relatively recent times, and the 
studies to be cited are not recent. Much 
of the confusion arises, as before, from 
the use of milk as an eliciting stimulus 
(since milk is an eminent candidate for 
a reinforcing rather than eliciting stimu- 
lus in any instrumental conditioning 
design involving infants). The studies 
of Krasnogorski (Mateer, 1918) are an 
example. Initial experiments utilized the 
showing of a glass of milk to a 14- 
month-old, with observation made of 
the frequency of swallowing responses, 
essentially by eye. The use of swallow- 
ing is based upon the premise that the 
conditioned secretion of as little as 0.5 
cubic centimeters of saliva will induce
swallowing in infants of this age. Later
studies by Krasnogorski utilized a tam- 
bour, placed over the thyroid cartilage 
or over the hyoid bone, connected to a 
kymograph by rubber tubing, which 
differentially recorded both swallowing 
and mouth-opening responses. In still 
later studies direct measurement of 
salivation was made by fitting a cup- 
shaped receptacle directly over the 
salivary gland and leading the saliva 
away through a tube to a receptacle 
where it could be counted as drops 
emitted per time interval. This last tech- 
nique, although admirably direct, also 
involved an almost prohibitive amount 
of instrumentation connected directly 
to the infant and is probably better 
suited to considerably older children. 
The age of the children studied ap- 
parently ranged from the first 2 or 3 
months (where the method was re- 
ported as unsuccessful) to 6 years. 

The setting of the study is described 
as an isolated room, in which the child 
was placed upon a table, usually with 
his eyes bandaged. Typical trials were 
spaced at three-minute intervals, the 
unconditioned stimulus being presented 
fifteen seconds after the conditioned 
stimulus, which often was maintained 
for a full minute. 
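The timing of such a session can be laid out as a simple schedule. The following is a minimal sketch, assuming that the three-minute spacing is measured from one conditioned-stimulus onset to the next and that the fifteen-second delay is measured from conditioned-stimulus onset; the report as summarized here does not settle either point, and the function and field names are hypothetical.

```python
def trial_schedule(n_trials, trial_spacing_s=180, cs_duration_s=60, us_delay_s=15):
    """Return onset and offset times (seconds from session start) for each trial.

    Assumption: spacing runs from CS onset to CS onset, and the
    unconditioned stimulus (US) follows CS onset by us_delay_s.
    """
    schedule = []
    for i in range(n_trials):
        cs_on = i * trial_spacing_s
        schedule.append({
            "trial": i + 1,
            "cs_on": cs_on,
            "us_on": cs_on + us_delay_s,
            "cs_off": cs_on + cs_duration_s,
        })
    return schedule


for event in trial_schedule(3):
    print(event)
```

Under these assumptions the conditioned stimulus overlaps the unconditioned stimulus for forty-five seconds on every trial, and two minutes separate each offset from the next onset.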

At least two other studies of infant 
conditioning are immediate responses to 
Krasnogorski’s publications. Mateer 
(1918) cited the importance of Krasno-
gorski’s results and set about verifica-
tion. Much of her technique is a direct
reaction to her own criticisms of 
Krasnogorski’s. In particular, she has 
been at pains to specify in more detail 
the experimental setting, the relation-
ship between the experimenter and the
subject, and the subjects (in terms of
sex, IQ, normality or deficiency, and 
other psychological measures). Her 
technique involves several interesting 
variations. The use of an eye bandage 
was modified to provide the conditioned 
stimulus. The child wore the bandage 
on the forehead throughout the ex- 
periment, and the conditioned stimulus 
consisted of the experimenter’s sliding 
down of the bandage over the eyes. The 
bandage remained over the eyes for 
twenty seconds, and was then slid up 
to the forehead again. At the eleventh 
second of the duration of the con- 
ditioned stimulus a bit of sweet choco- 
late was placed in the child’s mouth 
as the unconditioned stimulus. Record- 
ing of swallowing responses was done 
both by eye and by an apparatus, 
similar to Krasnogorski’s, operating 
from a tambour over the hyoid bone or 
thyroid cartilage. 

Marinesco and Kreindler (1933) re- 
port studies of infant conditioning that 
also represent a direct reaction to the 
work of Krasnogorski. Their method is 
notable in that the unconditioned stimu- 
lus used is unavoidable electric shock, 
so that there can be little question that 
this study represents classical rather than 
instrumental conditioning. The in- 
fants ranged in age from 25 days (no 
success) to 3/4 years; many were seen 
on repeated occasions, separated in some 
cases by several months. Most of the 
children were normals, but some were 
identified simply as a spastic idiot, 
microcephalic idiot, hemiplegic, or re-
tarded child. Studies were made of the
phenomena of conditioning, extinction, 
retention, spontaneous recovery, in-
hibition, conditioned inhibition, delayed
response, and the chaining of several
conditioned stimuli to the uncondi- 
tioned stimulus. 

It is notable that none of these studies 
includes control over the sort of pro- 
cedures that Wickens and Wickens 
label “pseudoconditioning.” Hence it 
is debatable how much of their effects 
are ascribable simply to repeated pre-
sentation of the unconditioned stimuli,
independently of any association be-
tween the conditioned and uncondi-
tioned stimuli. As already stated, in the
studies of Krasnogorski and Mateer it
is debatable how much of the effects 
are ascribable to operant rather than 
respondent conditioning. Certainly it 
seems clear that the entire field of in- 
fant conditioning requires a modern 
replication, making use of newer tech- 
niques and attending to findings and 
concepts that have assumed major sig- 
nificance since these studies were per- 
formed. (It remains to be seen, though, 
whether the psychologist so inclined will 
find as ready a willingness on the part 
of parents or institutions today to allow 
infants to be subjected to some of these 
rather heroic procedures.) The simple 
insertion of a pressure-sensitive device 
into an infant’s feeding bottle, which 
would close a sensitive switch, should 
allow an easy recording of the rate of 
the sucking response; the insertion of 
a simple pressure valve, which would 
close the nipple in response to the ex- 
perimenter’s actions, should allow con- 
trol of the supply of milk as a stimulus. 
This special kind of a bottle converts 
any crib into a laboratory and invites 
a study of the sucking response in terms 
of the scheduling of its consequences. 
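How a record from such a bottle might be reduced to a response rate can be sketched as follows. This is a hedged illustration, not a description of any existing apparatus: it assumes that each switch closure has been logged as a time in seconds from the start of the session, and the function name, parameters, and sample times are all hypothetical.

```python
import math


def rate_per_minute(closure_times_s, session_s, window_s=60.0):
    """Convert switch-closure times into responses per minute for each window.

    closure_times_s: assumed log of closure times (seconds from session start).
    session_s: total session length in seconds.
    window_s: width of each counting window in seconds.
    """
    n_windows = max(1, math.ceil(session_s / window_s))
    counts = [0] * n_windows
    for t in closure_times_s:
        if 0 <= t <= session_s:
            # A closure falling exactly at session end is counted in the last window.
            index = min(int(t // window_s), n_windows - 1)
            counts[index] += 1
    return [count * 60.0 / window_s for count in counts]


# Hypothetical closure times, for illustration only.
closures = [1.0, 2.0, 30.0, 61.0, 119.0]
print(rate_per_minute(closures, session_s=120.0))
print(rate_per_minute(closures, session_s=120.0, window_s=30.0))
```

Shorter windows give a finer picture of changes in rate at the cost of noisier counts; the choice would depend on the schedule under study.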

If any aspect of Mateer’s method is in
fact instrumental or operant rather than
classical or respondent conditioning, it
is an example of a restricted operant
situation, one in which the child may
make only a single response per trial
and in which trials are necessarily
separated somewhat in space and/or
time. The free-operant situation, by
contrast, is one in which the child’s
response does not appreciably displace
him either in space or in time from his
next response; hence behavior may be
emitted at a wide range of rates, which
may vary freely (Ferster, 1953). This
technique is frequently said to be more
sensitive and more flexible than the
restricted operant situation. However,
it apparently has seen no application to
the period of late infancy. The studies
of Brackbill, and of Rheingold and
Gewirtz, represent free-operant situa-
tions in early infancy, utilizing such
responses as smiling or vocalizing. The
detailed development of a free-operant 
technique for older children, charac- 
terized by several studies described in 
the next section, involves a thoroughly 
ambulatory child in a relatively strange 
room and uses lever pressing as a re- 
sponse. This technique has been found 
barely feasible and hardly rewarding 
below the 2-year level; hence there 
appears to be a gap in the technology of 
operant-conditioning procedures for 
late infancy. The development of tech- 
niques and reinforcing stimuli, no doubt 
by borrowing selectively from the pro- 
cedures successfully applied to early 
infancy and early childhood, remains 
one of the challenging problems of the 
laboratory-experimental approach to 
child development. 

Discriminative Processes. Studies of
discriminative capacities in the older in-
fant almost invariably make use of his
ability to prehend. Ling (1941) has 
made a study of form discrimination in 
infants, which exemplifies an efficient 
technique for this age bracket. The basic 
method of the study is that of multiple 
choice. The infant, sitting in an or-
dinary hospital crib, had placed before
him a large gray tray containing a semi- 
circle of five holes, each spaced the 
same distance from the infant. In these 
holes form blocks painted a bright 
yellow (circle, oval, square, triangle, 
and cross) could be mounted. These 
blocks might be fastened down or not 
at the experimenter’s option. A one- 
way vision screen, hung from the ceil- 
ing of the small, specially made 
experimental cubicle, swung out to 
cover the stimulus objects as they were 
placed on the child’s tray and then 
swung back to uncover them, allowing
the experimenter to observe the child 
from behind it. Usually, one form was 
a positive stimulus, and another (or 
others), negative. The positive form 
was removable from the tray by the 
child; the negative form was fixed to 
the tray. Furthermore, the positive 
form had previously been dipped into 
a strong saccharine solution and al- 
lowed to dry; if the child brought the 
form to his mouth (a frequent oc- 
currence), it tasted quite sweet. (The 
forms were sterilized continuously 
throughout the experiment.) The 
usual techniques of randomized order 
of presentation and position were fol-
lowed. Responses scored included visual
fixation, reaching for a form, reaction 
time, handedness, mouthing the form, 
and general “affective attitude,” includ-
ing vocalization. (It is not clear in
Ling’s report how the form was taken 
from the infant if he chose to suck on 
it.) The process was adapted to a study 
of preference for forms simply by not 
fastening any form to the tray. (Pre- 
sumably, all forms were sweetened.) 

The technique seems an efficient one 
and can be applied to infants as young 
as 6 months. Ling’s comments indicate 
that the motivating power of the en- 
tire problem seems to increase as the 
child approaches about 1 year in age 
and to decrease after that. In particular, 
his visual attention to the forms after 
that age is transitory; however, when 
he does attend, his performance is 
coordinated, alert, and controlled. 

Munn and Steining (1931) report 
a variant on this technique, using a 
single 15-month-old boy as a subject, 
apparently in his own home, seen daily 
for well over a month. The discrimina- 
tion apparatus was a box with two 
doors on its front face, each door bear- 
ing a different figure (circle, triangle, 
square, etc.) and fronting on a small 
compartment that would contain a 
piece of chocolate if the form were 
the “correct” one. Between trials, while 
the chocolate and the forms were being 
arranged, the child played games in an 
adjoining room (in effect, he “hid” 
from the experimenter). Close to 400 
trials were required for an initial dis- 
crimination. 

Skeels (1933) has simplified this 
kind of procedure somewhat by adapt- 
ing the familiar formboard. In this case 
the child is presented with a board con- 
taining a number of forms, each fitting 
into its own recess. Under the correct 
form, when he lifts it, the child will
find a cookie. This method is certainly
applicable to children of 15 months 
(the youngest children Skeels tested) 
and presumably may be applied to chil- 
dren just old enough to lift the form 
from the board. I. M. L. Hunter (1952) 
has reduced the technique to its bare 
rudiments for preverbal infants by 
simply concealing a raisin under one 
of two blocks of wood (differing in 
size) and teaching the child to lift a 
block to find the raisin underneath. 

Most of these studies have used 
procedures as simplified as possible. 
Gellermann (1933a) has performed
studies of discrimination in children in 
which he utilized a procedure that, 
although not complex, is still more 
cumbersome than many of those de- 
scribed above. However, his procedure, 
an almost perfect translation of the 
familiar Yerkes alley technique used 
in animal experimentation, allows a 
close comparison of children and ani- 
mals (in this study, chimpanzees) in 
the same physical apparatus. The child 
(or chimpanzee) was seated on a chair 
facing the forms to be discriminated, 
which were mounted just behind and 
just beside doors on a platform, under 
which the child (or chimpanzee) might 
find chocolate-covered cookies, ba- 
nanas, grapes, oranges, or apples. Sub- 
jects were handled in a standard way 
(whether child or chimpanzee) and 
spoken to in the same formula (“Now 
sit on the chair”).

An interesting feature of Geller- 
mann’s study is the use of a special 
order of stimulus pairs. Gellermann
generated a series of orders in which 
the correct stimulus was to be placed 
on a particular side (e.g., the left side),
designed to minimize the formation of
position habits and still yield a chance 
score of 50 per cent correct. The series 
consisted of ten trials each; some 44
series were generated (Gellermann,
1933b).
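The general idea of such orders can be sketched by enumeration. The two constraints used below (equal numbers of left and right placements, and no more than three successive placements on one side) are assumptions adopted for illustration; they are not offered as Gellermann’s full set of criteria, and the sketch is not expected to reproduce his series or their number.

```python
from itertools import product


def longest_run(series):
    """Length of the longest run of identical positions in the series."""
    longest = run = 1
    for previous, current in zip(series, series[1:]):
        run = run + 1 if current == previous else 1
        longest = max(longest, run)
    return longest


def score(series, responses):
    """Number of trials on which the response matches the correct side."""
    return sum(s == r for s, r in zip(series, responses))


def candidate_series(n_trials=10, max_same_side=3):
    """Enumerate balanced left/right orders with no long runs.

    Both constraints are assumptions made for this sketch.
    """
    half = n_trials // 2
    kept = []
    for combo in product("LR", repeat=n_trials):
        series = "".join(combo)
        if series.count("L") != half:
            continue  # unbalanced: a side preference would not score 50 per cent
        if longest_run(series) > max_same_side:
            continue  # long runs would reward staying on one side
        kept.append(series)
    return kept


series_list = candidate_series()
always_left = "L" * 10
print(len(series_list), "candidate orders; first:", series_list[0])
print("score for an always-left child:", score(series_list[0], always_left), "of 10")
```

Because every retained order places the correct stimulus on each side five times, a child responding wholly to position earns exactly the chance score; further criteria would be needed to guard against other systematic habits, such as alternation.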

Motivation. Only one study of moti-
vation is reported for this age range. The
question of evaluating the effectiveness 
of different stimuli as reinforcers or 
rewards has had little attention here, 
and procedures are far from adequate. 

Mast (1937) has reported a technique 
applied to children just below 2 years 
of age and older. The child was pre- 
sented with a wooden box with a 
hinged glass top. Through the glass 
top, the child could see a toy inside. 
However, the only way to open the 
box and get at the toy was to insert 
a stylus into a small hole in the front 
of the box; when the stylus contacted 
the brass plate in which the hole was 
drilled, a doorbell rang inside the box, 
and the lid opened, allowing the child 
access to the toy. The experimenter 
sat behind the child and timed the 
response. The only clue afforded to the 
child in solving the problem lay in the 
instructions he was given at the out- 
set: “Do you see what is in this box? 
When you have opened the box you 
may have it. You must take this in your 
hand (giving him the stylus) and open 
the box with it.” Questions by the 
child were answered briefly, without 
giving any help in the problem. Also, 
“encouragement” was offered to the 
child from time to time; since many 
studies have shown the reinforcing 
effectiveness of approval, the “motiva- 
tional” aspects of this procedure seem 
quite varied. With further experiences
with the box, it becomes a question
whether the child is motivated pri-
marily by approval, the doorbell, or the
toy. (When encouragement is used 
freely by the experimenter, in a non- 
standard manner, it would seem likely 
that it is used most when the subject is 
least motivated by other stimuli, is in- 
attentive, or displeased.) 

It is doubtful whether this technique
could be extended to much younger 
children without increasing resort to 
demonstration, which would obviate 
much of the point of the study. How- 
ever, the use of a transparent container 
to present a variety of almost available 
incentives is probably a useful tech-
nique at much younger levels, if co-
ordinated with a different response.
Apart from any other direct techniques 
of comparing reinforcing stimuli, then, 
there remains a summary of the stimuli 
used in the studies described above: 
candy, sweets, cookies, raisins, milk, 
honey, fruit, toys, and (possibly) the 
opportunity to manipulate (Ling).

Frustrating, Sudden, or Strong 
Stimulation. One study classified under 
this heading is probably typical of an 
experimental difficulty that is en- 
countered often in this age range, al- 
though the population used can hardly 
be termed typical. Arsenian (1943) has 
studied the effects of the mother’s 
(or a familiar adult’s) absence on the 
free behavior of a child in a strange 
room. Her subjects were children from 
the nursery of the Massachusetts State
Reformatory for Women, many of 
whom had been born and raised in that 
relatively restricted environment, with 
little or no outside experience. In age, 
they ranged from 11 to 30 months; the
youngest could barely walk. Half of 
them had mothers who worked in the 
nursery itself; the rest saw their mothers 
daily but only during visiting hours.
Cattell and Gesell IQ scores indicated a 
normal range. Half of the children were 
brought to a strange room every other 
day and left there, alone, for five 
minutes. There were 11 such sessions. 
The other half of the children were 
accompanied into the room either by 
their mothers or by a familiar adult 
(a nursery caretaker). Each of the two
groups was subdivided, and half of each 
group was placed in a contrasting con- 
dition for the last six of its 11 sessions 
in the strange room (i.e., some children 
who had been alone the first five trials 
were accompanied in the last six and 
vice versa). The room itself was some 
distance from the nursery, newly 
decorated, and filled with new toys 
and pictures; it was designed to be 
attractive to the children. The child 
was observed in the room from behind 
a one-way screen, and behavior was 
recorded as a protocol on an electrically 
timed recording device that yielded be- 
havior in terms of frequency per time 
interval. Behavior was judged as be- 
longing to one of several categories: 
crying, autistic gestures, play, locomo- 
tion, and vocalization. Patterns of these 
response categories were synthesized 
into positively directed and negatively 
directed sequences. Adaptation to the 
situation as a function of repeated ex- 
periences with it and as a function of 
mother (familiar adult) presence was 
readily demonstrated. It may be in- 
structive to experimenters that adapta- 
tion (replacement of crying and 
autistic gestures by play, locomotion,
etc.) in children left alone in the room
on all trials was not gradual and linear 
but instead tended to be abrupt. In- 
telligence and age played little if any 
role within the range observed. 
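The arrangement of conditions across sessions can be written out explicitly. The sketch below follows the description above (eleven sessions, with half of each group changed to the contrasting condition for the last six); the labels “alone” and “accompanied” and the function name are conveniences introduced here, and the number of children in each subgroup is deliberately left out because it is not given above.

```python
def session_conditions(initial, switches, n_sessions=11, n_before_switch=5):
    """List the condition in force at each session for one subgroup.

    initial: "alone" or "accompanied" (labels assumed for this sketch).
    switches: True if the subgroup changes to the contrasting condition
              after the first n_before_switch sessions.
    """
    opposite = {"alone": "accompanied", "accompanied": "alone"}
    later = opposite[initial] if switches else initial
    return [initial] * n_before_switch + [later] * (n_sessions - n_before_switch)


# Four subgroups: two starting conditions, each either constant or switched.
design = {
    (initial, switches): session_conditions(initial, switches)
    for initial in ("alone", "accompanied")
    for switches in (False, True)
}

for (initial, switches), sessions in design.items():
    print(initial, "switched" if switches else "constant", sessions)
```

Laying the design out in this way makes it plain that the constant subgroups serve as the comparison against which the effect of the change at the sixth session is read.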

The technique of this study seems to 
manipulate an important class of vari- 
ables for this (and older) age ranges. 
The response measures used by 
Arsenian are broad and general (ap- 
propriate for an initial investigation); a 
refinement of these, and a wide use of 
many other responses and tasks, should 
prove of great significance in experi-
mental investigations of these children.


Early Childhood 


Before 1950 research with toddlers 
was devoted mostly to response anal- 
yses. With the shift of interest from 
norms to antecedent and consequent 
relationships has come a small but in- 
creasing number of investigations with 
laboratory-experimental procedures. 


OBTAINING SUBJECTS 


Much of the research on early child- 
hood was made possible not only by 
the availability of subjects in nursery 
schools but also by the establishment, 
about 1920, of laboratory nursery 
schools. These laboratory schools, part 
of a college or university (Moustakas 
and Berson, 1955), have undoubtedly 
contributed the major portion of chil- 
dren for study. The definite advantages 
to doing research in a laboratory 
nursery school are the availability of 
research facilities and equipment and 
the stimulation usually provided by
staff and students. One frequent dis-
advantage is the limited number of
subjects, which is especially felt when 
teaching and research needs are heavy.
Nevertheless, even then a sufficient
number is usually available for pilot
studies and intensive individual analyses.

In the nursery-school setting two es-
sential integral parts of the investigator’s
task are to conduct talks and discus-
sions with parents, in which the nature
and possible implications of the re-
search are explained, and to develop
an understanding and cooperative re-
lationship with the teaching staff, so that
children may be taken from groups
and returned without hindering the
procedures of the study or the routines 
of the school. 

Probably the second largest number 
of subjects have come from private 
nursery schools and from private and 
public day-care centers. Day-care 
centers are all-day schools for the 
youngsters of working mothers. Re- 
quests for permission to embark upon 
a study in these two types of school 
are generally addressed to the owner, 
head teacher, or chairman of the ad- 
ministrative board. Finding adequate 
working conditions is probably the 
problem most frequently encountered
here. In most instances little or no space 
is available. Sometimes there may be a 
spare room, but rarely will it prove 
sufficiently isolated from the activities 
of the school. When space limitations
make it impractical to control experi- 
mental conditions adequately, conduct- 
ing research in a nearby research center 
or a mobile laboratory are possible 
alternatives. Again, the investigator 
must assume responsibility for giving
talks on his research to teachers and
parents.

Subjects are sometimes procured by
direct contacts with parents. Lists of 
parents with children of the required 
age have been obtained from pedia- 
tricians, teachers, clinic waiting lists, 
mothers’ clubs, and students from uni- 
versity housing groups. In all instances 
personal and written contacts are made;
and in many situations transportation is
provided to a research center. Under
some circumstances psychological serv- 
ice is given in exchange for cooperation. 

Residential institutions—orphanages 
or schools for exceptional children, such 
as the retarded, blind, emotionally dis- 
turbed, and dependent—provide ex- 
cellent opportunities for experimental 
analyses. Some have research labora- 
tories in operation; others make facilities 
available for specific studies on re- 
quest. Procedures for obtaining per- 
mission and establishing working re- 
lationships are in substance the same 
as those described in the section dealing 
with infants in institutions. In prepar- 
ing for a relatively short-term study in 
a residential school, portable or pre- 
fabricated observation booths (Gewirtz, 
1952) and readily transportable ap- 
paratus (Lipsitt, 1958; Norcross, 1958) 
deserve consideration. 


SAMPLING 


Youngsters procured through any of 
the above procedures are, of course, 
not random samples of preschool chil- 
dren. For example, a survey by Mou- 
stakas and Berson showed that 
“children enrolled in laboratory nurs-
ery schools came largely from middle-
class professional and business families”
and that “Only 55 per cent of the schools 
were open to both white and negro 
children; 3 per cent were for colored 
children, 42 per cent for white children 
only” (p. 108). Samples of children 
from institutions would also be ex- 
pected to be biased on socioeconomic 
and racial dimensions as well as others. 

Since getting random samples of 
preschoolers for experimental studies is 
a difficult, if not an impossible, task, 
one realistic objective might be to 
obtain random samples within the 
available subpopulations. Another might 
be to work out the specifications of 
a representative sample, search for 
parents with youngsters who qualify, 
and solicit their cooperation. 
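
The first objective, random samples within the available subpopulations, can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the subpopulation labels, the child identifiers, and the sample size of eight are invented, and nothing here is drawn from the studies cited.

```python
import random

def sample_within_subpopulations(roster, per_group, seed=0):
    """Draw a simple random sample of fixed size from each subpopulation.

    roster: dict mapping a subpopulation label to a list of child identifiers.
    per_group: number of children to draw from each subpopulation.
    """
    rng = random.Random(seed)  # fixed seed so the draw can be reported and repeated
    chosen = {}
    for label, children in sorted(roster.items()):
        if len(children) < per_group:
            raise ValueError(f"{label!r} has only {len(children)} children")
        chosen[label] = sorted(rng.sample(children, per_group))
    return chosen

# Hypothetical rosters; labels and identifiers are invented for illustration.
roster = {
    "laboratory nursery school": [f"N{i:02d}" for i in range(1, 21)],
    "day-care center": [f"D{i:02d}" for i in range(1, 16)],
    "parent contact list": [f"P{i:02d}" for i in range(1, 31)],
}
sample = sample_within_subpopulations(roster, per_group=8, seed=1)
for label, ids in sample.items():
    print(label, ids)
```

Such a draw is random only with respect to each roster; it does not remove the socioeconomic and racial biases of the rosters themselves.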

Unless special status or historical 
conditions are the major point of in- 
terest in a study, it is advisable to 
eliminate preschoolers characterized by 
anomalies, emotional disturbances, de- 
velopmental retardation, and recent 
changes in living conditions (such as the 
arrival of another sibling, the addition 
of a new relative to the household, a 
separation of the parents, or a move to 
a new location). 


INVESTIGATIVE PROCEDURES 


As in the discussion of infants, in- 
vestigative procedures here are broken 
down into manipulating independent 
variables and measuring responses on the 
one hand and holding constant setting 
conditions on the other. The latter, it 
will be recalled, are subdivided into 
conditions operating before and 
during the manipulations of the in- 
dependent variable. In discussing ex- 
perimental work with children in this 
age range and older, it will be helpful 
to divide further the conditions operat- 
ing before experimental manipulation 
into (1) inducements to participate in 
the research, (2) instructions for per- 
forming the experimental task, and (3) 
influences occurring between sessions. 

Since the preschool child often shows 
reluctance to accompany a compara- 
tive stranger, it is common practice to 
establish a routine to minimize emo- 
tional reactions to attempted separations 
and to reduce the number of refusals. 
Unfortunately, most reports fail to 
describe the inducement procedure; 
some give only a brief statement, such 
as “the child was asked to come to the 
adjoining room to play games.” Often 
it is impossible to determine whether 
variations in the inducing procedures 
could account for differences in the 
results of some replicated studies. 

In planning the inducement proce- 
dure, the following considerations, 
drawn from common practice, might 
be taken into account. First, one person, 
ordinarily a young woman, should take 
the young child from the school group 
or home to the laboratory and return 
him. Second, this person should spend 
a good deal of time getting acquainted 
with the child before the study begins 
if at all possible. Third, whatever the 
adult says in asking the child to come 
to the experimental room should be 
simple, neutral in tone, and repeated 
each time in the same manner. For 
example, “Now it is your turn to 
come and play games.” “Would you 
like to play some games and get some 
of these (showing toys or edibles)?” 
Fourth, the relationship between the 
young woman and the child during 
the trips to and from the laboratory 
should be pleasant and nondirective: 
the caretaker never initiates conversa- 
tion and responds to questions and com- 
ments in a pleasant reflective manner. 

It is well established that variations in 
procedures may produce variations in 
experimental results. It is therefore 
standard practice to plan and report in- 
structions with care. There are two 
special problems in research with chil- 
dren (preschool children in particular). 
The first concerns comprehension. 
Does the child understand what is re- 
quired of him? Inevitably, when a 
youngster is asked whether he under- 
stands his task, he nods his head, smiles, 
and eagerly says “yes.” Later he dem- 
onstrates otherwise. The second prob- 
lem is the applicability of a set of in- 
structions to groups at various matura- 
tional levels or with developmental de- 
fects. Working out the simplest set 
of instructions with the aid of a nursery- 
school teacher and giving the child 
ample opportunity to show understand- 
ing by performance (Brackbill, 1958) 
will often be useful. Examples of simple 
sets of instructions are, “Here are two 
toys. You may turn either one over 
(like this) and see what is under it. You 
may keep whatever you find.” “Here 
are some toys. You may play with 
them as long as you like.” House and 
Zeaman provide an example of a pro- 
cedure that allows the subject time and 
opportunity to learn the task in a dis- 
crimination problem: 


On the first presentation of the stimulus 
tray, both food cups were left uncovered 
with candy placed in one of them. The 
experimenter asked, “Can you find the 
candy?” and pointed if the subject failed 
to pick it up. On the next trial, a gray 
wedge was placed over one of the food 
cups containing candy, with the other 
cup left uncovered. Again, S was asked to 
find the candy and aided by the ex- 
perimenter if he failed. When the subject 
picked up the gray wedge without 
prompting (usually one or two trials), 
discrimination trials were begun (1958, p. 
413). 


The third class of conditions that may 
affect experimental performance in- 
cludes events between sessions, such as 
the subjects’ receiving information, mis- 
information, and erroneous instructions 
from peers, teachers, and parents. Still 
others are upsetting and interfering 
conditions and events altering schedules 
of deprivation related to experimental 
reinforcers. Birthday parties are a com- 
mon example. Discussions among the 
children are another and are one of 
the most difficult problems in this 
respect. Since the behavior of children 
cannot be controlled outside the ex- 
perimental situation, the investigator 
may evaluate the significance of such 
interactions either by setting up a con- 
trol group with matched subjects from 
private homes in different parts of the 
community or by replicating the study 
using a baseline procedure with each 
child serving as his own control. Al- 
ternatively, unauthorized instructions 
from parents and teachers may be pre- 
vented, at least in part, by discussing 
the purpose and nature of the research 
and by telling them what their behavior 
toward the child should be in order to 
be helpful. Upsetting conditions usually 
consist of accidents, fights, or trouble 
at home just before an experimental 
session. Mothers and teachers often can 
supply enough information on such 
matters to allow the investigator to 
evaluate their influence. There are, of 
course, still other interfering condi- 
tions that may compete with the be- 
havior required by the experimental 
task, e.g., holiday celebrations, special 
outings, or even the weather. Changes 
in the living conditions of the child 
or groups might affect motivation (and 
may lead to elimination of subjects or 
modifications in experimental design). 
Again, reports of teachers and parents 
and direct observation will provide most 
of the essential information about these 
happenings. 

If the working situation is adequate 
in terms of space and physical features 
and if there is protection from ex- 
traneous stimuli, the problem of con- 
trolling setting conditions centers upon 
holding constant nonexperimental social 
variables. Controlling the interactions of 
child and adult presents an acute prob- 
lem in research with preschool chil- 
dren. These youngsters do not like to 
be alone in a room for the time required 
by most studies. One solution may be 
to adapt the child to remaining alone 
for increasingly longer periods before 
the experimental run. This approach, 
however, requires considerable time 
and may result in the loss of subjects 
who cannot comply (Warren and 
Brown, 1943). Another possibility may 
be to have an adult in the room and 
attempt to control social interactions. 
Several studies have followed this course 
by having the experimenter present, 
but operating the apparatus from be- 
hind a one-way screen on the experi- 
mental table (House and Zeaman, 
1958), or by having an adult behind an 
opaque screen in the room while the 
experimenter operates the apparatus and 
observes the child from an adjoining 
room through a one-way screen (Bijou, 
1957). 

Now we shall turn to a discussion of 
the ways in which stimuli have been 
manipulated and responses measured. 
As in the infancy section, the approach 
is through representative studies in 
major problem areas. 

Conditioning. Classical conditioning 
with young children has been a prob- 
lem in early childhood in the sense that 
many have been interested in whether 
conditioning principles developed in 
animal studies apply to young humans 
and, if they do, what are the differences 
in the parameters as a function of age. 
Techniques employed are exemplified 
in studies by Marinesco and Kreindler 
(1933), Krasnogorski (1909), and 
Mateer (1918) described in the preced- 
ing section. 

One of the basic issues of the con- 
ditioning studies previously cited is 
involved in studies of preschool chil- 
dren as well. Do instrumental or operant 
conditioning principles, derived from 
animal studies, apply to children? One 
study by Warren and Brown (1943) is 
of particular interest here because 
methodology was seen as the basic 
problem. Some effort was made to 
duplicate the animal situation. Children 
between the ages of 2 and 5 were taken 
into a small room. A horizontal bar 
projected from one wall. Under the bar 
was a white enameled candy tray. 
Also in the room were a nursery-school 
book and a hobby horse. Responses to 
the bar were recorded on a cumulative 
marker, and candy pellets made of 
sugar, corn syrup, and gelatin served as 
reinforcers. They were delivered 
through a tube to the tray. Daily ses- 
sions were held for fifteen days. For 
the first few days the parents were 
urged to come with the child but not 
to enter the room unless needed. The 
experimenter, however, entered the 
room with the child and played with 
him when it seemed indicated. Finally, 
the child was left alone. The similarities 
between these procedures and those in 
animal studies are apparent, but so are 
some of the differences. For example, 
there were differences in motivational 
operations, in the length and number 
of sessions, and in the adaptation pro- 
cedures. Nevertheless, the authors stated 
that their data indicated that known 
operant conditioning principles may be 
validly applied to preschool youngsters. 
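
A cumulative record of the kind mentioned above can be approximated from a list of response times. The sketch below is a minimal illustration, not a description of the original recording equipment: the press times, the 60-second session, and the 10-second sampling step are all hypothetical.

```python
from bisect import bisect_right

def cumulative_record(response_times, session_length, step):
    """Return (time, cumulative responses) pairs sampled every `step` seconds."""
    times = sorted(response_times)
    n_steps = int(session_length // step)
    # bisect_right counts every response made at or before time t
    return [(i * step, bisect_right(times, i * step)) for i in range(n_steps + 1)]

def response_rate(record):
    """Mean responses per second over the record (overall slope of the curve)."""
    (t0, n0), (t1, n1) = record[0], record[-1]
    return (n1 - n0) / (t1 - t0)

# Hypothetical bar-press times, in seconds from the start of a session.
presses = [4.0, 9.5, 12.0, 13.5, 20.0, 21.0, 22.5, 40.0]
record = cumulative_record(presses, session_length=60, step=10)
print(record)
print(response_rate(record))
```

The slope of the record between any two points gives the response rate for that interval, which is the measure usually read from such curves.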

The development of a method for 
the analysis of operant behavior in 
young children was also undertaken by 
Bijou (1957). The objective was to 
devise an experimental situation in 
which a variety of problems indigenous 
to this age span could be analyzed. The 
experimental room, equipped with one- 
way screens, microphones, and speak- 
ers, was colorfully decorated like a 
playroom and contained two tables 
with a toy on each. One toy was a 
commercial product consisting of a 
wooden lever and ball enclosed in a 
cage. Striking the lever flipped the ball 
to the top of the cage. As it fell, it 
automatically rolled to its original posi- 
tion on top of one end of the lever and 
was in position to be catapulted again 
by the next response. The child was 
free to play with this at any time. The 
other toy was a brightly colored box 
with blue and amber jewel lights, a 
metal handle protruding on the front, 
and a tray at one side for receiving re- 
inforcements. Downward pressure on 
the handle was always followed by a 
click and now and then by a 
trinket, a cookie, a piece of dry cereal, 
or candy. Responses, reinforcements, 
and outstanding events were recorded 
on a cumulative marker. A young 
woman brought the subject into the 
room, remained with him during the 
session, and returned him to his school 
group. She followed a routine for re- 
moving the child’s coat, taking him to 
the center of the room, giving simple 
instructions, and retiring behind an 
opaque screen to read a book. The 
experimenter, operating in an adjoining 
room, observed the child through a 
one-way glass screen, monitored the 
apparatus, and gave instructions, 
through a one-way intercom system 
and ear-speaker, to the adult in the 
room. Using this experimental setup 
(see Fig. 4-4), which was duplicated 
in a mobile laboratory (Bijou, 1958), 
investigations have been conducted on 
reinforcement, extinction, discrimina- 
tion, escape, and avoidance training. 

Fig. 4-4. Floor plan of the mobile child study laboratory (Bijou, 1958). 

Discriminative Processes. Studies of 
discrimination outnumber by far all 
others in this age period. This fact is 
even more striking when studies of 
transposition, learning sets, concept 
formation, and abstracting behavior are 
included. A few investigations have 
employed free operant techniques. 
Most, however, involve discrete mul- 
tiple-choice tasks with intervals be- 
tween trials controlled by the experi- 
menter. Usually the mode of present- 
ing stimuli is closely tied in with the 
required responses to increase the prob- 
ability of contiguity between attending 
and choice behavior. For example, sub- 
jects have been required to open a 
drawer under a card with a geometric 
form (Hunter, 1952), open a door with 
a distinctive stimulus (Alberts and 
Ehrenfreund, 1951), press on a glass 
window showing a stimulus object 
(Long, 1940), point to an object hav- 
ing a characteristic stimulus dimension 
(Hicks and Steward, 1935), push a 
button under a light (Shepard, 1956; 
Lipsitt, 1958), move a box (Spiker, 
1956), point to an inverted cup con- 
cealing a toy (Calvin, 1955), or move 
a stimulus object to see whether there 
is candy in a hole beneath (House and 
Zeaman, 1958). 


Fig. 4-5. Wisconsin General Test Apparatus (After House and Zeaman, 1958). 


In general, discrimination studies 
have followed virtually every well- 
known experimental strategy. Subjects 
have been equated but given different 
amounts of pretraining; or they have 
been given the same discrimination 
training but tested under different con- 
ditions. In other studies subjects have 
been assessed as different in some di- 
mension and have been given the same 
discrimination task; then they have 
been tested under different conditions. 

An example of a study using a con- 
trolled multiple-choice technique with 
good control over adult-child interac- 
tion is one by House and Zeaman. 
Their apparatus was a modification of 
the Wisconsin General Test Apparatus. 
(See Fig. 4-5.) 


It consisted of a table with a sliding 
stimulus tray 30 in. by 12 in. with two 
circular food wells 2 in. in diameter cen- 
tered 12 in. apart. In the center of the 
table, separating S from E, was a one-way 
screen. The sliding tray when pulled be- 
hind the screen was invisible to S. For a 
trial, the tray was pushed directly in front 
of S (1958, p. 412). 

Five geometric forms were used, 
each repeated in five colors. Each sub- 
ject was assigned two stimuli selected 
at random from among the 25. Correct 
and incorrect stimuli were designated 
randomly. Subjects were given pre- 
training on the task before beginning 
the training procedure. 


For a discrimination trial, E pushed 
forward the stimulus tray with two stimuli 
covering the two food cups so that it was 
directly in front of S. The two stimuli 
were the same on every trial for a given 
subject with one and only one always 
correct. The position of the correct 
stimulus was varied irregularly from left 
to right according to a Gellermann’s series. 
E recorded S's first response as correct or 
incorrect (1958, p. 413). 
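
The random assignment of stimuli and the irregular left-right ordering described above might be set up as in the following sketch. Several points are assumptions of the sketch rather than details from House and Zeaman: the forms and colors are not named in the passage, so placeholder labels are used, and the position order applies only two simplified constraints (equal numbers of left and right placements and no more than three identical placements in a row). Published Gellermann orders satisfy further criteria that are not reproduced here.

```python
import random
from itertools import product

# The passage does not name the forms or colors, so placeholder labels are used.
FORMS = [f"form_{i}" for i in range(1, 6)]
COLORS = [f"color_{i}" for i in range(1, 6)]
STIMULI = list(product(FORMS, COLORS))  # 5 x 5 = 25 stimuli

def assign_stimuli(rng):
    """Pick two of the 25 stimuli at random; the first drawn is designated correct."""
    correct, incorrect = rng.sample(STIMULI, 2)
    return correct, incorrect

def longest_run(seq):
    """Length of the longest run of identical adjacent items."""
    best = run = 1
    for prev, cur in zip(seq, seq[1:]):
        run = run + 1 if cur == prev else 1
        best = max(best, run)
    return best

def position_series(rng, n_trials=10, max_run=3):
    """Left/right positions for the correct stimulus: balanced, with no long runs."""
    if n_trials % 2:
        raise ValueError("n_trials must be even for a balanced series")
    base = ["L", "R"] * (n_trials // 2)
    while True:  # rejection sampling: reshuffle until the run constraint holds
        series = base[:]
        rng.shuffle(series)
        if longest_run(series) <= max_run:
            return series

rng = random.Random(1958)  # arbitrary seed, chosen only for repeatability
correct, incorrect = assign_stimuli(rng)
series = position_series(rng)
print(correct, incorrect, "".join(series))
```

Fixing the seed lets the same assignment and order be regenerated when the procedure is reported.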

Serial Learning. Serial learning is 
used here to refer to the learning of a 
behavior-chain or a sequence of stimuli 
and responses “with only minor varia- 
tions in the ordinal position of each 
stimulus or response” (Verplanck, 
1957). In a sense, it is a series of linked 
multiple choices. Learning of this type, 
sometimes referred to as sensorimotor 
(Munn, 1954) or multiple responses 
(Underwood, 1949), is of particular 
significance in early childhood, since 
this is the period in which the child 
characteristically strengthens and ex- 
tends manipulatory, locomotor, and 
verbal sequences. Research problems 
have centered on differences related to 
age and on conditions that strengthen 
or weaken chains. Some investigations 
require manual or locomotor responses, 
such as finger and alley mazes; others, 
verbal sequences with numbers or 
words. 

Fig. 4-6. The Solomon token-reward vendor 
(Lambert, Lambert, and Watson, 1953). 

A study by Lambert, Lambert, and 
Watson of the strengthening and weak- 
ening of chains will serve to illustrate 
a technique with a manual sequence. 
The apparatus was the Solomon token- 
reward machine. (See Fig. 4-6.) It con- 
sisted of two boxes, one on the other. 
The top box had a crank on the right 
side and a chute at the front for deliver- 
ing tokens; the bottom one, a slot for 
receiving tokens and a trough below a 
tube for dispensing candy, both on the 
front side. The child turned the handle 
a specific number of turns, received a 
poker chip, inserted the token in the in- 
sert slot, and, after a varying number of 
these act-sequences, received a small 
piece of candy. One of the procedures 
was described as follows: 


The S turned nine turns and this turn- 
ing was followed by a click which ac- 
companied the automatic ejection of a red 
token. The S then inserted the red token in 
the slot of the candy vending machine, 
and then turned the handle of the work 
machine nine times. This time a white 
token was automatically ejected, and in- 
sertion of this in the token slot was fol- 
lowed immediately by ejection into the 
candy trough a small piece of candy (1953, 
p. 324). 

The number of trials per day varied 
from one to four, and the experiment 
lasted ten days. 


Turning time measures were recorded 
throughout. This was done by a second 
E, who sat in an adjoining room and read 
the turnings from a dial which led by a 
flexible cable to the work machine in the 
experimental room (1953, p. 324). 
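
The chain quoted above (nine turns, red token, insertion, nine turns, white token, insertion, candy) can be written as a small state machine. This is a sketch under assumptions, not a model of the actual vendor: the function name, the event labels, and the rule that turns made while the child is still holding a token have no effect are all supplied for illustration.

```python
def run_chain(actions, turns_required=9):
    """Replay a list of actions and return the events the machine produces.

    actions: sequence of "turn" (one crank turn) or "insert" (token into slot).
    Nine turns eject a red token; after the red token is inserted, nine more
    turns eject a white token; inserting the white token delivers candy.
    """
    events = []
    turns = 0
    held = None          # token currently in the child's hand
    next_token = "red"   # token the work machine will eject next
    for action in actions:
        if action == "turn":
            if held is not None:
                continue  # assumption: turning while holding a token does nothing
            turns += 1
            if turns == turns_required:
                events.append(f"{next_token} token ejected")
                held, turns = next_token, 0
        elif action == "insert" and held is not None:
            if held == "white":
                events.append("candy delivered")
                next_token = "red"
            else:
                next_token = "white"
            held = None
    return events

one_trial = ["turn"] * 9 + ["insert"] + ["turn"] * 9 + ["insert"]
print(run_chain(one_trial))
```

Writing the chain out in this way makes each link explicit, so that a strengthening or weakening operation can be stated as a change to one link at a time.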


A serial learning technique involving 
pictorial and verbal stimuli has been 
explored with kindergarten children 
and is described in the serial learning 
section on later childhood and pre- 
adolescence. 

Problem Solving. Studies of problem 
solving, reasoning, rational learning, 
and thinking, taken together, constitute 
another significant problem area in 
early childhood. Are children in the 
initial phase of gaining facility with 
their behavior equipment capable of 
such behavior? An oft-recurring ques- 
tion is, “At what age does reasoning 
behavior start?” Another interest cen- 
ters around functional problems. If 
problem-solving behavior starts in early 
childhood, this is the time, par excel- 
lence, to study the conditions that in- 
fluence it. However, the dimensions of 
the behavior have not been adequately 
worked out; consequently, the meth- 
ods, procedures, approaches, and tech- 
niques vary almost as widely as the 
problems posed. If an empirical defini- 
tion of the behavior or part of the be- 
havior is adopted, as suggested by 
Skinner, striking advances could be ex- 
pected to follow. 


Problem solving may be defined as any 
behavior which, through the manipula- 
tion of variables, makes the appearance of 
a solution more probable (1952, p. 247). 


An example of one of the few studies 
following this lead is that by Kendler, 
Kendler, Pliskoff, and D'Amato. Their 
aim was to determine whether infer- 
ential behavior resulting from learning 
is influenced by motivational variables. 
The equipment was described as fol- 
lows: 

The apparatus is a gray box-like struc- 
ture with a lever projecting from each of 
its vertical sides. The face of the ap- 
paratus had two rectangular apertures and 
three transparent windows that could be 
covered. The levers, painted black or 
white, when pushed inward, moved small 
boxes (either a yellow square box or a 
red triangular box) along a wooden track 
into the lower aperture. One of these 
boxes was attached by a hidden string 
to a translucent circular plastic container 
mounted on wheels. When this box was 
pulled down, the plastic container was 
drawn into the upper aperture (1958, p. 
208). 

The subjects, drawn from nursery 
schools in the New York metropolitan 
area, were given three training experi- 
ences; two led to the attainment of 
containers with raisins that served as 
subgoal objects, and the third led to 
a box with or without the major goal, 
a gold-colored charm. On test trials the 
child was confronted with the two 
sticks. The choice of one led to a box, 
which, if pulled, would produce the 
major goal object. The container with 
the charm could be seen in one of the 
windows. Motivation was varied dur- 
ing the training and testing trials. 

Motivation. The term motivation is 
used here to refer to behavior changes 
related to differences in reinforcements 
and operations of satiation and depriva- 
tion. The experimental plan of most 
of the studies in this age group employs 
one of the techniques described above. 
Equated groups of children, each of 
whom receives a different class of re- 
inforcers or a different satiation-depri- 
vation operation, are presented with a 
task. Differences in performance are 
compared. 
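
A comparison of this kind can be sketched as follows, taking trials to a learning criterion as the performance measure. The criterion of five consecutive correct responses, the condition labels, and the response records are all invented for the illustration; they are not data or procedures from any study cited in this chapter.

```python
from statistics import mean

def trials_to_criterion(outcomes, run_length=5):
    """Trial on which `run_length` consecutive correct responses is first completed.

    outcomes: list of booleans, True for a correct response.
    Returns None if the criterion is never met.
    """
    streak = 0
    for trial, ok in enumerate(outcomes, start=1):
        streak = streak + 1 if ok else 0
        if streak == run_length:
            return trial
    return None

def group_means(data, run_length=5):
    """Mean trials to criterion per condition, over children who reached it."""
    means = {}
    for condition, children in data.items():
        scores = [trials_to_criterion(o, run_length) for o in children]
        reached = [s for s in scores if s is not None]
        means[condition] = mean(reached) if reached else None
    return means

# Invented response records for two hypothetical conditions.
T, F = True, False
data = {
    "candy": [[F, T, T, T, T, T], [T, T, T, T, T]],
    "praise": [[F, F, T, F, T, T, T, T, T], [F, T, T, T, T, T, F]],
}
print(group_means(data))
```

Children who never reach the criterion are dropped from the mean in this sketch; in an actual report their number should be given separately, since excluding them can flatter a condition.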

Terrell and Kennedy (1957) and 
Terrell (1958), for example, studied 
the effect of different kinds of rein- 
forcement in a discrimination learning 
and transposition situation. In the 
Terrell and Kennedy experiment, 
praise, reproof, candy, and tokens, each 
accompanied by a flash of light, were 
compared. The task consisted of but- 
ton-pushing responses to the larger of 
two 3-dimensional geometric objects. 
There were four groups, one for each 
condition, divided into two age groups, 
4 to 5 and 8 to 9. For the praise group, 
the reinforcement after each correct 
response was “That’s fine. You are 
doing well.” Subjects in the reproof 
group were told “No, not that. You 
are wrong,” after each incorrect re- 
sponse. The candy group was given a 
“small piece of candy.” The subjects 
of the token group were allowed to 
transfer a dried bean from one jar to 
another and were told that as soon as 
“enough” of the beans were transferred 
they could have a “small bag of candy.” 
The control group received only the 
light flash. 

Instructions were given in full at the 
beginning, and after every tenth trial 
this sentence was repeated: “Now re- 
member, the game is to see how soon 
you can learn to make the light go on 
every time.” After reaching the cri- 
terion of learning, a four-trial trans- 
position test was given without chang- 
ing the type of reward. The children 
in the two age groups, 4 to 5 and 8 to 
9, required significantly fewer trials to 
discriminate and transpose when candy 
was used. 

A study by Brackbill and O'Hara 
(1957) also attacked a motivational 
problem dealing with the relative ef- 
fectiveness of presenting and with- 
drawing tangible reinforcement. The 
former was accomplished by giving an 
M & M candy after a correct response; 
the latter, by taking away a candy after 
an error. The procedure was a typical 
multiple-choice technique. The stimuli 
were three boxes differing in color and 
size, and the task was to learn a simple 
position alternation sequence between 
the first and the second boxes. At the 
beginning of the session the subject was 
“staked” to a box of candy. 

Frustrating, Sudden, or Strong 
Stimulation. Despite increasing interest 
in the concept of anxiety, there have 
been relatively few experimental 
studies on the effects of strong and 
sudden stimulation in early childhood, 
and follow-up studies on earlier inves- 
tigations (Landis and Hunt, 1937; Jer- 
sild and Holmes, 1935; Jones, 1924; and 
Hagman, 1932) have yet to be per- 
formed. Of the older studies, the Hag- 
man study is described here mainly be- 
cause of the method used to record 
responses. Hagman was interested in 
comparing mothers’ reports of their 
children’s fears with the children’s be- 
havior in a test situation. Reactions to 
a phonographic recording of thunder 
were recorded by means of motion pic- 
tures (16 frames per second). The 
camera and lights were in full view 
and pictures were taken as the child 
stood inside the doorway, about 15 
feet from the camera. A young woman 
brought the child into the laboratory, 
holding his hand throughout. As they 
stepped inside the room, she said: “We 
will wait here a minute.” The investi- 
gator ran the camera for five seconds, 
started the phonograph, and continued 
to take pictures for fifteen seconds or 
longer. Twenty-six days later the child 
was returned to the laboratory. Just 
before the camera started, the young 
woman momentarily let go of the 
child’s hand and backed out of the 
room. The films were analyzed in terms 
of behavior changes in eight different 
parts of the body (right hand, left hand, 
feet, head, eyes, etc.) by running 
through the film several times forward 
and backward. Extent of activity in 
each area was rated on a scale. Findings 
indicated a decrease in activity and an 
increase in glancing at the escort. 

A well-known study of the behavior of 
preschool children consequent to frus- 
tration produced by withholding rein- 
forcement is 
that by Barker, Dembo, and Lewin 
(1941). Nursery-school children were 
brought one at a time into a playroom 
containing run-of-the-mill toys. On the 
second visit the child found that one 
partition of the room had been re- 
moved, and in the new section was an 
attractive array of toys. The toys avail- 
able during the previous session had 
been mixed with the more alluring ob- 
jects. After the experimenter was con- 
vinced that the child had played with 
all the new toys, she collected the toys 
available during the first session and 
placed them where they had been 
originally. The child was then told to 
come and play with these toys. As he 
left the new toys, a wire-mesh screen 
was lowered between him and the new 
toys. Thus he was back with the toys 
used in the first session but in full view 
of the more elaborate toys. Observa- 
tional data were grouped into units of 
behavior that in turn were quantified 
on a 7-point constructiveness-of-play 
scale. It has been pointed out (Under- 
wood, 1957) that the results of the 
Barker, Dembo, and Lewin study could 
be accounted for on other grounds and 
that data on a control group with ini- 
tial and final play sessions but no inter- 
vening frustration would be helpful in 
clarifying this ambiguity. 

Changes in Complex Behavior. 
There is a group of experimental prob- 
lems centering around the modification 
of complex physical, social, or emo- 
tional behavior at this age level. The 
experimental design generally consists 
of equating two groups on some ex- 
perimental variable and matching them 
on several other dimensions, giving one 
training, and retesting both on the ex- 
perimental variable. A variation may 
include an additional period in which 
both groups receive treatment and ad- 
ditional testing. 
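
One way to form two groups equated on a pretest is to rank the children, pair adjacent ranks, and assign the members of each pair at random. The sketch below assumes this pairing rule, which is only one of several possible; the identifiers and scores are invented, and matching on additional dimensions such as age or sex is left out for brevity.

```python
import random
from statistics import mean

def matched_pairs(children, key, seed=0):
    """Rank children on the pretest, pair neighbors, and randomly split each pair.

    children: list of dicts; key: name of the pretest score field.
    Returns (training_group, control_group). With an odd count the
    lowest-ranked child is left unassigned.
    """
    rng = random.Random(seed)
    ranked = sorted(children, key=lambda c: c[key], reverse=True)
    training, control = [], []
    for i in range(0, len(ranked) - 1, 2):
        pair = [ranked[i], ranked[i + 1]]
        rng.shuffle(pair)  # chance decides which member of the pair is trained
        training.append(pair[0])
        control.append(pair[1])
    return training, control

# Hypothetical children and pretest scores.
scores = [12, 15, 9, 14, 11, 10, 16, 13]
children = [{"id": f"C{i}", "pretest": s} for i, s in enumerate(scores, start=1)]
training, control = matched_pairs(children, "pretest", seed=3)
print(mean(c["pretest"] for c in training), mean(c["pretest"] for c in control))
```

After training, both groups would be retested on the same variable and the gains compared.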

An example of a study of biological 
maturation and the learning of motor 
skills is one by Hilgard (1932). Two 
groups of children, aged 24 to 36 
months, were matched for chronologi- 
cal age, mental age, sex, and approxi- 
mate initial skill in climbing, cutting 
with scissors, and buttoning. The 
climbing task consisted of negotiating 
a three-step, 2½-foot ladder, stepping 
on to a table at the top, and then 
climbing down again. There were toys 
on the table “to attract and please the 
child.” The score was the average time 
taken to go up and come down. The 
buttoning test consisted of buttoning 
together standard strips containing but- 
tons and buttonholes. Scores were 
based on number of successes and time. 
The cutting-test task required cutting 
along a straight line. Ratings were based 
on ability to handle the scissors and 
accuracy in cutting. One group re- 
ceived twelve weeks of training with 
retests at two-week intervals. In ladder- 
climbing training, for example, the ex- 
perimenter tried to help the child im- 
prove his time and quality of perform- 
ance in a variety of ways, including 
putting one foot instead of two upon 
each rung of the ladder. At the end of 
twelve weeks, the control group re- 
ceived four days of intensive training. 
At this time the practice group made 
higher scores than the control group. 
After a brief period of training, how- 
ever, the control group reached the 
level of the practice group. 

Covariations in frustration-produced 
behavior and training are exemplified 
by the research of Keister and Upde- 
graff (1937) and Keister (1938). In the 
latter study two groups of children, 
aged 43 to 72 months, were matched 
on chronological age, sex, and intelli- 
gence. They were contrasted in terms 
of immature reactions to failure (e.g., 
retreating or giving up almost at once, 
crying, sulking) on a difficult puzzle 
and a task requiring considerable physi- 
cal strength for their developmental 
level. Training was restricted to the 
relatively immature group and con- 
sisted of introducing the child in easy 
stages to a graded series of jigsaw puz- 
zles and block-building tasks. The pro- 
cedures with the puzzles were flexible 
and informal. Praise and support were 
given, but no assistance. In the block- 
building tasks praise and encourage- 
ment were also freely distributed, but 
the child was left to work the prob- 
lems himself. Sessions were terminated 
with praise for the work accomplished. 
After completion of training both 
groups were tested again on a difficult 
puzzle similar to the original one. The 
trained group made significantly more 
“attempts to solve alone” and “interest” 
responses than they had in the original 
test, and there were fewer requests for 
help, less emotional behavior, and less 
escape behavior. The control group 
(relatively “mature” to start with) did 
not show any significant improvement. 

A parallel study, except that it dealt 
with social behavior, was conducted by 
Jack (1934). Children ranging in age 
from 45 to 56 months, with the lowest 
“ascendance” scores in a previous ex- 
periment, served as subjects. Their 
scores were derived from observations 
of behavior with each of five children 
in a sandbox with toys. The training 
group received individual treatment de- 
signed “to make their position more 
secure and to assure them of a certain 
degree of confidence.” Instruction and 
assistance were given in working on 
mosaics, picture puzzles, and a picture 
book. There were seven or more ses- 
sions, lasting from ten to fifteen min- 
utes. Following the sessions, each child 
was observed in four pairings—each 
time with a different, originally-more- 
ascendant child—in situations with the 
material used in training. About ten 
weeks after the initial ascendance tests 
the trained children were again paired 
with the same children as before for a 
final observation. They showed signifi- 
cantly greater gains in ascendance than 
the untrained group; the untrained 
children showed no changes between
the initial and the final test. 

These three studies obviously in- 
volve intricate interactions and a va- 
riety of reinforcers. They, and others 
like them, have explored relationships 
between performance and training that 
should now be submitted to more
systematic and detailed analysis. Just
what specific conditions in the training
have or have not had effects? How
precisely can these effects be
described? What other kinds of
reinforcement are effective in
promoting such behavior?

Later Childhood and Preadolescence


This section deals with studies of 
children between 6 and 9 (the first
four elementary school grades) and be- 
tween 10 and 13 (the last four grades). 
These age spans, sometimes referred to 
as later childhood and preadolescence 
or as middle and later childhood, are 
merged because of the similarity in ex- 
perimental techniques employed. Dif- 
ferences are discussed as they arise. 

Experiments in this age range involve 
more verbal stimuli and responses, have 
sessions extending over longer periods, 
include more situations in which the 
subject performs alone, and pose many 
difficulties in controlling verbal be- 
havior between and within sessions. 
Fewer experimental studies have been 
conducted on children in this age 
range. (There seems to be an inverse 
relationship between stage of develop- 
ment and number of experimental 
studies.) However, the indications are
that this situation gradually is being 
changed. 


OBTAINING SUBJECTS 


As would be expected, most subjects 
in this age group have been procured 
in public and private schools. The pro- 
cedure for obtaining permission to do 
research varies, but basically it con- 
sists of personal or written contact with 
the principal or superintendent, in 
which the nature, method, and implica- 
tions of the investigation, the time re- 
quired of each child and for the total 
study, and space needs are described. 
Frequently time and space problems 
arise. For example, the time of the ses- 
sions may have to be adjusted to class 
schedules. When a study requires the 
children to return for a series of ses- 
sions, arrangements may have to be 
worked out to compensate for lost 
time. This problem would be attenu- 
ated when working with preadolescents 
in a junior high school, in which par- 
ticipation might be limited to study- 
hour classes. The frequently occurring 
problem of inadequate working space 
may be overcome by the use of a mo- 
bile laboratory in the schoolyard or 
by taking the children to a research 
center. The latter involves the
responsibility for transportation, and for that
reason is often avoided. 

Some investigators have obtained 
children through parent contacts. Long 
(1958), for example, wrote letters tell- 
ing parents that he had received their 
child’s name and address at the school 
office and asking permission to have 
the child serve as subject. He described 
the research and pointed out that it
was a part of a university project. He 
also indicated what was not included. 
(“Lest you be concerned, let me as- 
sure you that the situation involves no 
competition between children, no psy- 
chological tests, or anything else that 
might be disturbing.”) Finally, infor- 
mation was given on appointments and 
transportation arrangements. Long’s 
approach has particular merit for re- 
search conducted during the summer 
months. 

Subjects have also been obtained 
through organizations such as boys’ 
clubs, community centers for youth, 
the Boy Scouts, and the Camp Fire 
Girls. For example, a settlement house 
was used to study aggressive behavior 
in experimentally created “social cli- 
mates” by Lewin, Lippitt, and White 
(1939) and cooperative behavior in 
pairs of children by Azrin and Lindsley 
(1956). Permission to conduct a study 
is ordinarily obtained both from the 
person in charge and the parents. 

Finally, large numbers of children in 
residential institutions have served as 
subjects. As reviews by McPherson 
(1948, 1958) have shown, many learn- 
ing experiments on the retarded child 
have been conducted in these settings. 
Since the method of requesting permis- 
sion and the problems encountered in 
doing research in institutions are simi- 
lar to those discussed in the parallel 
section on early childhood, no further 
comment is necessary. 


SAMPLING 


With one exception, problems of 
sampling are similar to those discussed 
in the early childhood section. But in
studies in which children are obtained 
in the public schools sampling tech- 
niques may be exercised to avoid bias 
in socioeconomic status, race, intelli- 
gence, and the like. Procedures for ob- 
taining random or representative sam- 
ples under these circumstances have 
been carefully described (Lindquist,
1940).


INVESTIGATIVE PROCEDURES 


As in the preceding sections, investi- 
gative procedures are divided into 
problems of inducements, instructions, 
intersession events, and setting condi- 
tions. Inducement procedures with 
children in this age span are as major 
a source of variance as in early child- 
hood. Some standard procedure is 
needed simply to deal with misconcep- 
tions about the experimenter and the 
experiment. For example, investigators 
working in a school or institution have 
reported that their subjects often be- 
lieve that the experimenter is a school 
psychologist and that the study is to 
find out whether certain children 
should be in a lower grade. A standard
procedure may also prove necessary in
explaining the nature of the study and
the reason for soliciting the child’s
cooperation. One workable procedure is
to take each subject into a waiting 
room and discuss all aspects of the 
study. Children who show persistent 
negative attitudes can be eliminated at 
this point if this bias is important.

Some investigators maintain that in- 
structions to children in this age range 
should be held to a minimum or even 
eliminated. For example, in a study of 
cooperation by Azrin and Lindsley
instructions consisted merely of telling
the subjects that they were going to 
play a game and showing them how 
parts of the game worked. The authors 
believe that an abbreviated procedure 
has the following advantages: 


(a) The initial acquisition of cooperation
can be studied, (b) subjects (S’s) that
learn by demonstration and instruction
with difficulty (i.e., infants, certain classes
of psychotics, and lower organisms) can
be studied, and (c) no problems involving
the effects of instructions upon the
behavior of the S’s are involved (1956, p.
100).


However, there are some experimen- 
tal situations with children that require 
instructions. In these instances they 
should be in the simplest possible terms 
and may well be followed by a period 
of practice. In a study of stimulus gen- 
eralization as a function of age Med- 
nick and Lehtinen followed this pro- 
cedure. 


In order to provide a behavior test of 
S’s understanding of the task, two brief 
demonstration trials were given in which 
the center lamp and a peripheral lamp 
were used as test stimuli. It was decided 
to procure this behavior indication of S’s 
understanding of the task despite the pos- 
sibility that the older children would learn 
more from these trials than the other 
children (1957, p. 180).


It will be recalled that some of the 
intersession events that have potential 
effects on experimental behaviors are 
instructions and admonitions from 
teachers, supervisors, and parents (“Be 
a good boy, and do your very best”) 
and discussions among the participants.
There are the added possibilities of 
transitory emotional reactions and 
variations in satiations and deprivations
relative to the experimental reinforcers. 
In an effort to deal with these factors, 
some investigators tell the adults in 
contact with the children what their 
role should be and how to deal with 
questions about the study. Individual 
and group discussions at the end of the 
study may be used to express
appreciation and to show how their
cooperation helped carry through the study.

Conversations among the participants 
may produce erroneous self-instructions,
misconceptions, emotional
predispositions, and the like, and are
indeed difficult to control. Several
approaches are possible. First, the
experimenter may try to determine whether
the child has any preconceptions about 
the study. For example, in a study of 
the relative efficacy of various types of 
stimulus-objects in the discrimination 
learning of first grade children Calvin 
and Clifford note: 


While walking to the testing room, S 
was asked such questions as “Do you know 
where we are going?” and “Do you know 
what we are going to do?” as a check on 
possible intercommunication between S's 
about the experiment. It was found that 
some of the S's had talked to each other 
about the game but not about the actual 
problem involved (1956, p. 104). 


Second, at the end of a run the ex- 
perimenter may ask the child not to 
tell anyone about the game until it is 
over, to keep it a secret (Hendrickson 
and Schroeder, 1941). It is, of course, 
difficult to estimate the effectiveness of 
either of these procedures. Third, the
experimenter may take these
precautions and also run a matched control
group with children living in homes
separated at some distance one from
the other. Fourth, the investigator may 
replicate the study with a repeated- 
measures design involving baseline 
techniques. 

Comments from teachers and parents 
and obvious changes in the child's be- 
havior suggesting an emotional state 
must be evaluated by the experimenter 
to determine whether the subject 
should be continued. The same is true
of changes in conditions influencing 
schedules or cycles of relevant satia- 
tions and deprivation. 

Holding constant the social aspect of 
the setting conditions of an experiment 
is potentially less of a problem with 
these children than with the younger 
ones. The problem may be completely 
eliminated when it is possible for the 
child to work alone. One deterrent to
eliminating the investigator may be the
lack of an adjoining room with a one- 
way screen and listening equipment. 
Another may be the need for the in- 
vestigator in the room to set up the 
problems, observe behavior, and record 
responses. Under both these conditions
it is suggested that inexpensive devices, 
such as a portable observation booth 
(Gewirtz, 1952), a one-way screen 
mounted on a table (House and Zea- 
man, 1958), or some sort of an opaque 
screen or partition (Bijou, 1957), be 
used. If social interactions are curtailed 
by these or like procedures and all 
verbal interactions are standardized, at 
least in tone, a considerable portion of 
uncontrolled variance will be reduced. 

As before, techniques for presenting 
stimuli and recording responses are dis- 
cussed in terms of representative 
studies in problem areas. 

Conditioning. There are relatively
few experiments on classical conditioning
problems with children between 6
and 13. A few have been conducted 
to observe differences in conditioning 
phenomena as a function of age (e.g.,
Mateer, 1918; Razran, 1935). Some have 
involved the classical conditioning 
method as an approach to other prob- 
lems. Riess (1940), for example, con- 
ditioned the galvanic skin reflex in 
children of 7 years and older to analyze 
generalization with respect to dissimilar 
sounding verbal concepts. One reason 
for this paucity of studies may be that 
satisfactory apparatus has not as yet 
been developed. For instance, there is 
no technique comparable for children 
under 12 to the eyelid conditioning 
method used extensively for young 
adults (Spence, 1956). 

In contrast to the few classical con- 
ditioning studies are the numerous in- 
vestigations on instrumental or operant 
problems in normal and deviant chil- 
dren. A project by Long (1958) is rep- 
resentative. He and his coworkers 
(Long, Hammack, May, and Campbell, 
1958) have explored methodology and 
problems of schedule interactions, re- 
inforcement, concurrent operants, and 
satiation and deprivation. The
experimental situation consisted of a console
in a 9 x 10 foot soundproof cubicle. 
The subject sat in front of the console 
table; directly ahead was a telegraphic 
key or plunger for responding, colored 
lights, a translucent screen on which 
pictures or other stimuli were pro- 
jected, and a tray for reinforcers. He 
was shown how to work the apparatus 
and given an opportunity to obtain
reinforcers. When schedules of
reinforcement were investigated, the subject was
also given an opportunity to respond
while reinforcement was being
withheld. After the preliminaries, the
experimenter told the subject he was
leaving and would return in a short
time. “During subsequent sessions such
instructions were unnecessary, the
child usually went directly to the
cubicle, closed the door, and started
working” (Long, 1958, p. 5). Responses
were obtained in graphic form by a
cumulative recorder. Trinkets, pennies,
and colored slides served as reinforcers.
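
What a cumulative recorder traces can be suggested by a short sketch in Python. The function name, the sampling step, and the response times below are illustrative assumptions; none of them is taken from Long's apparatus or data.

```python
from bisect import bisect_right

def cumulative_record(response_times, session_length, step=1.0):
    """Return (time, cumulative responses) pairs sampled every `step` seconds.

    response_times: seconds from session start at which a response occurred.
    """
    times = sorted(response_times)
    n_steps = int(session_length // step)
    points = []
    for i in range(n_steps + 1):
        t = i * step
        # Number of responses emitted at or before time t.
        points.append((t, bisect_right(times, t)))
    return points

# Hypothetical data: six key presses in a 10-second stretch.
record = cumulative_record([0.5, 1.2, 1.9, 4.0, 4.1, 9.5],
                           session_length=10, step=5)
```

A steep stretch of such a curve corresponds to a high response rate and a flat stretch to a pause, which is why the record is convenient for comparing schedules.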


Another operant technique study is 
one by Azrin and Lindsley (1956) on 
social behavior. Children between the 
ages of 7 and 12 were paired on the 
basis of age and sex. Each team of two 
was taken into the experimental room 
and asked to sit at opposite sides of a 
long table. In front of each child were 
three holes in the table and a stylus 
with “an extension cord.” A wire 
screen in the center of the table pre- 
vented each child from manipulating 
the stylus of the other. The investiga- 
tor oriented the children to the ap- 
paratus and left the room. Part of their 
task was to discover that when the 
styli were placed in opposite holes 
within 0.04 second of each other a 
red light flashed and a jelly bean came 
down a chute, falling into a cup ac- 
cessible to both children. Recording 
and programming apparatus was in an 
adjoining room. The first reinforcement
period was followed by an extinction
period and a second reinforcement
period. All pairs learned the
cooperative response and an equitable
way to share the candy. Response rate
increased during the reinforcement
periods and gradually decreased during
extinction. The authors give the fol- 
lowing reasons for studying coopera- 
tive behavior by this technique: “(a) 
a more continuous record of the
cooperative process is obtained, (b)
extraneous environmental variables are
minimized, and (c) relatively long pe- 
riods of experimental observation are 
possible” (p. 100). 
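
The cooperative-response criterion lends itself to a brief sketch. The 0.04-second window is the value reported above; the event format, the assumption that "opposite" holes share an index on the two sides of the table, and the rule that each insertion can be matched only once are illustrative choices, not details of the original programming apparatus.

```python
WINDOW_MS = 40  # 0.04 second, the criterion reported by Azrin and Lindsley

def cooperative_responses(events, window_ms=WINDOW_MS):
    """Return the times (ms) at which a cooperative response is completed.

    events: iterable of (time_ms, child, hole), child in {"A", "B"},
    hole in {0, 1, 2}. Assumption: opposite holes share the same index.
    """
    reinforced = []
    last = {}  # (child, hole) -> time of most recent unmatched insertion
    for t, child, hole in sorted(events):
        partner = "B" if child == "A" else "A"
        t_partner = last.get((partner, hole))
        if t_partner is not None and t - t_partner <= window_ms:
            reinforced.append(t)
            del last[(partner, hole)]  # each insertion counts once
        else:
            last[(child, hole)] = t
    return reinforced

# Hypothetical session fragment: only the first pair of insertions
# falls in opposite holes within the window.
events = [(1000, "A", 0), (1030, "B", 0),
          (2000, "A", 1), (2100, "B", 1),
          (3000, "B", 2), (3020, "A", 1)]
hits = cooperative_responses(events)
```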

Discriminative Processes. As in the 
previous developmental period, most 
experiments in this age span are con- 
cerned with discrimination problems. 
A study by Carter and Schooler (1949) 
is taken as an example because of the 
authors’ special emphasis on method- 
ology. The study was concerned with 
the influence of value on the percep- 
tion of children from contrasting so- 
cioeconomic backgrounds. The ap- 
paratus, patterned after Bruner and 
Goodman (1947), consisted of a metal 
box with a ground-glass face. At the 
opposite side of the box was a light 
source. Between the light and opening 
was a movable metal plate with a small 
hole. A turn of the knob on the side 
of the box moved the plate forward 
or backward, causing the size of the
disk of light on the ground glass to 
become smaller or larger. The diameter 
of the illuminated disk could be readily 
and accurately measured. A “poor” 
group and a “rich” group between the 
ages of 9 and 11 served as subjects.
Each was seated in front of the ap- 
paratus and shown how it operated. 
Using the method of average error, the 
investigator instructed the subject (1) 
to make the illuminated disk the size 
of a coin of each denomination, (2) to 
match the size of the disk with a penny 
in front of him (and then with a nickel,
dime, quarter, and also a half dollar), 
(3) to match the disk with aluminum 
disks the size of all five coins, and (4) 
to match the disk with cardboard disks 
the size of all five coins. Finally, a series 
of aluminum disks were shown, one at 
a time, and the subject was asked to 
say whether each was larger or smaller 
than a half dollar. The same procedure
was followed for the four other coins. 
On the basis of their findings and the 
literature, the authors conclude, 


The present results suggest that the 
value of a disc or the need associated with 
it is an important determiner of the judged 
size only when the stimulus object is 
equivocal or absent, as when judgments 
are made from memory (p. 207). 
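
In the method of average error the subject adjusts a variable stimulus until it appears equal to a standard, and the settings are averaged. A minimal sketch of the scoring follows; the function name and the sample settings are hypothetical, and expressing the error as a percentage of the standard is one common convention, not necessarily the authors' own.

```python
from statistics import mean

def average_error(settings, standard):
    """Mean signed error and percent deviation of the settings from the standard.

    settings: diameters the subject produced; standard: the true diameter.
    """
    constant_error = mean(settings) - standard
    percent_deviation = 100.0 * constant_error / standard
    return constant_error, percent_deviation

# Hypothetical settings (mm) against a 20 mm standard.
error, percent = average_error([21.0, 22.0, 20.0, 21.0], 20.0)
```

A positive value indicates overestimation of size and a negative value underestimation.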


Serial Learning. Techniques to 
study serial learning as such include 
manual, verbal, and temporal mazes, 
punch boards, and the like. An exam- 
ple of a maze study (Abel, 1936) is 
found in the section on motivation 
which follows. 

Norcross (1958) has described a de- 
vice for studying verbal and pictorial 
material in serial fashion. Her research
has been devoted to seeing whether 
findings on mature subjects apply to 
children. She has explored procedures 
and apparatus for comparable
experimentation with children. The
equipment consisted of a Hunter Card
Master,* a device which controls exposure
and interval between stimuli with high
reliability. Since it is also flexible, it
permits the investigator to present
either or both halves of a stimulus card.
Her investigations with kindergarten
and sixth-grade children demonstrated
that the method “is suitable at the
prereading age level and with
elementary-age school children.”

*Produced by the Hunter Manufacturing
Co., 930 So. Linn St., Iowa City, Iowa.

Problem Solving. An example of a 
problem-solving study in which per- 
formance was related to manipulating 
verbal instruction is one by Hendrick- 
son and Schroeder (1941). They were 
interested in varying the degree of ex- 
planation of the principle involved in 
solving a problem. Eighth-grade boys
from a suburban junior high school 
were divided into three groups, desig- 
nated as the control, experimental A, 
and experimental B. The task was to 
hit a standard rifle target with a BB 
air gun. In the first problem the target 
was submerged in water at a depth of 
6 inches. After three consecutive suc- 
cessful hits, the second task was pre- 
sented. This time the depth of the wa- 
ter was changed to 2 inches. The boys 
in the control group were merely told 
where to stand and the number of con- 
secutive hits they would have to make 
to show mastery of the problems. 
Those in experimental groups A and 
B were given an elementary explana- 
tion of refraction involving a diagram 
and a short text. They were allowed 
to study the material until they indi- 
cated they understood. No questions 
were permitted. The explanation for
group B contained an additional sen- 
tence stating the relationship between 
depth of water and distance between 
the object under water and the image 
of the object. Those in group B
required fewer trials to learn the first
task and to transfer to the second task,
whereas those in group A took fewer
trials to learn both tasks than the
control group.

Motivation. As noted earlier,
problems of the relative effectiveness of
reinforcers and the consequences of
satiation and deprivation on performance
are usually studied in situations
involving conditioning, multiple choice,
serial learning, or problem solving.
Several examples are given below.

A series of investigations by Screven 
(1958) utilized the work output of a 
restricted operant to evaluate various 
reinforcement conditions. The tech- 
nique, designed for children between 
4 and 7, required the child to sit in a 
chair and on signal to run down a 15- 
foot runway, obtain a marble or piece 
of candy on a table, and return to his 
seat. On occasions in which there was 
no reward object, he was to touch the 
table. By means of timers, measures of 
reaction time (time between onset of 
signal and standing up) and running 
time were obtained. Work output was 
controlled by fastening a rope to a 
harness fitted over the child’s shoulders 
so that the subject pulled against a con- 
stant weight. Ample opportunity was 
given the child to learn the task; most 
were trained in four or five trials. 
Studies were conducted to compare 
performances with and without
rewards and with increasing and
constant work loads.

Another study of motivation in an 
operant situation by Flanagan,
Goldiamond, and Azrin (1958) suggests an
interesting approach to aversive and 
escape training in stuttering behavior. 
Although the youngest subject was 15 
years of age, it is reported here because
the same technique may be applied to
younger children. The subject was 
seated at a table and asked to read 
aloud. Every time he stuttered, the in- 
vestigator pressed a switch, which acti- 
vated a marker. “When a curve of 
stuttering frequency considered smooth 
was obtained,” one of two thirty- 
minute training periods was begun. In 
aversive training every depression of 
the switch produced a one-second blast 
of a 6000-cycle tone at 105 decibels in 
the subject’s earphones. In escape train- 
ing the same stimulation was constantly 
present, and every activation of the 
switch interrupted the tone for five 
seconds. Training on either condition
was followed by a thirty-minute
period without training. A check was
made on the reliability of the
experimenter’s judgments. Each subject was
seen on two consecutive days, each 
day for a different schedule, e.g., on 
day 1 subject one had a baseline pe- 
riod, an escape training period, and a 
period of no escape training; on day 
2 he had a baseline period, an aversive 
period, and a period of no aversive 
training. The order was reversed for 
the second subject, etc. The findings 
suggest that escape training increases 
the rate of stuttering and aversive 
training decreases it. 
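
The counterbalanced order of conditions can be sketched as follows. The function name is hypothetical, and the strict alternation of order across successive subjects is an assumption about what "etc." implies.

```python
def day_schedule(subject_index):
    """Periods for day 1 and day 2; the condition order alternates by subject.

    subject_index: 0 for the first subject, 1 for the second, and so on.
    """
    if subject_index % 2 == 0:
        order = ("escape", "aversive")
    else:
        order = ("aversive", "escape")
    return [["baseline", f"{c} training", f"no {c} training"] for c in order]

first_subject = day_schedule(0)
second_subject = day_schedule(1)
```

Alternating the order in this way keeps any effect of the first day's training from being confounded with one condition only.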

Few studies are concerned with the 
operations of satiation and deprivation 
on the performance of children, espe- 
cially in relation to social reinforce- 
ment. Hence a study by Gewirtz and 
Baer (1958) is of particular interest. 
Their objective was to study the two 
operations in relation to approval
(“Good,” “Hm-hmm,” “Fine”) given 
subjects were first-grade boys and girls
from a laboratory elementary school. 
They were divided into three groups. 
Subjects in the deprivation group were 
left alone in the experimental room for 
twenty minutes before starting the task.
Those in the nondeprivation group 
started the experiment without delay, 
and those in the satiation group re- 
ceived considerable praise and admira- 
tion while drawing pictures and cut- 
ting out designs during the twenty- 
minute waiting period. (An attempt 
was made to give exactly 30 reinforcers 
during that period.) The experimental 
task was to drop marbles into either of 
two holes of a toy while the experi- 
menter sat alongside and observed. 
“The E observed S’s play for a ‘base- 
line’ period of 4 minutes during which 
no reinforcers were dispensed.” Imme- 
diately after this there was a ten-minute 
period in which social reinforcers were 
given in successively higher fixed ra- 
tios following a “correct response” 
(which was defined as that preferred 
least during the last minute of the base- 
line period). They found that the re- 
inforcing effectiveness of approval was 
greatest after deprivation and least 
after satiation. 
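
The two rules in this procedure, choosing the least preferred hole as the "correct response" and dispensing approval on successively higher fixed ratios, can be sketched as below. The ratio values and the number of reinforcers given at each ratio are hypothetical, since they are not stated here, and the function names are illustrative.

```python
from collections import Counter

def least_preferred(baseline_choices, holes=("left", "right")):
    """Hole chosen least often in the scored baseline interval."""
    counts = Counter(baseline_choices)
    return min(holes, key=lambda h: counts[h])

def reinforced_responses(choices, correct, ratios=(1, 2, 3), per_ratio=2):
    """Indices of responses that earn approval under ascending fixed ratios.

    ratios and per_ratio are hypothetical values. After `per_ratio`
    reinforcers at one ratio, the schedule moves to the next ratio.
    """
    reinforced = []
    stage = 0   # index into ratios
    run = 0     # correct responses since the last reinforcer
    earned = 0  # reinforcers delivered at the current ratio
    for i, choice in enumerate(choices):
        if choice != correct:
            continue
        run += 1
        if run == ratios[stage]:
            reinforced.append(i)
            run = 0
            earned += 1
            if earned == per_ratio and stage < len(ratios) - 1:
                stage += 1
                earned = 0
    return reinforced

correct = least_preferred(["left", "left", "right", "left"])
approved = reinforced_responses(["right"] * 10, correct)
```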

A well-known serial-learning study 
on the relative efficacy of classes of 
reinforcers is one by Abel (1936). Her 
subjects were boys between the ages 
of 9 and 10 from lower middle-class 
homes, whose fathers were in the oc- 
cupational classes of skilled laborers, 
small shopkeepers, etc. They were di- 
vided into five groups, each to receive 
different rewards in the learning of a 
finger maze with ten choice points.
After instructions and practice on a 
simpler maze of the same kind, differ- 
ential training was initiated. Group 1 
received no experimental reward at the 
end of each run. Group 2 received a 
penny each time. Group 3 received 
social reinforcement and were encour- 
aged to improve their scores. (“Good,” 
“Very good,” “Let me see if you can 
make even fewer mistakes this time.”) 
Group 4 received no experimental re- 
ward during the first 15 trials and a 
penny after each one thereafter. Group 
5 received the same kind of social rein- 
forcement given group 3 during the 
first 15 trials and a penny after each
trial from trial 16 on. Analysis was
made of the number of errors as a 
function of trials and incentive. Ma- 
terial reward (pennies) alone or in 
combination with earlier training of no 
reward or verbal reward resulted in 
fewer errors than verbal reward or no 
experimental reward conditions. 
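
The five incentive conditions can be written out as a simple rule. The function name and labels are hypothetical, and the rule for group 5 assumes that pennies replaced verbal reward from trial 16 onward, in parallel with group 4.

```python
def reward(group, trial):
    """Reward scheduled at the end of a maze run (trials are numbered from 1)."""
    if group == 1:
        return "none"
    if group == 2:
        return "penny"
    if group == 3:
        return "praise"
    if group == 4:
        return "none" if trial <= 15 else "penny"
    if group == 5:
        # Assumed reading: praise through trial 15, pennies from trial 16 on.
        return "praise" if trial <= 15 else "penny"
    raise ValueError("group must be 1 to 5")
```

Laid out this way, groups 4 and 5 differ from group 2 only in what preceded the pennies, which is the comparison the design was meant to permit.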

Examples of motivational studies in 
which multiple-choice techniques were 
used, such as Terrell and Kennedy’s 
(1957) and problem-solving proce- 
dures, such as Kendler, Kendler, Plis- 
koff, and D’Amato (1958), could be 
cited. For a review of the use of
tangible positive reinforcers with a variety
of techniques with young children, see 
Bijou and Sturges (1959). 

Frustrating, Sudden, or Strong 
Stimulation. A technique for studying
the physiological consequences of
frustration was used by Jost (1941).
His subjects were normal children 
from an elementary school and chil- 
dren with adjustment problems from 
a special school. The task was to learn 
a series of numbers by the anticipation
method. Numbers with digits from 1
to 5 were flashed on a lighted panel,
and the subject was allowed to learn 
two series successfully. Then a series 
apparently like the others was presented 
but was too difficult to recall.
Frustration was said to occur when the subject
discovered that he could not learn a
series that was very similar to the ones
he had formerly learned easily. The
measures taken were galvanic skin
response, respiration, pulse rate, blood
pressure, hand tremor, gross move- 
ments, and brain potentials. Brain- 
potential measures were taken by a 
crystalline recording method; the 
others were recorded photographically. 

A method for analyzing conflict
behavior is described by Barker (1942).
The experiment was concerned with 
time factors and vicarious trial-and- 
error behavior (VTE) produced by 
forced choices between liquids of simi- 
lar and dissimilar desirability and with 
real (the subject had to drink the liquid 
chosen) and hypothetical consequences 
(the subject indicated the one he 
would drink if required). Seven liquids 
were used: (1) orange juice, (2) pine- 
apple juice, (3) tomato juice, (4) wa- 
ter, (5) lemon juice, (6) vinegar, and 
(7) saturated solution of salt water. 
The liquids were placed, by the method 
of paired comparisons, in a preference 
series for each subject individually. The 
subjects were boys, 9 to 11 years old, 
all of whom could read the names of 
and were familiar with the seven liq- 
uids. Each received some money for 
participating in the study. The ap- 
paratus and procedure were as follows: 
each subject was seated in a chair at 
a small table. “At a suitable distance
and height,” the names of the liquids 
were exposed side by side at the same 
height in an exposure apparatus; “at a 
convenient reaching distance,” a lever
was mounted 2 inches below the sur- 
face of the table and extended 6 inches 
above the top of the table and was so 
placed and weighted that it came to 
rest in a vertical position on a line mid- 
way between the cards in the exposure 
apparatus. The lever could be moved 
easily to the right or left from the
vertical position in arcs of 5 inches. When
moved as far as possible to either side, 
a circuit operating a buzzer was auto- 
matically closed. The subject indicated 
the liquid of his choice by moving the 
lever to the appropriate side. Move- 
ments of the lever from the vertical 
of less than the 5-inch maximum that 
sounded the buzzer did not “count,” 
and the subject was free to reverse the 
movement at any point within the 
limits of these arcs. The lever was re- 
turned to the central position before 
each exposure. On a continuously mov- 
ing tape, a record was made of the 
time of exposure of the cards and of 
all the movements of the lever, thus 
providing a record of the time from 
the moment of the occurrence of the 
conflict until its resolution and of the 
VTE behavior occurring. Every pair 
of cards was presented twice in each 
sequence of conflicts; their positions
were reversed on the two exposures to
avoid the influence of position. The
data for a particular choice in a se- 
quence consisted of the average of the 
two determinations. 
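
The pairing scheme can be sketched briefly. The seven liquids are those listed above; the function names are hypothetical, the `choose` callback merely stands in for a subject's judgment, and ranking by number of times chosen is one simple way of forming a preference series, not necessarily Barker's. In practice the order of exposures would also be varied, which the sketch does not attempt.

```python
from itertools import combinations

LIQUIDS = ["orange juice", "pineapple juice", "tomato juice", "water",
           "lemon juice", "vinegar", "salt water"]

def exposures(items):
    """Every pair twice, with left and right positions reversed the second time."""
    pairs = list(combinations(items, 2))
    return pairs + [(right, left) for left, right in pairs]

def preference_order(items, choose):
    """Rank items by how often each is chosen across all pairs.

    choose(a, b) returns the preferred member of the pair; it is a
    hypothetical stand-in for the subject's choice.
    """
    wins = {item: 0 for item in items}
    for a, b in combinations(items, 2):
        wins[choose(a, b)] += 1
    return sorted(items, key=lambda item: -wins[item])

# Hypothetical subject who always prefers the liquid listed earlier.
def prefers_earlier(a, b):
    return a if LIQUIDS.index(a) < LIQUIDS.index(b) else b

schedule = exposures(LIQUIDS)
ranking = preference_order(LIQUIDS, prefers_earlier)
```

Seven liquids give 21 pairs, so a full sequence with reversed positions contains 42 exposures.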

A study concerned with the
physiological reactions to the sudden
stimulation resulting from a loss of support
was conducted by Ray (1932). The 
subjects were boys selected “at ran- 
dom” by their teachers and the prin- 
cipal. They ranged in age from 7 to 
12 years and in IQ from 73 to 144. 
Loss of support was accomplished by 
a 2-to-3-inch vertical drop in the seat 
of a chair. The procedure involved (1) 
seating the child in the experimental 
chair and giving him a Stanford-Binet 
Intelligence Test, (2) adjusting the 
apparatus for obtaining circulatory and 
respiratory measures, (3) making two 
or three preliminary runs to check on 
the functioning of the apparatus, and (4) 
making the test run after a baseline
period of five minutes. Reactions were
recorded by a Sumner pneumograph and
a modified form of the Erlanger sphyg- 
momanometer connected with Marey 
tambours to record on smoked paper. 
Analysis was made by comparing per- 
formance during the baseline period 
and following the sudden drop. Re- 
garding the feasibility of the method, 
Ray said, 

Experiments on the physiological
reactions of children of school age can be
carried on, although under greater
difficulties than are encountered with adults
(1932, p. 116).


“Physical growth” is a commonplace
synonym for anatomic ontogenesis. 
Anatomic ontogenesis encompasses all 
of the structural modifications that a 
biologic organism undergoes during 
ontogeny. It includes the gamut of
changes sometimes subgrouped under 
such captions as morphologic
differentiation, dimensional growth,
proportional development, and structural
maturation (Meredith, 1957a).

As a facet of the field of child de- 
velopment, anatomic ontogenesis per- 
tains to study of the human organism 
during the first two decades of its in- 
dividual life span. Exploration of this 
facet involves studying the child from 
zygote to young adult with regard to 
sequential changes in the kind, num- 
ber, size, shape, position, and composi- 
tion of his body components. 

At the laboratory level, investigators 
of anatomic ontogenesis are seen iden- 
tifying the structural components of 
organisms by kind, measuring their
size, describing their form, counting
their number, observing their location,
and examining their constituents. Why
are these tasks undertaken and how are 
they performed? Although this chapter 
is written to deal principally with the 
how part of the question, the why part 
should be recognized as of prime im-
portance. In responsible research a clear
purpose is the star to which a wagon 
of efficient method is hitched. 

Research methods should subserve
the general objectives of studying ana-
tomic ontogenesis in children. Among
the broad purposes of scholars working 
in this field are the following: (1) de- 
scription of specific aspects of struc- 
tural change in different cells, tissues, 
organs, and body regions, (2) system- 
atization of accumulated series of 
morphologic findings in ways to reveal 
principles of human ontogenesis, (3) 
discovery of variables that modify 
physical growth and exploration of 
each variable with respect to time of 
impact, direction of influence, and 
magnitude of depressing or accelerat- 
ing effect, (4) investigation of relation- 
ships between sectors of the structural 
facet of child development and sectors 
of other facets of child development, 
(5) comparison of the developmental 
anatomy of children and other animal 
organisms, (6) provision of normative 
materials on physical growth to meet 
clinical, health education and other 
service needs, and (7) contribution to 
a cumulative understanding of the at- 
tributes of desirable anatomic onto- 
genesis and of the conditions that will 
produce them (Meredith, 1953b).

Methods should also subserve the
particular aims of a given investigation. 


At its growing edge, research centers 
upon well-formulated and pertinent 
questions, explicitly expressed and test- 
able hypotheses, clearly specified 
and solvable problems. Methods have 
the function of high-grade tools that 
may be devised, modified, and utilized 
to answer thoughtfully considered 
propositions relating to the anatomic 
development of children. Ontogenetic 
knowledge increases in fitting stride as 
techniques are kept subordinate to 
ideas, as methods play a supporting role 
to problem perspicacity. 

The foregoing paragraphs deliber- 
ately emphasize the perspective that 
adequate qualification as an investiga- 
tor of physical growth entails far more 
than a reasonable mastery of the meth- 
ods used in the area. It comprises con- 
tinuing re-examination of what ana- 
tomic ontogenesis encompasses, the 
reasons it should be studied, and the 
ways it can be explored with greater 
effectiveness. The competent investi- 
gator engages in day-by-day integra- 
tion of activities in the library and the 
laboratory. These integrative efforts
ferment an ongoing interplay of crea- 
tive thinking about frontier problems 
and selective utilization of promising 
methods for their study. 

From here forward, the scope of this 
chapter is delimited to a discussion of 
research methods in the study of ana- 
tomic ontogenesis during the first 
twenty years of the human life cycle. 
It would be unrealistic to expect the 
discussion to achieve the feat of deal- 
ing with the topic comprehensively and 
exhaustively. This follows from no 
more than cursory contemplation of 
the task of colligating all relevant
methodology with respect to anthro-
pometry, radiography, photography,
vital staining and implanting, prepara-
tion of histologic slides, preservation of
nonliving organisms, anatomic dissec-
tion, cast preparation, apparatus con-
struction and calibration, sampling
theory, biostatistics, experimental de-
sign, and the preparation of scientific
reports. Although the treatment nec-
essarily is selective, it aims at repre-
sentative breadth and meaning.

Primary consideration is given to the 
acquisition of data for investigating 
structural development in children.
The following questions indicate the 
range of subtopics chosen for inclu- 
sion. From what sources are data 
drawn? For what purpose is a given 
source unique or superior to alterna- 
tive sources? What types of data are 
gathered; what types, derived? What 
instrumental aids are available for use 
in obtaining source materials and se- 
curing data? How much training is re- 
quired to obtain satisfactory proficiency 
in various methods of data collection? 
What factors influence the validities of 
different series of data? In what ways 
do research on procedures in obtaining 
data and standardization of method 
facilitate the study of problems of 
structural change? 

The substantial treatment of data col- 
lection is augmented by a brief section 
on the subdivision of data for statistical 
analysis. This section illustrates the dif- 
ferent ways in which subgroups are 
formed and shows the interdependence 
of grouping procedure and research 
outcome. 

SOURCES OF DATA

The data employed in studying on- 
togenetic changes in body structure are 
drawn from a variety of sources, e.g.,
from living subjects and cadavers, from 
photographs and roentgenograms, from 
silhouettes and contour tracings, and 
from histologic slides and plaster casts. 

It is a principle of scientific method 
that research data should be amassed, 
using the most parsimonious medium 
for attaining the needed degree of ex- 
actitude. This is a basic reason for 
studying many problems in terms of 
data obtained directly on the living or 
the nonliving child. If it were an in- 
vestigator’s purpose to describe vari- 
ability of head width for children of 
a given age, measuring the subjects’ 
heads directly would be more eco- 
nomical, and for most purposes no less 
valid, than obtaining roentgenograms 
of the head and measuring them (Potter 
and Meredith, 1948). Similarly, the 
principle of parsimony would require 
direct inspection of the oral cavity (in 
preference to dental cast preparation 
and inspection) when an investigator’s
problem called solely for data from
counting the number of erupted teeth,
and direct observation of the axillary
region (in preference to photographic
reproduction and observation) when
the problem called solely for data from
rating the quantity of pigmented hair
in this region.

Indirect sources (plaster casts, roent- 
genograms, and so forth) supply the 
only data sufficiently valid for the 
study of certain problems, e.g., deter- 
mining the extent to which rate of 
change in width of the lower face dur-
ing childhood shows oscillation and in-
dividual variation (Newman and Mere-
dith, 1956). With respect to other
problems, indirect source materials 
constitute the sole avenue of investiga- 
tion available. An example is the study 
of childhood differences in increase of 
muscle tissue in selected body regions 
(Stuart, Hill and Shaw, 1940; Maresh,
1948).

In research programs in which, for
purposeful reasons, roentgenograms,
photographs and/or casts are being ob-
tained, the investigator may have a
choice of sources that will yield iden- 
tical data. For instance, when dental 
casts are being made as a source of 
tooth measurements that cannot be 
taken in the mouth conveniently or 
with satisfactory reliability, one has the 
option of accumulating data on tooth 
eruption either by direct examination 
of each child’s dental arches or by in- 
spection of the casts. 

The Living Child. Human anatomic 
ontogenesis takes place in individuals 
who live and move and acquire stand- 
ards of modesty. Respiratory and di- 
gestive functions, postural changes, and 
reluctance to submit some parts of the 
body for examination all give rise to 
methodologic problems in using the 
living child as a source of research data. 
Problems pertaining to the child’s will- 
ingness to cooperate usually can be re- 
solved by taking sufficient time to de- 
velop rapport, remaining sensitive dur- 
ing examinations to cues from the child, 
and exercising professional tact in the 
collection of data. Some of the prob-
lems that arise from involuntary physi-
ologic functioning can be only partly
surmounted. In determining the body
weight of a child, for instance, it is 
considerably more feasible to remove 
or standardize the extraneous material 
carried as clothing (Clark, 1930; Sum- 
ner and Whitacre, 1931) than to re- 
move or standardize the extraneous ma- 
terial carried as urine and feces. 

Townsend (1887) reported (1) 
weighing the contents of the bladder 
and bowels of a stillborn infant and 
(2) weighing four live infants imme- 
diately following birth and again after 
washing. He estimated that the extra- 
neous urine, meconium, and vernix 
caseosa added 0.1 kilogram* to the
weight of the neonatal organism. Grif- 
fith (1899) listed as factors that may 
spuriously influence weekly weight 
gains in infancy the time and quantity 
of feeding, the emptying of the bladder 
and bowels, the amount of exercise and 
respiration, and the metabolic changes 
going on during sleep. 

Three preschool children were 
weighed by Palmer (1930) at approxi- 
mate hourly intervals during the day 
for periods of three to five days. He 
found that children typically gain in 
weight during the day, attain a maxi- 
mum weight after the evening meal, 
lose weight during the night, and reach 
a minimum weight in the morning after 
discharge of the accumulated excreta. 
Diurnal fluctuation varied for different 
children and for the same child on dif- 
ferent days. Sumner and Whitacre 
(1931) weighed 58 elementary school 
children nude and then reweighed 
them following the request that they
urinate. After urination, 50 per cent
were 0.1 kilogram or more lighter and
17 per cent were 0.2 to 0.5 kilogram
lighter. During a six-month period
Curtiss (1898) obtained nude weights
for three young adults just before they
retired at night and as soon as they
arose in the morning. There was no
food intake and no urination or defeca-
tion between the two weighings. Aver-
age loss during the night was 0.4 kilo-
gram, with variation in loss extending
from 0.2 to 0.7 kilogram.

* It is customary to use the metric system in
biologic research: 1.0 kilogram = 2.20 pounds;
1.0 centimeter = 0.39 inch.

McKay and Patton (1935), in a study 
on food habits and physical develop- 
ment of preschool children, obtained 
nude weights at monthly intervals, al-
ways having the child empty his blad-
der and always weighing at the same
hour of the day. Clayton (1944), in a 
study on food habits and physical con- 
ditions of elementary-school children, 
requested each subject to urinate be- 
fore being weighed, determined weight 
with the subject wearing only a one- 
piece gown, and subtracted the weight 
of the gown. 
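
The weighing controls described above amount to simple arithmetic. The following minimal Python sketch subtracts a known garment weight from a clothed weighing, in the manner reported for Clayton (1944). The function name, parameter names, and example figures are hypothetical and are not drawn from any cited study.

```python
def net_body_weight_kg(gross_kg: float, garment_kg: float) -> float:
    """Return body weight after subtracting the weight of a standard garment.

    Both arguments are in kilograms. The result is rounded to 0.01 kilogram,
    an assumed scale precision chosen only for illustration.
    """
    if garment_kg < 0 or gross_kg < garment_kg:
        raise ValueError("garment weight must be non-negative and no larger than gross weight")
    return round(gross_kg - garment_kg, 2)


# Invented example: a child weighed in a one-piece gown of known weight.
weight = net_body_weight_kg(gross_kg=30.45, garment_kg=0.25)
print(weight)
```

A correction of this kind removes only the clothing artifact; the contents of the bladder and bowels still have to be controlled by procedure, as the studies above did.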

Beyond the methodologic difficulties 
of obtaining the weight of a living 
organism lies the biologic perspective 
of body weight as a gross over-all meas- 
ure that aggregates varying tissue repre- 
sentations in slender, muscular, and 
obese children of a given age (Stuart, 
Hill, and Shaw, 1940) and disregards 
change in tissue proportions with age 
(Reynolds, 1948). In the phraseology 
of Scammon (1929), a weight increase 
of 1.0 kilogram in an infant represents 
a gain in brain substance and viscera, 
together with some enlargement of 
other parts of the body; in an ado- 
lescent, it is primarily an increase in 
bone and muscle; whereas in the middle-
aged it is often an accumulation of
adipose tissue. 

Next to body weight, the structural 
trait most frequently studied on the 
living child is stature. Before stature 
can be determined, the head and lower 
limbs must be placed in fixed positions 
with reference to the trunk. The head 
is positioned so that the lowest point 
on the border of the bone socket of the 
left eye and the highest point on the 
anterior margin of the tragus of each 
ear* lie in a plane at right angles to the
long axis of the trunk; the lower limbs
are extended and parallel, with the feet
at right angles to the long axes of the
limbs. When the head, trunk, and
lower limbs are in this relationship,
stature is the projected (straight-line)
distance from the top of the head to the
plantar surfaces of the feet.

Before the child is able to stand,
stature is taken with the body supine;
at later ages, it is obtained with the body
erect and/or supine. It is common to
refer to stature in the erect orientation
as standing height, or body height, and
stature in the recumbent orientation as
crown-heels length, vertex-soles length,
or total length (Krogman, 1950).†

* The tragus is the cartilaginous projection
before the entrance of the external acoustic
meatus.

† Stature and total length are not equivalent
terms in early prenatal life. From the time a
long axis of the organism is apparent (ap-
proximately two weeks after fertilization)
until the middle of the second prenatal month
the largest rectilinear dimension of the body
is stem length (length of the head, neck, and
trunk). It is not until the second half of the
second month that the heels form and the
lower limbs increase sufficiently for stature
to surpass stem length.

Early studies of individual children
by Bradford (1883), Shinn (1893) and 
Hall (1896) found stature “horizontal” 
to exceed stature “vertical” by 1.1 centi- 
meter in a girl aged 1 year, 1.5 centi- 
meter in a boy aged 1.5 years, 1.8 
centimeter in a child aged 3.5 years, 
and 2.3 centimeters in an adolescent 
aged 15 years. Studies of the same prob- 
lem on large samples of children have 
been made by Wilson et al. (1930), 
Palmer (1932), and Vickers and Stuart 
(1943). The Vickers-Stuart investiga- 
tion, based on more than 2000 children 
between the ages of 2 and 10 years, 
shows that on the average stature erect 
is 1.5 centimeter less than stature 
supine. The Palmer report, covering 
the postnatal period of 18 months to 
20 years and utilizing over 1000 subjects, 
includes tables of erect and supine 
equivalents for each centimeter of 
stature from 80 to 180 centimeters. 
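
The average difference reported above can be applied as a rough group-level adjustment. The sketch below, in Python, converts a supine measure to an approximate erect equivalent using the 1.5 centimeter mean attributed to Vickers and Stuart (1943). The names are hypothetical, and the restriction to ages 2 through 10 years simply mirrors the sample described; for work on individual children, tabled equivalents such as Palmer's would be the more defensible source.

```python
# Mean excess of supine over erect stature, in centimeters, as quoted in the
# text for children between 2 and 10 years (Vickers and Stuart, 1943).
MEAN_SUPINE_MINUS_ERECT_CM = 1.5


def approx_erect_from_supine(supine_cm: float, age_years: float) -> float:
    """Estimate erect stature from supine stature using a single mean offset.

    This is a group average, not an individual correction, and it is applied
    only inside the age range covered by the quoted study.
    """
    if not 2 <= age_years <= 10:
        raise ValueError("mean offset quoted only for ages 2 to 10 years")
    return supine_cm - MEAN_SUPINE_MINUS_ERECT_CM


# Invented example: a 6-year-old measured supine at 110.0 centimeters.
erect = approx_erect_from_supine(110.0, 6)
print(erect)
```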

From measurement of a young adult, 
it was discovered prior to 1780 that 
stature is greatest following recumbent 
sleep and decreases after arising and 
with prolonged exertion (Scammon, 
1927a). About a century later, Malling- 
Hanson (1886) studied a group of 70 
boys and confirmed diurnal stature 
variation for the school-age segment of 
the human life cycle. Investigations on 
preschool children have been made in 
the present century. Taken collectively, 
the accumulated studies support the 
generalization that at all ages beyond 
2 years ambulant human beings typi- 
cally are 2.0 centimeters taller when 
they assume the erect position in the 
morning than when they return to the 
recumbent position at night (Redfield 
and Meredith, 1938). 

Greater reduction in stature occurs
during the first hour after arising than 
during any succeeding hour. On infants 
6 months to 1 year of age, Hejinian and 
Hatt (1929) found an average decrease 
in body length of 0.8 centimeter after 
sitting for one hour and 1.1 centimeter 
after standing for one hour. Similar 
losses on children of preschool age 
have been found by Palmer (1930) 
during the first hour after getting up 
in the morning and by Redfield and 
Meredith (1938) during the first hour 
after an afternoon nap. In the former 
study the average losses were 0.7 centi- 
meter over the first hour and 0.4 
centimeter over the second hour, mak- 
ing a combined loss during the first 
two hours of fully one half the daily 
total. The Redfield-Meredith study 
obtained an average gain from a two- 
hour afternoon nap period of 1.2 centi- 
meter and an average loss for the first 
hour after arising from the nap of 0.8 
centimeter. 
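
The statement that the first two hours account for fully one half of the daily total can be checked with a few lines of arithmetic. This Python sketch assumes that the typical daily loss is the 2.0 centimeters given in the generalization credited to Redfield and Meredith (1938), and it uses the hourly averages quoted from Palmer (1930); the constant and function names are hypothetical.

```python
FIRST_HOUR_LOSS_CM = 0.7      # average loss over the first hour after rising
SECOND_HOUR_LOSS_CM = 0.4     # average loss over the second hour
TYPICAL_DAILY_LOSS_CM = 2.0   # assumed daily total, from the generalization in the text


def share_of_daily_loss(hourly_losses_cm, daily_loss_cm):
    """Return the fraction of the daily stature loss covered by the given hours."""
    if daily_loss_cm <= 0:
        raise ValueError("daily loss must be positive")
    return sum(hourly_losses_cm) / daily_loss_cm


share = share_of_daily_loss(
    [FIRST_HOUR_LOSS_CM, SECOND_HOUR_LOSS_CM], TYPICAL_DAILY_LOSS_CM
)
print(f"first two hours: {share:.0%} of the daily loss")
```

Under these assumptions the combined loss is 1.1 centimeter, a little over half of the daily figure, which agrees with the wording of the text.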

It follows that in investigating growth 
problems pertaining to stature diurnal 
and postural variation should be con- 
trolled and the controls specified. To 
disregard whether a child in nursery 
school is measured sometimes before 
his rest period and sometimes immedi- 
ately following a nap (assuming com- 
parable orientation for measurement) is 
to confound his seriate record with 
artifacts as large as the mean increase 
in stature for two months. Macy and
Kelly (1957), in a study of chemical 
and anatomic ontogenesis, measured 
children supine immediately on their 
waking in the morning. Whitacre 
(1935), in a study of seasonal variation 
in stature, measured each subject on 
successive occasions in the erect posi-
tion and at a constant hour of the school
day. Newcomer and Meredith (1951), 
in a study of body size of adolescent 
boys, measured stature erect in the 
afternoon between 1:00 P.M. and 3:90 
P.M. 

Why is stature influenced by the 
orientation and condition of the am- 
bulant child’s body? The increase with 
change from a standing position to 
dorsal recumbency seemingly is 
brought about by the reduction of body 
curvatures (particularly those of the 
upper and lower back) and muscular 
relaxation. The decrease with sustained 
assumption of erect and sitting positions 
probably results from compression of 
the intervertebral disks, closer apposi- 
tion of the bones of the lower limbs, 
and reduction of muscle tonus. 

Another group of kinesiological and 
physiological problems is faced in us- 
ing the living child as a direct source 
for measures of shoulder breadth and 
girth of the trunk and limbs. Maxi- 
mum girth of the leg in the calf region 
is not the same (1) as a child stands 
with his weight distributed equally in 
both lower limbs and (2) as he sits 
with his legs hanging over the edge of 
a table. Girth of the upper limb mid- 
way between the shoulder and the 
elbow is not the same (1) when a child’s 
arm is raised and his biceps muscle con- 
tracted and (2) when the arm hangs at 
his side in a relaxed state. The human 
shoulder girdle is highly mobile and 
not readily placed in the same posi-
tion on successive occasions. Girth of
the thorax alters with inspiration and 
expiration, elevation of the shoulders, 
tipping of the head, and orientation 
of the trunk to the force of gravity.
Girth of the abdomen varies with time 
and quantity of food intake, especially 
in infancy, and is difficult to bring 
under control, since children are able 
to produce marked fluctuations by 
voluntary contraction of abdominal 
muscles.

Far more data for studying anatomic 
ontogenesis have been obtained from 
direct measurement of the living child 
than from any other source. This is 
shown readily by reference to research 
bibliographies, reviews, and syntheses. 
Krogman (1941), in displaying the ex- 
tent to which body size and form are 
related to racial, geographic, socioeco- 
nomic, dietary, and other variables, 
assembled more than 1000 tables based 
on direct study of children. A Children’s 
Bureau (1927) publication lists over 
500 studies on age changes in body 
weight. More than 60 investigations, 
employing data obtained directly on 
children, are colligated in a review of 
North American research for the 
period 1850-1900 (Meredith, 1936). 
There are syntheses of direct measure- 
ment research drawing from 130 studies 
on growth in stature during infancy 
(Meredith, 1943), 70 studies on the as- 
sociation between body size and birth 
order (Meredith, 1950), and 30 to 
50 studies each on the development of 
head width (Meredith, 1953a), head 
girth (Meredith, 1946b), face width 
(Meredith, 1954), upper limb length
(Meredith, 1947) and foot length
(Meredith, 1944). Multitrait investiga-
tions, each extending from early child-
hood to early adulthood and utilizing
direct measurement data, have been
reported by Herskovits (1930), Gray
and Ayres (1931), Meredith (1935),
Boynton (1936), Goldstein (1936),
O’Brien et al. (1941), and Simmons
(1944).

* Research on the physical growth of chil-
dren began prior to this time. A development
of scientific activity similar to that in America
during the last half of the nineteenth century
occurred in Europe during the last half of
the eighteenth century (Scammon, 1927b).

Cadavers and indirect source materials 
portray fixed ontogenetic states; they 
can be kept for future reference. The 
living subject is an ephemeral source; 
it cannot be retained in a given state 
for later study. This sets the living 
child considerably apart from the other 
sources on two methodologic counts, 
one largely surmountable and the other 
restricting. A record for number of 
erupted teeth obtained from direct ex- 
amination of an infant one year after 
birth cannot be confirmed six months 
later nor can a record for width of hips 
taken on a child 10 years of age be 
verified some years later. It is possible, 
however, to cancel effectively the need 
for confirmation by providing that 
original independent records for each 
trait and subject studied shall be made 
by two or more trained workers. The
second count pertains to the study of
additional traits at a later date, i.e., the
investigator may in time wish to ex-
tend his research with data for more
traits in order to analyze relationships 
not initially projected for study. These 
could not be obtained from the living 
sample (since this would have become 
a source for data representing a later 
stage), whereas they might be accessible 
if nonliving or indirect sources were 
used. The one way to minimize this 
limitation is by increasing the thorough-
ness with which research programs
originally are planned. 

The Child Cadaver. Compared with 
the living child, the child cadaver has 
methodologic advantages from being 
more rigid, more amenable to anatomic 
study, and more permanent. There are 
no rapport requirements, no voluntary 
movements, and no oscillations due to 
ongoing contraction and expansion of 
respiratory, circulatory, or digestive 
systems. Segments and organs may be
removed for examination, and tissues 
may be dissected for macroscopic study 
or sectioned for microscopic study. Al- 
though in investigating some problems 
it is necessary to correct for structural 
changes produced by the medium or 
media employed in specimen preserva- 
tion, cadavers can be preserved over 
long periods. A major disadvantage of 
the child cadaver lies in the fact that 
it is not changing ontogenetically and 
cannot be utilized as a source for 
longitudinal records. It may be drawn 
upon for a wide range of observations 
and measures representing structural 
status at the time of death, but it can 
give no data on topics such as intra- 
individual increment relationships for 
sets of traits or interindividual trend dif- 
ferences for a given trait. 

Child cadavers are difficult to obtain 
for study. Their scarcity explains why 
students preparing to do research on 
structural development in children get 
much of their anatomical laboratory 
training with specimens of human adults 
and young mammals of other species. 
It also explains why far more research 
has been done on phases of anatomic 
ontogenesis open to investigation on the 
living child (e.g., postnatal change in
stature and body weight) than on phases 
open only to investigation on the 
cadaver, e.g., postnatal change of in- 
ternal organs with respect to weight, 
volume, surface area, and tissue com- 
ponents. 

The difficulty of securing prenatal 
human specimens tends to be related 
inversely to stage of development. For 
ages below one week, when the or- 
ganism is as small as a grain of sand 
and placentation has not begun, fewer
than a dozen normal specimens have
been recovered for study (Hertig et al.,
1954).* Many of the research publica-
tions on human ontogenesis up to the
age of one month deal with the procure-
ment and description of single embryos
(Bremer, 1906; Dandy, 1910; Ingalls,
1918; Greenhill, 1927; West, 1930; Lit-
zenburg, 1933; Odgers, 1937; Brewer,
1938; George, 1942; Wilson, 1945). For
the period between the third month and 
birth, there are investigations based on 
samples of several hundred fetuses 
(Streeter, 1920; Schultz, 1926; Scam- 
mon and Calkins, 1929). 

Of the child cadavers that become 
available for study, many cannot be 
accepted as anatomically “normal.” The
diseases that cause death, and the in- 
juries that precede or follow death, in- 
flict varying types and degrees of 
damage; an organ may be hypertro- 
phied, a tissue dehydrated, a segment 
distorted; the modifications may be
slight or severe, localized or wide-
spread. Investigators of normal anatomic
ontogenesis utilize cadaver samples ap-
praised as nonpathologic for specific
purposes, i.e., appraisal of cadavers for
normality is made with reference to the
anatomic item projected for study.
Fewer rejections would be made in
selecting a sample to obtain weight
of the osseous skeleton than in selecting
a sample to obtain weight of the heart
or liver. Boyd (1933), in a well-con-
trolled study of normal variability in
weight of the spleen, selected cadavers
that met the following criteria: (1)
death caused by a violent accident, (2)
ill less than twenty-four hours following
the accident, (3) no symptoms of
relevant disease either at the time of
the accident or at necropsy, (4) no
gross hemorrhage prior to death, and
(5) death not caused by poisons likely
to affect spleen weight, i.e., compounds
of mercury, arsenic or bismuth, and
the strong acids and caustics.

* This does not imply that little is known
regarding early stages of ontogenesis. Em-
bryologists have amassed a large amount of
applicable research on animals of the verte-
brate phylum and mammalian order (Witschi,
1956).

The time during which cadavers can 
be studied in the fresh state is short. 
For more extended study, they must be 
fixed with a preservative. It is known 
that preservation media can produce 
changes in the size and form of the 
body; these changes may take the direc- 
tion of shrinkage or expansion, become 
more marked with time, and vary for 
different tissues. Consequently, a 
methodologic requirement in the use 
of child cadavers as a source for quanti- 
tative data is the control or correction 
for preservation effects. Rasmussen 
(1947), in studying the weight of the 
pituitary gland and its components, 
achieved good control by using the 
glands of children deceased less than
one hour, hence only slightly fixed. 
Since in many studies the collection of 
data at necropsy is not practicable, 
considerable technical attention has 
been given to preservation media. To 
the extent that the ideal of preserving 
cadavers in true biologic size, form, and 
composition cannot be attained, it is 
necessary to know what corrections 
should be made. 

Jackson (1909), Schultz (1919), and 
Scammon and Calkins (1929) investi- 
gated the influence of formalin fixation 
on human fetuses during periods of 
three to eight months. Using a solution 
one part 40 per cent formalin and
nine parts distilled water, they found 
enlargements approximating 1 per cent 
for stature and head dimensions, 13 
per cent for total volume of the body, 
and 20 per cent for thickness of the 
abdomen. Other workers have reported 
shrinkages from immersing soft tissues 
in alcohol and from embedding young 
embryos in paraffin matrices for sec- 
tioning. A highly developed program of 
laboratory procedures for preparing 
and sectioning young embryos has been 
described by Heard (1957). 
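
A correction for fixation can be sketched from the enlargements quoted above. The Python example below assumes that each percentage expresses enlargement relative to the fresh value and that a single figure applies to every specimen; both are simplifications, since the text notes that preservation changes vary with time and tissue. All names are hypothetical, and the measurement in the example is invented.

```python
# Approximate proportional enlargement after formalin fixation, as quoted in
# the text for human fetuses fixed three to eight months.
FORMALIN_ENLARGEMENT = {
    "stature": 0.01,
    "head_dimension": 0.01,
    "body_volume": 0.13,
    "abdomen_thickness": 0.20,
}


def estimate_fresh_value(fixed_value: float, trait: str) -> float:
    """Back-correct a measurement taken on a formalin-fixed specimen.

    Assumes fixed = fresh * (1 + enlargement), so fresh = fixed / (1 + enlargement).
    """
    if trait not in FORMALIN_ENLARGEMENT:
        raise KeyError(f"no enlargement figure quoted for trait: {trait}")
    return fixed_value / (1.0 + FORMALIN_ENLARGEMENT[trait])


# Invented example: a fixed body volume of 113.0 cubic centimeters.
fresh_volume = estimate_fresh_value(113.0, "body_volume")
print(round(fresh_volume, 1))
```

The same function shape would serve for shrinkage media such as alcohol if negative proportions were supplied, but no such figures are given in the text, so none are included here.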

For the study of human ontogenesis 
during prenatal life, a large number of 
the specimens judged to be satisfactory 
with respect to normality and preserva- 
tion are of limited usefulness because 
information on age is lacking. A cadaver 
cannot be used in investigating trait 
distributions at successive ages in the 
absence of direct data pertaining to age. 
Consider a series of 17 embryos, two 
of known age and 15 of unknown age. 
The two of known age (say 4 prenatal 
weeks) have stem lengths of 0.3 and 
0.6 centimeter, respectively. Each of the 
15 others, on the basis of a series of
observations for presence, number, and 
size of body structures, is estimated to 
approximate 4 weeks of age. For the 
study of individual differences at age 
4 weeks, in stem length or in any other 
trait, the group of 17 embryos can 
contribute two subjects only. 
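
The selection rule in this example can be expressed as a filter on the series. The Python sketch below uses a hypothetical record type; stem lengths are entered only for the two embryos whose values are given in the text, and the remaining fifteen are left without values rather than invented.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Embryo:
    stem_length_cm: Optional[float]   # None where the text gives no value
    known_age_weeks: Optional[float]  # direct age data; None when age was only estimated


def usable_for_age_study(series: List[Embryo], age_weeks: float) -> List[Embryo]:
    """Keep only specimens whose age is known directly and equals the target age."""
    return [e for e in series if e.known_age_weeks == age_weeks]


# Two embryos of known age (4 prenatal weeks) and fifteen whose age was
# estimated from structural observations and is therefore recorded as unknown.
series = [Embryo(0.3, 4), Embryo(0.6, 4)] + [Embryo(None, None) for _ in range(15)]
usable = usable_for_age_study(series, 4)
print(len(series), len(usable))
```

Recording an estimated age in a separate field, rather than in the known-age field, is one way to keep specimens aged from their structure out of analyses of structure at a given age.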

The earlier the stage of development 
under investigation the more important 
does precise information on age become. 
Changes are so rapid in the early weeks 
of life that to age a 2-week-old embryo 
incorrectly by three days is more 
serious than incorrectly aging a 2-year- 
old child by three weeks. Notwithstand- 
ing, information leading to close 
determination of age for embryos is 
difficult to secure. It is not possible to 
obtain data for the exact time of fertili- 
zation; rarely is information available 
from which the day of fruitful copula- 
tion can be determined with reasonable 
dependability; and not uncommonly the 
date of the beginning of the menses 
preceding fertilization is unobtainable. 
In consequence, primary identification 
and subgrouping of prenatal specimens 
is not usually in terms of true age: in 
studies on embryos it is in terms of stage 
of preparation for implantation (Hertig 
and Rock, 1944), number of somites 
(Bartelmez and Evans, 1926; Heuser, 
1930), or the developmental categories 
defined by Streeter (1942); in studies on 
fetuses it is in terms of body length 
(Scammon and Calkins, 1929) or 
menstrual age (Streeter, 1920).*

* Menstrual age is equivalent to the true
age of the organism plus the time between
commencement of the mother’s last menses be-
fore pregnancy and formation of the zygote
(roughly, true age plus a fortnight).
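
Two quantitative points in this passage lend themselves to a short check: the comparison of aging errors, and the footnote's rough relation between menstrual age and true age. The Python sketch below treats the fortnight as exactly 14 days and a year as 365 days, both assumptions made for illustration; the function names are hypothetical.

```python
FORTNIGHT_DAYS = 14  # the footnote's rough offset, taken here as exactly 14 days


def relative_age_error(error_days: float, true_age_days: float) -> float:
    """Return the aging error as a fraction of the specimen's true age."""
    if true_age_days <= 0:
        raise ValueError("true age must be positive")
    return error_days / true_age_days


def menstrual_age_days(true_age_days: float) -> float:
    """Approximate menstrual age as true age plus a fortnight."""
    return true_age_days + FORTNIGHT_DAYS


embryo_error = relative_age_error(3, 14)        # three days on a 2-week-old embryo
child_error = relative_age_error(21, 2 * 365)   # three weeks on a 2-year-old child
print(f"embryo: {embryo_error:.1%}  child: {child_error:.1%}")
print(menstrual_age_days(28))
```

On these figures the embryo's error is several times the child's in proportional terms, although it is the smaller of the two in absolute days.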

The following list illustrates aspects
of anatomic ontogenesis that have been 
investigated using child cadavers as the 
direct source for data. During the pre-
natal period developmental changes in
number of ossification centers (Mall, 
1906), composition of the tailbud 
(Kunitomo, 1918), position of the 
larynx and lungs (Noback, 1923), linear 
dimensions of the head and trunk 
(Scammon and Calkins, 1929), areal
measures of the upper and lower limbs
(Boyd, 1935), ponderal measures of 
the hands and feet (Scammon, 1930), 
and body proportions (Schultz, 1926). 
For infancy and early childhood, de- 
velopmental changes in weight of the 
brain and its subdivisions (Scammon, 
1936), calcification of the permanent
teeth (Logan and Kronfeld, 1933), and
weight of the kidneys, liver, and spleen 
(Coppoletta and Wolbach, 1933). Over 
the first two decades of life, develop- 
mental changes in volume and surface 
area of the cerebrum (Scammon and 
Hesdorffer, 1935), surface area and 
weight of the stomach (Scammon, 
1919), weight of the thymus gland and 
its component parts (Boyd, 1936), and 
total area of the external body surface 
(Boyd, 1935). 

Roentgenograms (Radiographs). 
With the discovery of the X-ray by 
Wilhelm Roentgen in 1895, many facets 
of anatomic ontogenesis previously ac- 
cessible to study only by the cross-sec- 
tional approach (using different 
cadavers for each stage) were opened 
to study by the longitudinal approach. 
Roentgenograms could be made de- 
picting the same child at successive 
stages with respect to ossification of 
the skeleton (Pyle and Sontag, 1943),
calcification of the dentition (Gleiser
and Hunt, 1955), thickness of the 
skeletal musculature (Reynolds, 1944), 
and shape of the viscera (Lincoln and 
Spillman, 1928). 

The roentgenogram has become the 
most broadly used of the several in- 
direct sources of research data on 
human structural development. Seriate, 
or time series, radiographs have been 
accumulated for all regions of the 
body (Stuart, 1939). Beginning with 
Pryor’s publication (1905), reported investigations
on child growth have
utilized radiographs of the head
(Young, 1957), face (Woods, 1950), 
thorax (Maresh, 1948), abdomen
(Bakwin and Bakwin, 1935), pelvis
(Reynolds, 1947), arm and forearm 
(Maresh, 1955), wrist and hand (Bald- 
win, Busby, and Garside, 1928), knee 
(Pyle and Hoerr, 1955), leg (Stuart 
and Dwinell, 1942), and foot (Wallis, 
1931). 

One easily underestimates the tech- 
nical aspects of obtaining roentgeno- 
grams suitable for research purposes and 
adequately interpreting them as datum 
sources. A radiographic film, once ex- 
posed and developed, is a permanent 
record, a fixed exhibit pertaining to 
some part of the child’s body. This 
tangibility should not be confused with 
anatomic validity. In order to study de- 
velopmental changes in the form of a 
structure, the consecutive roentgeno- 
grams must be secured with the struc- 
ture in a similar position on each occa- 
sion. For studying developmental 
changes in the size of a structure, there 
must be standardized positioning and 
also adjustment of the radiographically 
obtained measures to “life-size” measures.
For investigating developmental
changes in tissue density, it is necessary 
to hold the degree of X-ray penetration 
in good control and detect deviations 
from control with a densimeter (Garn 
and Shamir, 1958). In short, the extent 
to which data derived from radiographs 
are valid depends upon the purpose(s) 
for which they are used and numerous 
methodologic considerations. 

The object to be radiographed is 
placed between the X-ray tube and the 
cassette (film holder) in a location 
near the cassette and some distance from 
the tube. Exposure is made for x time 
(seconds) at y milliamperes and z kilo- 
volts, the values for these three factors 
varying with the mass, composition, and 
location of the object (Stunz, 1934; 
Stuart, 1939; Cartwright and Harvold, 
1954). Exposure time increases with 
the distance of the object from the 
tube; the higher the milliampere-seconds 
the greater the density (blackness) of 
the image; kilovolts control contrast, 
higher kilovoltage being required for 
the penetration of hard tissues than 
soft tissues (Franklin, 1957). X-rays 
emanate from a small area termed the 
focal spot and travel in straight lines; 
the zone they radiate increases with 
distance from the focal spot and diaphragm
(diaphragms are used to constrict
the beam of radiation). Marked
improvements have been made in 
radiological equipment and procedure. 
The trend in recent years is toward the 
use of fast film, small focal spot, in- 
tensifying screens, and the highest kilo- 
voltage compatible with adequate 
contrast (Krogman and Sassouni, 1957). 
Intensifying screens (calcium-tungstate-impregnated
bristol board placed
in contact with each side of the film)
make it possible to reduce exposure time 
with little increase in the granular 
appearance of the image. 

It is a methodologic objective to 
place the structure under study in a 
defined, reproducible, and anatomically 
adequate position with respect to the 
plane of the cassette and the path of 
the central ray emanating from the tube. 
Positioning for an anteroposterior view 
of the child’s leg has been studied by 
Stuart, Hill and Shaw (1940) and by 
Reynolds (1944). In the former in- 
vestigation the child was placed in a 
supine position on a flat surface with 
the lower limbs extended, both ischia
of the pelvis resting evenly on the 
surface, the dorsal aspect of the right 
leg in relaxed contact with the cassette, 
and the right foot perpendicular. An 
assistant supported the foot and ap- 
plied sufficient pressure over the lower 
right thigh to prevent flexion at the 
knee joint. The focal spot was centered 
over the greatest thickness of the calf 
at a distance 90 centimeters from the
film. Roentgenograms from two in- 
dependent positionings for each of 26 
children showed the tissue components 
of the leg to be highly reproducible by 
this procedure.

Reynolds obtained roentgenograms 
on 107 children, using the same supine 
position and also an erect position. The 
latter required the child to stand with 
his weight evenly distributed through 
the lower limbs, his feet pointing 
directly forward, and his calf touching 
the cassette without pressure. Breadth 
of bone was identical in the two series, 
but breadth of muscle was greater for 
the supine series than the erect. 


Reynolds suggested that although the 
supine procedure can be defined and 
duplicated satisfactorily it is inadequate 
anatomically, since the amount of 
spreading of the calf on the cassette 
tends to vary directly with age and in- 
versely with firmness of the leg mus- 
culature and/or adipose tissue. An- 
atomic adequacy could be enhanced 
further by setting the mid-point of the 
calf at a constant distance from the 
focal spot (say 100 centimeters) and 
at a fixed distance from the cassette 
sufficient to allow for age increases in 
calf thickness. 

Studies relating to placement of the 
child’s manus (wrist and hand) for a 
posteroanterior roentgenogram have 
been made by Bayer and Gray (1933), 
Baldwin, Busby, and Garside (1928), 
and Flory (1936). On the basis of a 
broad historical and experimental in- 
vestigation on positioning the manus 
for research purposes, Bayer and Gray 
recommended the following standard 
orientation: (1) child seated on a low 
stool so that his armpits approximate 
the height of a table carrying a hori- 
zontally placed cassette; (2) arm in 
90 degrees abduction, forearm in 90
degrees flexion, and hand in complete 
pronation; (3) axis of the middle meta- 
carpal parallel with the axis of the 
forearm; and (4) fingers moderately 
fanned, each in line with its metacarpal. 
The hands of a child and a young adult 
positioned on cassettes in this manner 
were radiographed by Baldwin, Busby, 
and Garside at film-tube distances of 50 
and 100 centimeters. Bone areas on the 
radiographs taken at the shorter distance 
were found to be larger than corresponding
areas at the greater distance
by almost 1 per cent. Using similar
film-tube distances, Flory radiographed 
metal blocks 2.5 centimeters square and 
graduated in thickness 1.2 to 5 centimeters.
The blocks were placed on the
cassette to simulate thickness variations 
for different parts of the manus and 
suspended 1.2 centimeter above the 
cassette to simulate increase in thickness 
of the manus with age. At the 100- 
centimeter film-tube distance, radio- 
graphic area was 7 per cent less for the 
block 1.2 centimeter thick than for the 
block 5 centimeters thick. Areal en- 
largement from raising the blocks 1.2 
centimeter was around 2 per cent. 
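
Figures of this order are what simple projection geometry predicts. The sketch below is an illustration only, not Flory's own computation: it assumes that the X-rays diverge from a point source and that the shadow of each block is outlined by the face farthest from the film. The function name and parameters are hypothetical.

```python
def areal_enlargement(film_tube_cm, face_height_cm):
    """Ratio of shadow area to true area for a flat face parallel to the film.

    Point-source assumption: linear magnification is D / (D - h), where D is
    the film-tube distance and h is the height of the face above the film.
    Area scales with the square of the linear magnification.
    """
    if not 0 <= face_height_cm < film_tube_cm:
        raise ValueError("face must lie between the film and the tube")
    linear = film_tube_cm / (film_tube_cm - face_height_cm)
    return linear ** 2


D = 100.0                              # film-tube distance, centimeters
thin = areal_enlargement(D, 1.2)       # upper face of the 1.2 cm block
thick = areal_enlargement(D, 5.0)      # upper face of the 5 cm block

# Per cent by which the thin block's shadow area falls short of the thick block's
thin_vs_thick_pct = (1 - thin / thick) * 100

# Per cent areal gain when the thin block is raised 1.2 cm above the cassette
raised_pct = (areal_enlargement(D, 1.2 + 1.2) / thin - 1) * 100

print(round(thin_vs_thick_pct, 1), round(raised_pct, 1))
```

Under these assumptions the thin block's shadow is roughly 7.5 per cent smaller in area than the thick block's, and raising a block 1.2 centimeters adds roughly 2.5 per cent, close to the reported 7 and 2 per cent.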

In general, a standard prone position 
for the manus can be specified and 
duplicated. Placement of the manus 
in contact with the cassette, although 
an anatomically adequate procedure for 
ossification studies based on inspectional 
data (Stuart, 1934; Greulich and Pyle, 
1950), is not a fully adequate procedure 
for research on bone size. For the latter 
purpose, it would be preferable to orient 
the median plane of the manus 5 
centimeters from the film and approxi- 
mately 100 centimeters from the focal 
spot. 

Except as research problems neces- 
sitate deviation (Popovich, 1957), it is 
common laboratory practice when 
radiographically studying the head 
(calvaria and face) to work from the 
norma lateralis roentgenogram (Björk, 
1947; Brodie, 1953; Higley, 1954) and/ 
or the norma posteroanterior roentgen- 
ogram (Woods, 1950; Newman and 
Meredith, 1956). In obtaining both, the 
head is placed with the lowest point 
of the external margin of the left 
orbit, and the highest point of the external
margin of each acoustic meatus,
in a plane at right angles to the long 
axis of the trunk. For the lateral view, 
the central roentgen ray passes through 
the child’s acoustic meatuses at right 
angles to the cassette; for the antero- 
posterior view, the central ray passes 
through the mid-point of the trans- 
meatal line at right angles to the cassette. 
The head is oriented and supported 
with the aid of a cephalostat. There 
are several varieties of this instrument 
(Krogman and Sassouni, 1957); all re- 
quire painstaking use and full subject 
cooperation. Björk (1947) meticulously 
positioned the child’s head in a cephalo- 
stat and maintained it with a short rod 
in each acoustic meatus and a chin 
rest. Using fixed distances of 155 centi- 
meters from the focal spot to the mid- 
sagittal plane of the head and 9 
centimeters from the midsagittal plane 
to the film, he obtained two independent 
norma lateralis roentgenograms on each 
of 20 boys. His positioning procedure 
was found to give roentgenograms suf- 
ficiently dependable for measurement to 
the nearest 0.01 centimeter. 

Björk (1955) described a method 
involving the combined use of radio- 
graphs and metallic implants for the 
study of human osteogenesis. Employ- 
ing an especially designed instrument, 
he placed a small number of vitallium 
pins 0.2 centimeter long in the right 
jaws of five children between the ages 
of 4 and 14 years. A local anesthetic 
was used and the pins instantaneously 
inserted to a controlled depth. Radio- 
graphs were made at the time of in- 
sertion and successively for a period of 
two years. Björk’s report indicates
that the method has promise as a means
of investigating the sites of bone deposition
and resorption during childhood.

A roentgenogram transposes anatomic 
structures that are three-dimensional 
to two-dimensional surfaces. This tele- 
scoping to a plane gives rise to a class of 
methodologic problems. Some anatomic 
structures and/or structural com- 
ponents are difficult to locate by 
roentgenogram inspection, and, con- 
versely, some features of the roentgen- 
ogram are difficult to identify on the 
basis of training in anatomic dissection. 
For the most part the technical task 
of resolving such problems is not 
complicated. In locating anatomic
points, one may attach small radiopaque 
markers to cadaveric or living tissue at 
desired points and then reproduce the 
designations radiographically (Potter 
and Meredith, 1948). In determining 
the anatomic basis of radiographic 
shadows, one may use the method em- 
ployed by Etter (1955). Etter dis- 
articulated the component bones of dry 
skulls and radiographed them separately 
and in groups. Intercomparison was 
made of radiographs for (1) single 
bones posed in skull position, (2) skulls 
with one or more bones removed, (3) 
skulls with all components replaced, 
and (4) heads of living subjects. 

Anatomic structures cannot be re- 
produced radiographically in correct 
size. This is inconsequential when 
roentgenograms are used as a source 
for observational data (Greulich and 
Pyle, 1950), angle measurements 
(Higley, 1954), and ratios (Meredith, 
Knott and Hixon, 1958). In the use 
of roentgenograms for the study of 
absolute dimensions, the matter should
always receive consideration and adjustments
must usually be made (Ritt
and Sawtell, 1930; Adams, 1940; Woods, 
1950; Newman and Meredith, 1956). 
Thurow (1951) has systematized the 
corrections that may be necessary 
under three captions: enlargement, dis- 
tortion, and blurring. Enlargement is a 
methodologic variable specific to single 
structural planes. For example, skeletal 
nose height is a dimension in the mid- 
sagittal plane (Meredith, 1958). En- 
largement of this dimension (1) 
increases with distance from the mid- 
sagittal plane to the film, (2) decreases 
with distance from the focal spot to 
the film, and (3) increases with size of 
the dimension. 
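
These three relations follow from similar triangles when the focal spot is treated as a point. The sketch below is a minimal illustration under that assumption, with hypothetical function names; it computes the enlargement of a dimension lying in a plane parallel to the film and reduces a film measure to life size. The distances are those reported above for Björk (1947); the 5.0-centimeter film measure is invented for the example.

```python
def magnification(focus_film_cm, plane_film_cm):
    """Linear magnification of a plane parallel to the film (point source)."""
    if not 0 <= plane_film_cm < focus_film_cm:
        raise ValueError("plane must lie between the focal spot and the film")
    return focus_film_cm / (focus_film_cm - plane_film_cm)


def enlargement(true_size_cm, focus_film_cm, plane_film_cm):
    """Amount by which the film image exceeds the true dimension."""
    return true_size_cm * (magnification(focus_film_cm, plane_film_cm) - 1.0)


def life_size(film_measure_cm, focus_film_cm, plane_film_cm):
    """Reduce a measure taken on the film to its life-size value."""
    return film_measure_cm / magnification(focus_film_cm, plane_film_cm)


# Focal spot 155 cm from the midsagittal plane, plane 9 cm from the film
focus_film = 155.0 + 9.0
m = magnification(focus_film, 9.0)

print(round(m, 3))                                # magnification factor
print(round(life_size(5.0, focus_film, 9.0), 2))  # hypothetical 5.0 cm film measure
```

With this geometry the factor is about 1.058, so a dimension read as 5.0 centimeters on the film corresponds to about 4.73 centimeters in the midsagittal plane.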

Distortion pertains to radiographic 
magnification when the termini for a 
measurement lie in different anatomic 
planes. To illustrate: if the length of the 
lower left jaw were required from a 
norma lateralis roentgenogram, the end 
of the dimension lying in the midsagittal 
plane would be magnified differently 
than the end lying more laterally. Ad- 
justment for distortion is correction for 
such differential magnification. En- 
largement and distortion corrections are 
calculated on the assumption that the 
X-rays emanate from a point. In fact, 
X-rays emanate from a small area and 
add a slight blurring variable. Blurring 
(1) increases with the size of the focal 
spot, (2) decreases with distance be- 
tween the focal spot and plane of the 
object, and (3) increases with distance 
from the plane of the object to the 
film. 
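
Both adjustments can be sketched with the same similar-triangle geometry. The code below is an illustrative sketch, not Thurow's published procedure: it assumes a point source for the distortion correction, a focal spot of finite width for the blur estimate, and landmarks whose distances from the film are known. All function names and sample values are hypothetical.

```python
import math


def magnification(focus_film_cm, plane_film_cm):
    """Linear magnification of a plane parallel to the film (point source)."""
    return focus_film_cm / (focus_film_cm - plane_film_cm)


def corrected_length(film_a, film_b, depth_a_cm, depth_b_cm, focus_film_cm):
    """True distance between two landmarks lying in different planes.

    film_a, film_b: (x, y) film coordinates in centimeters, measured from
    the point where the central ray meets the film.
    depth_a_cm, depth_b_cm: distance of each landmark's plane from the film.
    Each end is reduced by its own magnification before the distance is taken.
    """
    ma = magnification(focus_film_cm, depth_a_cm)
    mb = magnification(focus_film_cm, depth_b_cm)
    ax, ay = film_a[0] / ma, film_a[1] / ma
    bx, by = film_b[0] / mb, film_b[1] / mb
    return math.sqrt((bx - ax) ** 2 + (by - ay) ** 2
                     + (depth_b_cm - depth_a_cm) ** 2)


def blur_width(focal_spot_cm, focus_object_cm, object_film_cm):
    """Width of the penumbra cast by a sharp edge."""
    return focal_spot_cm * object_film_cm / focus_object_cm


# Hypothetical landmarks: one in the midsagittal plane, one more lateral
length = corrected_length((3.2, 2.1), (-4.1, 1.0), 9.0, 4.0, 164.0)
print(round(length, 2))
print(blur_width(0.1, 155.0, 9.0))
```

The blur function reproduces the three stated relations directly: it grows with focal-spot size and with object-film distance, and shrinks as the focal spot is moved farther from the object.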

In accumulating seriate roentgen- 
ograms as a source for data on anatomic 
development, the responsible investigator
takes steps to minimize radiation
exposure (Stanford and Vance, 1955; 
Schubert and Lapp, 1957). He does this 
through efficient research design, selec- 
tion of radiological equipment, and 
protective laboratory procedures. Ra- 
diation exposure is reduced by using 
filters to absorb extraneous radiation, 
diaphragms to restrict the width of the 
roentgen beam, fast films and intensify- 
ing screens to reduce exposure time, and 
leaded aprons to shield critical tissues 
(Blackman and Greening, 1957; Graber, 
1958; Tanner, Whitehouse and Powell, 
1958; Hixon, 1960). 

Photographs and Silhouettes. Photo- 
graphic methods make possible the 
permanent visualization of anatomic 
structures from chosen viewpoints. This 
sometimes is advantageous for both 
cross-sectional and longitudinal
investigations. To assemble several
hundred children of a given age for
intercomparison with respect to one or 
more body features is not practicable; 
it is practicable, however, to amass and 
compare silhouettes on such a sample. 
Photographs of individual children 
taken at a series of developmental stages 
have unique value in supplying time- 
free panoramas on the ontogenesis of 
surface anatomy. 

A far greater variety of photographic 
source materials is accessible on child 
cadavers than on the living organism. 
To date, investigations utilizing data 
derived from photographing the live 
child have dealt largely with individual 
and age differences in (1) postural 
variables, such as lumbar lordosis, knee 
hyperextension, and foot pronation, 
(2) circumpuberal variables, such as 
breast contour, scrotum size, and distribution
of pubic hair, and (3) body
form¹ variables, such as limb and trunk
stockiness, shoulders-hips relationship,
and bulgings of adipose and muscle 
tissue. 

Investigators of anatomic ontogenesis 
employ photographs and/or silhouettes 
in a number of separable roles. In some 
studies the role is that of providing il- 
lustrative examples of the phenomenon 
under discussion; in others photographs 
are utilized to define, or help define, the 
categories of rating scales; and in still 
others they are the sole source, or a 
primary source, of data on a growth 
problem. 

Shuttleworth (1951) assembled pho- 
tographs to illustrate (1) sequential 
modifications in over-all appearance of 
the external body during childhood and 
adolescence, and (2) individual dif- 
ferences at selected circumpuberal ages 
in appearance of the breast region, 
pubic region, and body as a whole. 
Others have used photographs to il- 
lustrate such matters as changes in loca- 
tion and composition of the deciduous 
teeth during prenatal life (Diamond, 
1944); variability of physique in infancy 
(Bayley and Davis, 1935); changes in 
the facial profiles of children brought 
about by orthodontic treatment (Hell- 
man, 1935); body build of siblings and 
of parents and offspring (Reynolds, 
1951); association between precocious 
puberty and ossification of the manus 
(Greulich and Pyle, 1950); circum- 
puberal breast development in boys 
(Greulich et al., 1942); and variations 
in the relation between breast development
and pubic-hair development in
girls (Reynolds, 1948).

1 Alternative terms are body shape, physique,
body build, morphologic type, body proportion,
physical constitution, and somatotype.

Posture-rating scales for use with 
children 2 to 10 years of age were 
constructed by Robinow, Leonard, and 
Anderson (1943). The scales, derived 
from examination of several hundred 
photographs, are for five regional 
posture variables: rear-view photo- 
graphs define scale points for knock- 
knees and foot pronation; lateral-view 
photographs define scale points for 
hyperextended knees, lumbar lordosis, 
and slumped back. Kubitschek (1932), 
drawing from direct observations on 
730 boys, developed a three-category 
photographic rating scale for each 
annual age, 11 to 18 years. These scales 
are front-view photographs showing 
moderately early, typical, and moder- 
ately late development of the scrotum, 
penis, and pubic hair. Reynolds and 
Wines (1948) used front-view photo- 
graphs of girls to define partly the 
categories for rating circumpuberal 
growth in the breast and pubic regions: 
photographs complement verbal de- 
scriptions for four stages of breast 
shape, three stages of breast size, five 
stages of breast-areola-papilla relation- 
ship, and three stages of pubic-hair dis- 
tribution. 

From direct and photographic exam- 
ination of 757 boys, 10 to 19 years 
of age, Greulich et al. (1942) de- 
veloped a scale for subgrouping boys 
on the basis of development of the ex- 
ternal genitalia and the pubic, axillary, 
facial, and perineal hair. Here, verbal 
definitions for five developmental stages 
are supplemented with front-view pho- 
tographs of the full body and of the 
pubic region. Bayley and Bayer (1946)
used photographs to illustrate a set of
definitions for rating androgyny in 
young adults: rear-view photographs 
supplement verbal description of a five-step
masculinity-femininity scale for
each of eight variates of limb and trunk 
contour. From photographs of 4000 
young adult males (front, side, and 
rear views), Sheldon, Stevens, and 
Tucker (1940) drew three samples of 
100 considered extreme in predominance
of subcutaneous fat, predominance
of voluntary musculature, and
paucity of muscle and fat. These were 
used to obtain rating criteria; i.e., three 
deviant categories were verbally de- 
fined by enumerating the distinguishing 
features of each group of photographs. 

Profile silhouettes were the sole 
source materials utilized by Crook 
(1937) for studying variation of erect 
posture in 76 children between 2 and 
5 years of age. Robinow, Leonard, and 
Anderson (1943) used rear-view and 
profile photographs on 170 children for 
studying age trends in five posture 
variables over the period 2 to 12 years 
of age. Profile or side-view photographs 
also were employed by Schwartz, 
Britten, and Thompson (1928) for de- 
termining the effect of systematic ex- 
ercise on five posture variables in a 
group of boys 12 to 18 years of age. 

Photographs and silhouettes, in con- 
junction with roentgenograms, direct 
observations, family-history records, 
and health-care protocols, were used by 
Stuart (1939) in a developmental and 
comparative study of six related chil- 
dren covering the early years of post- 
natal life. Front-view photographs were 
employed by Seckel, Scott, and Benditt
(1949) as one of several media for
describing precocious sexual development
in six children between 2 and 9
years of age. Tracings of anterior and 
posterior photographs at ages between 
4 and 19 years were utilized by Boyd 
(1955) in a longitudinal study of body 
size and form combining data from the 
living child, the roentgenogram, and 
the photograph. In a study of 112 boys 
9 to 19 years of age, many of whom 
were examined at semiannual intervals 
over a seven-year period, Stolz and 
Stolz (1951) made joint use of data 
from front, side and rear photographs, 
medical examinations, and direct meas- 
urement of the body. 

Photographs were the primary source 
materials utilized by Sheldon, Stevens, 
and Tucker (1940) in an investigation 
of physique variations among 4000 
young adult males and by Bullen and 
Hardy (1946) in an investigation of 
physique variations among 175 young 
adult females. For samples of young 
adults of both sexes, photographs were 
used by Bayley and Bayer (1946) in a 
study of androgyny and by Dupertuis, 
Atkinson, and Elftman (1945) in a 
study of patterns of pubic-hair distribu- 
tion. 

In obtaining photographic source 
materials, as elsewhere, procedural 
requirements vary with research pur- 
poses (Meredith, 1955b). The extent 
to which such matters as precise posi- 
tioning of the subject, controlled light- 
ing, constant subject-lens distance, 
standardized processing, and water- 
proof-base printing paper are method- 
ologic requisites depends upon the 
investigator’s problem. For some in- 
spectional and illustrative purposes, 
fairly crude photographic technic is
adequate; as a source for measurement
data, photographs must meet exacting 
specifications (Tanner and Weiner, 
1949; Geoghegan, 1953). 

Clough and Murlin (1928) described 
an installation designed to produce 
photographs of school-age children 
suitable for the extraction of metric 
data. Their apparatus and method held 
constant the subject-film distance,
camera height, light arrangement, and 
type of exposure. By mirror photog- 
raphy and a stereoscopic camera, two 
lateral-view images and two anterior- 
view images were secured simulta- 
neously. Centimeter scales, cemented to 
iron posts, were at the child’s right and 
rear. The child was positioned with his
heels separated by 12 centimeters, his 
sacrum and upper back against the rear 
scale, and his eyes in a horizontal plane 
passing through the external acoustic 
meatuses. 

A procedure whereby photographs 
could be used as a source of measure- 
ments of the posture of preschool chil- 
dren was developed by Sweeny et al. 
(1929). Following laboratory experi- 
mentation with various body markings, 
photographic backgrounds, and scal- 
ing devices, they recommended (1) 
using a flesh pencil to spot the acromion 
processes, inner and lower borders of 
the scapulae, spinous processes of the 
vertebrae from the seventh cervical to 
the sacrum, posterior superior iliac 
spines, xiphisternal junction, center of 
the patellae, external malleoli, and head 
of the fibulae, (2) photographing the 
child against a dull walnut-brown back- 
ground marked with a white line along 
its lower edge and another down one 
side, (3) photographing a grid of horizontal
and vertical lines 5 centimeters
apart, with a mid-vertical line heavier 
than the others, and (4) printing simul- 
taneously a negative of the grid and 
a negative of the child. Superimposition 
of the negatives was made so that their 
horizontal lines were parallel and the 
mid-line of the grid passed through the 
cleft of the buttocks in the rear view, 
the left patella in the front view, and 
the external malleolus in the side view. 

Halverson (1936) discussed a double- 
exposure method for deriving face 
measurements in infancy. He recom- 
mended photographing the infant dur- 
ing sleep, then adding to the negative a 
centimeter grid exposed at the same 
distance. 

Gavan, Washburn, and Lewis (1952) 
published an experimentally based study 
on the technical aspects of procuring 
a scaled photograph of such excellence 
as to constitute an adequate source for 
metric data on body size and form. 
They treated lighting and positioning, 
reduction and distortion, and selection 
of film and printing paper. Their rec- 
ommendations, made with reference to 
photographing subjects in the erect 
position, include (1) using a long focal- 
length lens and a minimum lens-subject 
distance of 600 centimeters, (2) vary- 
ing the floor-lens distance with the 
height of the subject, (3) marking 
needed skeletal points with gummed 
stickers (the type used to reinforce 
holes in loose-leaf paper), (4) lighting 
the subject with a single high-speed 
strobe light mounted directly above the 
camera and also lighting the backdrop, 
(5) including in the photograph a size 
scale and a scale with ten graduations
from white to black,¹ and (6) using
moderately fast, fine-grained film and 
nonshrinkable printing paper. Dupertuis 
and Tanner (1950) gave particular 
attention to the positioning of the subject
for erect front-, side-, and rear-view
photographs. They considered
orientation of the head, posture of the 
limbs and trunk, and placement of the 
feet. Regarding foot placement, it was 
recommended that the subject stand on 
a turntable with his heels separated by 
4 centimeters and his feet turned outwards
at angles of 10 degrees each
from the midsagittal plane. Geoghegan
(1953) discussed a number of additional 
body positions and demonstrated the 
usefulness of some of them in estimat- 
ing the surface area of the body. 
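
One geometric reason a long lens-subject distance matters when a photograph is to serve as a source of measurements can be sketched as follows. This is an illustration under an ideal pinhole-camera assumption, not a computation taken from Gavan, Washburn, and Lewis; the function name and the 10-centimeter offset are hypothetical. Image scale is inversely proportional to distance from the lens, so a body surface nearer the lens than the plane of the size scale is rendered too large.

```python
def relative_scale_error(lens_to_reference_cm, offset_toward_lens_cm):
    """Fractional excess in image scale for a point nearer the lens.

    lens_to_reference_cm: distance from the lens to the plane of the size scale.
    offset_toward_lens_cm: how far the point lies in front of that plane.
    """
    nearer = lens_to_reference_cm - offset_toward_lens_cm
    if nearer <= 0:
        raise ValueError("point must lie in front of the lens")
    return lens_to_reference_cm / nearer - 1.0


# Compare a short working distance with the recommended minimum of 600 cm
for distance in (200.0, 600.0):
    error_pct = 100 * relative_scale_error(distance, 10.0)
    print(distance, round(error_pct, 1))
```

On this model a surface 10 centimeters in front of the scale plane is overstated by about 5.3 per cent at 200 centimeters but only about 1.7 per cent at 600 centimeters.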

Hunt and Giles (1956) described a 
commercial photographic installation 
manufactured by the Photo-Metric 
Corporation. This assembly includes 
camera, mirrors, and electronic flash 
units so arranged that one exposure 
records four views of the subject: front, 
left side, back, and overhead. A projec- 
tor and mirror enlarge the obtained 
glass positives to one half life size. A 
photographic method that gives a 
contour map of body surfaces has been 
described by Sassouni (1957). He rec- 
ommends securing a view of the con- 
formation of face depth as follows: (1) 
place the subject’s head in norma
lateralis; (2) from a distance 250 centi- 
meters in front of the subject, project 
a millimetric grid upon his face; and 
(3) with a camera at right angles to 
the midsagittal plane and 150 centimeters
from the mid-point of the
transmeatal axis, photograph the contour-lined
face.

1 In the adequately exposed and developed
negative each graduation is distinguishable.

Stuart (1939) discussed photography 
of the nude child with respect to allay- 
ing apprehension, obtaining natural- 
ness of pose, dealing with modesty, 
and counteracting fatigue. Dupertuis 
and Tanner (1950) emphasized the need 
for training in positioning the child. 
Shuttleworth (1951) stressed the im- 
portance of preserving anonymity in 
the use of nude photographs, e.g., by 
masking the eyes or blocking out the 
face. 

Molds and Casts. In the study of 
anatomic ontogenesis molds (impres- 
sions, negatives) and casts (models, 
positives) have been made for the 
palate, viscera, voluntary musculature, 
and entire body of the child cadaver 
(Boyd, 1935; Freiband, 1937; Wilmer 
and Scammon, 1945) and for the 
maxillary and mandibular dental arches, 
face, and entire body of the living child 
(Boyd, Scammon, and Lawrence, 1930; 
Cohen, 1940; Schwarz, 1933). 

For what sorts of datum is the mold 
or cast a uniquely valuable source? It 
is more fatiguing and time-consuming 
for a child to serve directly as a source 
of measurement of the surface area of 
the body than for him to submit to 
the preparation of a body mold (Boyd, 
1935). Data for height of the palate 
(Channing and Wissler, 1908), depth 
of the dental arches (Meredith and 
Cox, 1954), and width of the crowns 
of the molar teeth (Moorrees et al., 
1957) can be obtained more precisely 
and conveniently on a cast than by 
measurement directly in the mouth.
Being three-dimensional, the cast is
superior to photographs as a source of 
data on the surface area of body seg- 
ments and superior to roentgenograms 
as a source of data on the form of the 
dental arches. Although the cast has 
no advantage over direct inspection as 
a source of data on age or order of 
tooth eruption, when casts are obtained 
for other purposes (e.g., studying age
changes in tooth position), they con- 
stitute an alternative source for secur- 
ing tooth-eruption data. The cast, in 
common with the roentgenogram and 
photograph, fixes a structural state and 
makes it possible for the investigator 
to accumulate series of such states for 
large-scale intercomparison and ex- 
tended study. 

As sources for determining the area 
of the external body surface, molds 
and casts have been used by Sawyer, 
Stone, and DuBois (1916) and by 
Boyd, Scammon, and Lawrence (1930). 
The former made a mold for a boy 
12 years of age. After covering the 
boy’s entire body with cloth (union 
suit, thin gloves, and cotton bandage),
the gloves were infiltrated with paraffin, 
and all other parts, except the area of 
the anterior nares, were wrapped with 
gummed paper tape. The resultant firm 
mold was cut to permit removal, and 
additional study was made of those 
surfaces not contacted by the mold, 
i.e., the cutaneous areas between the
toes, behind the ears, and around the 
scrotum and penis. 

Boyd, Scammon, and Lawrence ob- 
tained nine cast reproductions of chil- 
dren between the ages of 3 and 5 years. 
Plaster of Paris was used for mold
and model, the mold being made on a
large galvanized-iron tray. To prevent 
sticking, each child’s body was greased 
with stearine (stearic acid and kero- 
sene); his eyelids were closed, coated 
with vaseline, and covered with soft 
paper, and his head hair was protected 
with a snugly fitting silk cap. The mold 
was made in three stages: (1) semi- 
solid plaster was placed in the tray, 
and a mold of the dorsal half of the 
body was made by having the child 
assume a supine position—his feet were 
about 10 centimeters apart, his hands 
about 10 centimeters from the body, 
and his fingers moderately spread; (2) 
the following day the child returned 
to this dorsal half-shell and a mold was 
made of the ventral portion of the 
body from soles to neck; (3) the ventral 
mold was removed, a small segment near 
the neck was replaced, and a mold 
was obtained for the remainder of the 
head and the face, excluding, of course, 
the anterior nares region. Several cast- 
ing procedures were tried before a 
satisfactory one was devised. It was 
found necessary to grease the mold 
generously, to reinforce the limb and 
finger regions with iron rods, to allow 
the assembled and well-clamped mold 
to dry for several days, to place the 
mold in the upright position and pour 
liquid casting plaster through a hole at 
the vertex, and to use exceptional care 
in removing the mold. 

Employing materials that eliminated 
the need for greasing either the sub- 
ject or the mold, Hixon (1958) made 
facial casts for three young adults. In 
mold preparation, tubes were placed 
in the nares, an alginate-base hydrocol-
loid (Jeltrate) was applied on the face
to a depth of approximately 1.5 cm.,
and the flexible Jeltrate was then
covered with a fast-setting plaster for
reinforcement. After removal of this
mold from the face, a hydrocal cast
was poured.¹

The reported findings on child 
growth derived from casts of the dental 
arches are in part the outcome of
planned research and in part the by-
products of dental practice. Many
of the by-products of dental prac-
tice have the value inherent in the in-
dividual instance but lack the gen-
eralization possibilities attainable by
purposeful sampling and statistical tests. 
Johnson (1937) reported eight child- 
hood instances of dental malocclusions 
associated with thumb-sucking, lip-bit- 
ing, lip-sucking, tongue-thrusting, and 
abnormal swallowing. Instances of 
medial and lateral tooth movements in
the absence of orthodontic treatment 
were reported by Ainsworth (1935) 
and Spiller (1913). Ainsworth’s subject 
showed change between ages 6 and 10 
years from crowded mandibular in- 
cisors to regular spacing; Spiller’s sub- 
ject showed change between ages 7 and 
16 years from widely separated mandib- 
ular incisors to regular spacing. Rogers 
(1922) reported two childhood in- 
stances of increase in dental-arch width 
associated with a period of special ex- 
ercise of the masseter-temporal muscles. 
Seriate measures of dental-arch width 
were reported by Colyer (1920) on 
three normal, healthy children ex-
amined seven or eight times between
ages 4 and 12 years.


¹ Hydrocolloid and hydrocal are discussed
on p. 222, together with recommended pro-
cedures in their use.

In cross-sectional investigations casts
of the gum pads or dental arches have 
been used by Freiband (1937) in de- 
termining size and shape of the palate 
at different ages between the third 
prenatal month and term; Clinch and 
Young (1934), in determining size and 
relationship of the maxillary and man- 
dibular gum pads at birth; Goldberg 
(1929), in comparing size of the tooth 
crowns in identical twins at 3 years to 
early adulthood; Cohen (1935), in com- 
paring volume and surface area of the 
tooth crowns in the deciduous and 
permanent dentitions; Meredith and 
Hixon (1954), in determining fre- 
quency, size, and bilateralism of Cara- 
belli’s tubercle in children 9 years of 
age; Channing and Wissler (1908), in 
determining dimensions of the palate in 
feeble-minded children, adolescents, and 
young adults; Moorrees and Reed 
(1954), in studying crowding and spac- 
ing of the teeth in young adult females; 
and Koski and Hautala (1952), in 
studying incidence of shovel-shaped 
incisors in young adults of both sexes. 

In longitudinal investigations casts of 
the dental arches have been utilized by 
Lewis and Lehman (1932) and Moor- 
rees et al. (1957) for investigating re- 
lationships between deciduous and suc- 
cedaneous teeth; Cohen (1940) and 
Sillman (1951), for investigating lateral 
development of the dental arches; Bar- 
row and White (1952), for studying 
age changes in dental occlusion; Gold- 
stein and Stanton (1936a, 1938), for 


*Hixon and Oldfather (1958) made joint 
use of casts and roentgenograms in develop- 
ing a procedure for predicting the size of 
the crowns of the permanent canine and 
premolar teeth.

studying associations between dental
occlusion and movements of the teeth 
anteroposteriorly and laterally; Hol- 
comb and Meredith (1956) and Mere- 
dith and Hopp (1956), for studying in- 
dividual differences in size and growth 
rate of the dental arches. All of these 
research publications deal with dental 
ontogenesis during the first two decades 
of postnatal life, four of them using 
seriate casts covering periods of more 
than a decade. 

The validity with which molds and 
casts reproduce anatomic structures de- 
pends upon the materials and methods 
used and, with the living organism, 
upon the cooperation of the subject. 
In the study of the gum pads and 
dental arches molds have been made 
from plaster of Paris (Goldstein and 
Stanton, 1936a), modeling compound 
(Burson, 1952), wax (Bakwin and 
Bakwin, 1936), and hydrocolloid (Bar- 
row and White, 1952); casts have been 
prepared from plaster of Paris (Gold- 
berg, 1929) and hydrocal (Freiband, 
1937). 

Wax is used acceptably to obtain 
molds of the gum pads on infants; it 
lends itself to rapid use, and the gum 
pads are not complicated topographi- 
cally. When undercuts and complex 
formations are to be reproduced, as in 
securing molds of the dental arches, the 
elasticity of the newer hydrocolloids is 
needed. Skinner and Hoblit (1956) 
have found that the agar-base and 
alginate-base hydrocolloids, used in 
conformity with manufacturer’s speci- 
fications, each give highly accurate re- 
sults. For obtaining molds of the dental 
arches on children, alginate-base hydro-
colloid is preferable, since its setting
time is shorter (Skinner and Carlisle,
1956). Alginate-base hydrocolloid is a 
powder which, on the addition of wa- 
ter, becomes a plastic material and, 
within a period of ten minutes, passes 
from the plastic state through an elastic 
state to a fairly rigid state. 

After hydrocolloid has been mixed 
with water, it is carried to the mouth 
in a dental impression tray. Impression 
trays are made in a variety of shapes 
and sizes; the one chosen for making 
a particular mold should be well suited 
to the child’s dental arch. Phillips and 
Ito (1951), on the basis of extensive ex- 
perimental work, conclude that in the 
use of alginate-base hydrocolloid (1) 
the impression tray should be centered 
on the teeth, (2) the mold should be 
removed from the teeth with more of 
a jerking than a teasing movement, and 
(3) the cast should be poured within 
fifteen minutes after withdrawal of the 
mold from the mouth. 

As sources of measurement data, hy- 
drocal (dental stone) casts are superior 
to plaster of Paris casts, since, when 
set, they are much harder (Skinner, 
1954). At the time hydrocal is poured 
the mold should be vibrated to assist in 
removal of trapped air and the pouring 
should proceed at a slow rate and with 
approximately even dispersion (Phil- 
lips, Price, and Reinking, 1953). The 
dimensional change of hydrocal casts 
during setting is less than 0.1 per cent,
and under ordinary conditions of tem- 
perature and humidity there is no 
change once the setting reaction is 
completed (Skinner, 1954). 

Using alginate-base hydrocolloid for 
molding, hydrocal for casting, and the 
procedural stipulations listed above, it
is possible to produce casts that are
highly satisfactory as sources for metric 
data representing the dental arches and 
their units. Meredith and Cox (1954) 
measured the minimum distance be- 
tween the maxillary first molars (1) on 
casts of this type and (2) directly in 
the mouth. Their subjects were five 
children, 9 years of age. Both investi- 
gators took series of measurements on 
each child and each cast, measuring to 
the nearest 0.01 cm. The ten averages
for direct measurement did not differ 
systematically from those for cast 
measurement. 
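
A comparison of this kind can be sketched as a paired-difference check. The Python sketch below is illustrative only: the ten pairs of averages are invented values, not the data of Meredith and Cox, and the function name is hypothetical. It reports the mean signed difference (direct minus cast) and the number of differences in each direction; a mean near zero with mixed signs is what the absence of a systematic difference would look like.

```python
from statistics import mean

def paired_difference_summary(direct, cast):
    """Summarize signed differences (direct minus cast), in centimeters."""
    if len(direct) != len(cast):
        raise ValueError("each direct average needs a matching cast average")
    # Round to the recording unit (0.01 cm) to suppress floating-point noise.
    diffs = [round(d - c, 2) for d, c in zip(direct, cast)]
    return {
        "mean_difference": round(mean(diffs), 3),
        "direct_larger": sum(1 for x in diffs if x > 0),
        "cast_larger": sum(1 for x in diffs if x < 0),
        "equal": sum(1 for x in diffs if x == 0),
    }

# Hypothetical averages in cm (two investigators x five children);
# these are NOT values from the study cited in the text.
direct = [3.21, 3.35, 3.10, 3.42, 3.28, 3.22, 3.34, 3.11, 3.41, 3.29]
cast = [3.22, 3.34, 3.10, 3.43, 3.27, 3.21, 3.35, 3.11, 3.40, 3.30]

summary = paired_difference_summary(direct, cast)
print(summary)
```

A fuller analysis would also test whether the mean difference departs from zero by more than chance; the tally above is only a first inspection.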

On several occasions source mate-
rials have been devised in which pho-
tography complemented molding and 
casting. Sawyer, Stone, and DuBois 
(1916) flattened pieces of a cloth-paper 
mold on photographic paper, exposed 
the paper to the sun, and obtained areal 
duplicates less difficult to measure than 
the mold areas. Goldberg (1929) de- 
rived a source for measuring tooth size 
by photographing plaster casts and im- 
posing a millimeter grid on the prints. 
A combination of casting, photogra- 
phy, and tracing was used by Speck 
(1950). Casts of the mandibular dental 
arch were photographed, the negatives 
projected to a common enlargement 
(3X), and tracings made of the en- 
larged tooth images. The number of 
steps from the teeth of the child to 
Speck’s indirect source for metric data 
is seen on specifying the sequence, child 
to mold to cast to photographic en- 
largement to tracing. 

This section has discussed five 
sources of data: the living child, the 
child cadaver, the roentgenogram, the 
photograph, and the cast. For each, in
turn, an attempt has been made to in-
dicate both the usefulness of the source
and the methodologic problems con- 
fronted in its use. It should be clear 
that no source is satisfactory for all 
purposes; conversely, each source has 
pre-eminence in the study of some 
aspects of anatomic ontogenesis. 


TYPES OF DATA 


Whatever source or combination of 
sources is employed, the investigator 
may obtain ratings and/or measure- 
ments. Ratings are made with a de- 
scriptive scale and measurements with 
a metric scale. For some purposes, in- 
spectional judgments and/or metric 
readings are analyzed directly; for other 
purposes, these data are utilized to de- 
rive composite ratings, ratios, or incre- 
ments. 

An important function of the in- 
vestigator is to decide upon the kind 
of scale to be used in a particular study. 
Sometimes, as in designing an investi- 
gation on anatomic form, there is an 
early planning stage in which the feasi- 
bility of a rating approach is clear, 
while the practicability of a measure- 
ment alternative is obscure. It is wise 
to act on the principle that ordinal data 
should not be gathered without 
thoughtfully considering a way to 
study the problem metrically (Thorn- 
dike, 1951). In the event a choice ex- 
ists, metric data should be amassed in 
preference to inspectional data. This 
follows from regarding maximization 
of objectivity and mathematic adapta- 
bility as desiderata in science (Wilson, 
1952). 

Ratings. A rating is a datum obtained
with the aid of a descriptive criterion
or scale. If a scale is used, two or more 
categories are defined, and each subject 
is assigned the symbol of the category 
he best fits; if a criterion is used, a 
single category is defined, and each 
subject is assigned a symbol denoting 
the time at which the criterion occurs. 
The latter is illustrated by defining the 
category “erupted” with the criterion: 
a tooth is considered erupted at the 
time it has ruptured the gingiva suffi- 
ciently to expose an area of tooth 
enamel 0.1 centimeter in diameter.¹

The number of classes or categories 
employed in studies of human struc- 
tural variation has extended from the 
single class (Pyle and Sontag, 1943), 
through the partitioning of a con- 
tinuum into as many as 20 classes 
(Schonfeld and Beebe, 1942), to a set 
of tricomponent categories numbering 
more than 70 (Sheldon, Stevens, and 
Tucker, 1940). Identified in inverse 
order, the three illustrative references 
deal with varieties of external body 
form, differences in volume of the 
testes, and differences in age at which 
a specified stage of ossification is 
reached. 

Criterion ratings have been used 
widely in estimating the time of oc- 
currence of specific anatomic events. 
They facilitate investigation of onto- 
genetic sequences for the acquisition 
and resorption of tissues, the addition 
and discard of organs, and the appear-


¹ In rating by a sole criterion a three-
category scale is implied. There is a middle
“age of occurrence” category (for which
records are made), and there are two un-
used bordering categories representing ages
“earlier” and “later” (Crampton, 1908a).

ance and loss of structures (Garn,
Lewis, and Shoemaker, 1956). They 
help provide answers to questions such 
as when is mesoderm derived, the liver 
differentiated, the pronephros obliter- 
ated? When are the limb buds acquired, 
the fingers individuated, the touch pads 
lost? In what order are muscles 
formed, bone centers deposited, de- 
ciduous teeth shed? 

Multicategory ratings have been 
utilized in studying the number, size, 
and form of structures. Examples are 
counts of the number of ossified carpal 
and tarsal centers (Wallis, 1931); 
counts of the number of permanent 
third molar teeth congenitally missing, 
impacted, and erupted (Hellman, 
1936); estimates of the number of pig- 
mented axillary and pubic hairs as few, 
moderate, and many (Pryor, 1936); 
ratings of the vertebral border of the 
scapula as convex, straight, and concave 
(Graves, 1925); ratings of the male 
genitalia as small, average, and large 
(Kubitschek, 1932); and classification 
of mammary hypertrophy in adolescent 
boys as absent, slight, moderate, and 
marked (Jung and Shafton, 1935). 

Rating categories have been defined 
verbally, pictorially, and by samples or 
models. Three classes of dental occlu- 
sion were described verbally and pic- 
torially by Angle (1900), three stages 
of pubic-hair growth were described 
verbally by Crampton (1908a), four
stages of female breast development 
were defined verbally and pictorially 
by Stratz (1909), five categories of 
lumbar lordosis were defined pictorially 
by Robinow, Leonard and Anderson 
(1943), and five stages of genital and 
body-hair growth were described ver-
bally and pictorially by Greulich, et
al. (1942). Thirty samples of head hair 
were arranged in a Haarfarbentafel by 
Eugen Fischer (Martin, 1928), 20 glass 
specimens simulating eyes were set in 
an Augenfarbentafel by Rudolf Martin 
and Bruno Schultz (Martin and Saller, 
1956), and 23 hard-rubber models simu- 
lating testes were graded in an orchido- 
meter by Schonfeld and Beebe (1942). 

The Schonfeld-Beebe orchidometer 
was used to estimate testis volume at 
successive ages from birth to early
adulthood. Estimates were made on the
living child by palpating a testis with
one hand and, with the other hand, se-
lecting the model judged to have the
same volume. The validity of this rat-
ing procedure was investigated with
respect to reliability and relevance.¹ In
studying relevance, the volume of each 
of 400 clay models made to simulate 
testes of varying size was (1) rated 
with the orchidometer and (2) meas- 
ured by water displacement. There was 
a slight tendency for small volumes to 
be overestimated and large volumes, 
underestimated. Reliability was studied 
from two series of ratings for 65 boys, 
11 and 12 years of age; in a few in- 
stances the estimates were in error by 
as much as 20 per cent. 

The number of categories used in in- 
vestigating a given growth phenome- 
non has varied in different studies. In 
studies of metaphyseal obliteration four 
categories were used by Stevenson 
(1924) and nine categories by Todd 
(1937). In studies of tooth calcifica- 
tion four stages of root development 

¹ A datum is valid to the extent that chance
error is minimized (reliability) and systematic
error eliminated (relevance).

were defined by Bengston (1935) and
eight stages by Gleiser and Hunt 
(1955). Greenwood (1891) employed 
a four-category scale for rating hair 
color; the scale developed by Eugen 
Fischer has 30 classes. 

Category variation has occurred par- 
ticularly in the investigation of tooth 
eruption. There have been numerous
studies based on the age at which a 
criterion stage is attained (Meredith, 
1946a; Falkner, 1957; Hurme, 1948; 
Leslie, 1951). Suk (1919) utilized three 
categories: gingiva not ruptured, tooth 
erupting, and tooth fully erupted. Four 
categories were used by Cattell (1928): 
gingiva not pierced, crown less than 
one-third exposed, crown between one- 
and two-thirds exposed, and crown 
more than two-thirds exposed. James 
and Pitts (1912) employed a six-step 
scale: gingiva not ruptured, tooth just 
erupting, crown less than half erupted, 
crown more than half erupted, crown 
almost fully erupted, and tooth in oc- 
clusion. On roentgenograms, Bengston 
(1935) made use of five categories: 
morsal surface of crown more than 0.20 
centimeter beneath the alveolar crest, 
0.20 to 0.05 centimeter beneath crest, 
breaking through crest, midway be- 
tween gingival crest and line of occlu- 
sion, and in occlusion. 

Careful attention must be given to 


¹ Gleiser and Hunt (1955), in an investiga-
tion of the mandibular permanent first molar
teeth, found alveolar eruption to precede 
gingival eruption by an average of ten 
months. In a study of the permanent canine, 
premolar, and second molar teeth Hellman 
(1930) found gingival emergence to ante- 
date full eruption by periods of seven months 
(maxillary second premolar) to fourteen 
months (maxillary canine).

the definition of categories. Crampton
(1908a, 1908b) devised a three-stage 
classification of pubic-hair develop- 
ment. His intermediate category is de- 
fined in the first paper to include chil- 
dren with any amount of pigmented 
hair less than a dense covering; in the 
second paper it is defined to include 
children with heavy down and those 
with nonkinky pigmented hair. An 
earlier ontogenetic period is encom- 
passed by the second definition than 
by the first. Several months elapse, 
sometimes more than a year, between 
the time a child has some pigmented 
hairs with the characteristic kink or 
twist, and the time he shows a dense 
covering of hair. 

Single category ratings for tooth 
emergence were used by Gleiser and 
Hunt (1955) and Leslie (1951). In the 
first study the criterion was the initial 
piercing of the alveolar bone, in the 
second it was the initial piercing of 
the gingival tissue. McKern and Stew- 
art (1957) noted similar systematic dif- 
ferences in criterion definitions of 
diaphyso-epiphyseal union. Meredith
(1940) called attention to the need for 
explicit definition of portions of three 
scales used in a study by Sheldon, 
Stevens, and Tucker (1940).²

The procedure in defining categories 
and testing their objectivity can be il- 
lustrated by reference to a study by 
Meredith and Hixon (1954). Major 
steps in this study included (1) ex- 
amining 100 maxillary permanent first- 
molar teeth and making notations on 
the furrows, pits, and protuberances in 
the region of Carabelli’s tubercle, (2) 
using these notations to derive a six-

² See also Humphreys (1957).

category descriptive scale, (3) testing
the usability of this scale, (4) altering 
the number of categories and revising 
the definitions, and (5) determining the 
reliability of the revised four-category 
scale. Two investigators, working in- 
dependently, rated a series of 200 teeth; 
there was 73 per cent agreement on 
category placement, and all of the dis- 
crepancies except one were between 
adjacent categories. 
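
The agreement figures reported for such a reliability check can be computed as in the following Python sketch. It assumes the categories are coded as consecutive integers, so that "adjacent" means a difference of one; the ratings shown are invented for illustration and the function name is hypothetical.

```python
def agreement_summary(ratings_a, ratings_b):
    """Percent exact agreement between two raters, plus a tally of
    disagreements that fall in adjacent versus more distant categories."""
    if len(ratings_a) != len(ratings_b):
        raise ValueError("both raters must rate the same specimens")
    if not ratings_a:
        raise ValueError("at least one rated specimen is required")
    n = len(ratings_a)
    pairs = list(zip(ratings_a, ratings_b))
    exact = sum(1 for a, b in pairs if a == b)
    adjacent = sum(1 for a, b in pairs if abs(a - b) == 1)
    return {
        "percent_agreement": 100.0 * exact / n,
        "adjacent_discrepancies": adjacent,
        "wider_discrepancies": n - exact - adjacent,
    }

# Hypothetical independent ratings of ten teeth on a four-category
# scale coded 1 to 4 (not data from the study cited in the text).
rater_a = [1, 2, 2, 3, 4, 1, 3, 2, 4, 3]
rater_b = [1, 2, 3, 3, 4, 1, 3, 1, 4, 1]

result = agreement_summary(rater_a, rater_b)
print(result)
```

Percent agreement takes no account of agreement expected by chance; it is used here only because it is the statistic quoted in the text.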

Categories that have been defined 
precisely may not yield dependable 
ratings. Robinow, Leonard, and Ander- 
son (1943) used rear-view photographs 
to define a six-stage scale for knock- 
knees and side-view photographs to 
define a five-stage scale for slumped- 
back. Ratings were found to be con- 
siderably more reliable for knock-knees 
than for slumped-back. Seemingly, the 
child assumed much the same lower- 
limb posture on different occasions, 
whereas his trunk posture varied ap- 
preciably. 

Ratings frequently involve interpola- 
tion (Pyle and Sontag, 1943). Assume 
that a criterion for appearance of os- 
sification centers has been defined in 
an unambiguous and precise manner, 
e.g., a center is considered present at
the time its density makes it clearly 
identifiable on the roentgenogram and 
its diameter approximates 0.1 centime- 
ter. Further, assume that roentgeno- 
grams have been obtained at semian- 
nual ages, and a particular center has 
been found absent at one age and 0.3 
centimeter in diameter at the succeed- 
ing age. An estimate for age of appear- 
ance of the center can be derived only 
by interpolation. It follows that esti- 
mates of this type become more de-
pendable as the interval between ex-
aminations is decreased. Garn, Lewis,
and Shoemaker (1956), in a study of 
the sequence of calcification for man- 
dibular teeth, noted that interpolation 
is less difficult in estimating the order 
in which teeth begin to calcify than 
in making close estimates of the time 
at which they begin to calcify. 
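
One way of carrying out the interpolation described above is sketched below in Python. The sketch rests on a strong simplifying assumption that is not part of the original discussion: the center is taken to have zero diameter at the earlier examination and to enlarge at a constant rate until the later one. The function name and the ages used are hypothetical.

```python
def interpolated_age_of_appearance(age_absent, age_present,
                                   diameter_present, criterion=0.1):
    """Estimate the age (years) at which a center reached the criterion
    diameter (cm), assuming linear growth from zero at age_absent."""
    if age_present <= age_absent:
        raise ValueError("the later examination must follow the earlier one")
    if diameter_present < criterion:
        raise ValueError("center has not yet reached the criterion size")
    # Fraction of the interval needed to grow from zero to the criterion.
    fraction = criterion / diameter_present
    return age_absent + fraction * (age_present - age_absent)

# Center absent at 4.0 years; 0.3 cm in diameter at 4.5 years
# (semiannual examinations; the ages are illustrative).
estimate = interpolated_age_of_appearance(4.0, 4.5, 0.3)
print(round(estimate, 2))
```

Whatever growth assumption is adopted, the estimate cannot be in error by more than the length of the interval between examinations, which is why shorter intervals yield more dependable estimates.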

Many problems investigated initially 
by means of ratings have been sub- 
jected at a later time to metric inves- 
tigation. These transitions from inspec- 
tional data to measurement data con- 
stitute a major expression of method- 
ologic advancement in the study of 
anatomic ontogenesis. Schwartz, Britten, 
and Thompson (1928) replaced ratings 
with measurements in a study of pos- 
ture. By measuring selected angles and 
distances on lateral-view photographs, 
they secured quantitative data for pos- 
terior extension of the shoulders be- 
yond the hips, anterior extension of the 
abdomen beyond the thorax, relation 
between the long axes of the trunk and 
lower limbs, and stoop of the shoulders. 
A procedure for quantitatively express-
ing the amount of flat-footedness in
children was developed by Irwin 
(1937). He found the procedure to be 
highly reliable and recommended its 
substitution for ratings of flat-footed- 
ness. 

A metric method of studying man- 
dibular tooth eruption in childhood was 
devised by Carlson (1944). Curves in- 
dicating the amount and rate of erup- 
tion for different teeth during a ten- 
year period were derived from seriate 
radiographs by measuring the distance 
from the highest point of a tooth to 
the lowest point of the mandible di-
rectly beneath the tooth. Continuing in
the area of dentition, procedures have 
been developed by Goldstein and 
Stanton (1936b) for measuring the 
amount of dental overbite and by 
Moorrees and Reed (1954) for meas- 
uring crowding and spacing of the 
teeth. 

In the area of ossification Kelly 
(1937), Meredith (1957b), and others 
have developed methods of data collec- 
tion that substitute measurements for 
ratings. Meredith developed a quantita- 
tive method of studying change in the 
profile of the osseous chin during child- 
hood. Kelly (1) reviewed and ap- 
praised several metric approaches to 
osseous growth of the manus devel- 
oped during the period 1920-1935 and 
(2) devised a practical and highly re- 
liable measure designated the CM/WD 
index. The CM/WD index expresses 
the attained size of 14 carpal and epi- 
physeal diameters in relation to a meas- 
ure of wrist size. 

Similarly, in the area of physique 
numerous measurement proposals have 
been made. Many of those made prior 
to 1935 have been colligated and evalu- 
ated by McCloy (1936). Davenport 
(1925) computed an index of over-all 
build (chest girth/stature) at successive 
ages from birth to early adulthood. He 
found that the typical child becomes 
slenderer with age, during infancy and 
childhood, then increases slightly in 
stockiness throughout adolescence. 

Meredith and Culp (1951) and Mere- 
dith and Sherbina (1951) studied three 
slender-to-stocky continua represent- 
ing the upper limb (arm girth/upper 
limb length), lower limb (leg girth/ 
lower limb length), and body stem
(chest girth/stem length). They found
that in childhood there is a low rela-
tionship between stockiness of the body
stem and stockiness of each limb (r’s
ca. 0.4), and no more than a moderate
association between upper-limb stocki-
ness and lower-limb stockiness (r’s ca.
0.7).
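
The derivation of such girth/length indices, and of the correlation between two of them, can be sketched in Python as follows. The measurements are invented for illustration, the function names are hypothetical, and the product-moment coefficient is written out by hand rather than taken from a library.

```python
from math import sqrt

def stockiness_index(girth_cm, length_cm):
    """Girth divided by length; larger values indicate a stockier segment."""
    if length_cm <= 0:
        raise ValueError("length must be positive")
    return girth_cm / length_cm

def pearson_r(x, y):
    """Product-moment correlation coefficient for two equal-length series."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two series of equal length, n >= 2")
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("a series with no variation has no correlation")
    return sxy / sqrt(sxx * syy)

# Hypothetical records, in cm:
# (arm girth, upper-limb length, leg girth, lower-limb length)
children = [
    (17.0, 50.0, 26.0, 62.0),
    (18.5, 52.0, 27.5, 64.0),
    (16.0, 51.0, 24.5, 63.0),
    (19.0, 49.0, 28.0, 60.0),
    (17.5, 53.0, 25.5, 66.0),
]

upper = [stockiness_index(ag, ul) for ag, ul, lg, ll in children]
lower = [stockiness_index(lg, ll) for ag, ul, lg, ll in children]
r_upper_lower = pearson_r(upper, lower)
print(round(r_upper_lower, 2))
```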

The same three ratios, plus two ra- 
tios portraying shoulder width in rela- 
tion to hip width and chest girth in 
relation to abdomen girth, were studied 
by Trim and Meredith (1952) and 
Meredith and Meredith (1953) on girls 
7, 9, and 11 years of age and boys 7, 
10, and 15 years of age. After finding 
that the shoulder-hip and chest-abdo-
men ratios were not highly associated,
five-variable frames of reference were 
constructed for the different age-sex 
groups¹ and illustrations were given of
their usability in securing physique pro- 
files for individual children. It was rec- 
ommended that these frames of refer- 
ence be lengthened to include more 
aspects of body form and that addi- 
tional frames be developed in order to 
provide metrically based normative dis- 
tributions for each consecutive annual 
age during childhood and adolescence. 
Other variables suggested for inclusion 
were ratios depicting head form (Bay- 
ley, 1936), face form (Goldstein, 1936), 
chest form (Meredith and Knott, 1937), 


¹ It will suffice to describe this construction
with respect to one variable and a single age- 
sex group. A reference frame was developed 
by determining the tenth, thirtieth, seventieth, 
and ninetieth percentiles of the distribution 
for the upper-limb index on 4-year-old boys 
and then employing these percentiles to 
subdivide the distribution into five successive 
parts named Slender, Moderately Slender, 
Average, Moderately Stocky, Stocky.

and the relation of lower-limb length
to stem length (Meredith and Knott, 
1938). Heath (1945), in a study of 
young adult males, derived profiles of 
body form based on a somewhat differ- 
ent selection of ratios. 
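
The percentile construction of a frame of reference (tenth, thirtieth, seventieth, and ninetieth percentiles dividing a distribution into five named parts) can be sketched in Python as follows. Several details are assumptions made for the sketch and are not specified in the studies cited: the percentile convention (linear interpolation between ordered values), the assignment of a value lying exactly on a cutoff to the higher class, the function names, and the reference values themselves.

```python
LABELS = ["Slender", "Moderately Slender", "Average",
          "Moderately Stocky", "Stocky"]

def percentile(sorted_values, p):
    """Percentile by linear interpolation between ordered values
    (one of several conventions in use)."""
    if not sorted_values:
        raise ValueError("no values supplied")
    k = (len(sorted_values) - 1) * p / 100.0
    lo = int(k)
    hi = min(lo + 1, len(sorted_values) - 1)
    return sorted_values[lo] + (k - lo) * (sorted_values[hi] - sorted_values[lo])

def build_frame(index_values):
    """Return the four cutoffs bounding the five classes."""
    ordered = sorted(index_values)
    return [percentile(ordered, p) for p in (10, 30, 70, 90)]

def classify(value, cutoffs):
    """Name the class in which an index value falls."""
    for label, cutoff in zip(LABELS, cutoffs):
        if value < cutoff:
            return label
    return LABELS[-1]

# Hypothetical upper-limb indices for one age-sex group.
reference_indices = [0.31, 0.33, 0.34, 0.35, 0.35, 0.36,
                     0.36, 0.37, 0.38, 0.40, 0.42]
cutoffs = build_frame(reference_indices)
profile_entry = classify(0.36, cutoffs)
print(cutoffs, profile_entry)
```

Repeating the classification for each of the five indices gives a physique profile for an individual child.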

Measurements. A measurement is a
datum obtained by means of an instru- 
ment or apparatus having a scale gradu- 
ated in equal intervals. Nearly all of 
the instruments used in the study of 
anatomic ontogenesis have scales that 
begin at an absolute zero point and are 
graduated in units of the metric system. 

Instruments of many kinds and va- 
rieties are available for measuring body 
structures. These include slide-type 
and spread-type calipers² for measuring
diameters (Steggerda, 1949³), metric
tapes for measuring girths (McCloy, 
1936), planimeters for measuring areas 
(Baldwin, Busby, and Garside, 1928), 
platform-type scales for measuring 
weights (Goldstein, 1937), scaled tanks 
for measuring volumes (Tilton, 1930), 
and variously named instruments for 
measuring angles (Schultz, 1929; Salz- 
mann, 1945). 

The development of anthropometric 
precision instruments has occurred 
largely in the period since 1830. Plat- 
form-type weighing scales were pro- 
duced first in 1831 (Goldstein, 1937), 
and spread-type calipers, a few years 
later (Woodbury, 1952). Between 1850 


² Slide calipers have parallel arms, one fixed
at right angles to a calibrated scale and the 
other movable along the scale. Spread calipers 
have hinged arms, with the scale attached to 
one and crossing the other at a set level. 

³ See also the catalog of “GPM Anthro-
pological Instruments” available from Siber
Hegner and Company, 183 Madison Avenue,
New York 16, N. Y.

and 1870 a number of metrically scaled
boards, rods, and calipers were designed 
for anthropological research on skeletal 
material and living adults; Paul Broca 
was a leading contributor to this im-
provement of the equipment armamen-
tarium (Hrdlicka, 1920). The develop-
ment of instruments especially adapted
for measuring fetuses and young chil- 
dren began shortly before the end of 
the nineteenth century (Chapin, 1894) 
but received scant attention prior to 
1920. In the period 1920-1940 a variety 
of these instruments was described by 
Schultz (1920), Baldwin (1921),
Thompson (1929), Wilson et al.
(1930), Carey (1936), Cates and Good- 
win (1936), Stuart (1939) and Knott 
(1941). 

The design and production of im- 
proved measurement tools continues. 
During the last decade Tanner and 
Weiner (1949) have described spread 
calipers with a scale × 5 enlargement;
Woodbury (1952) and Tanner and 
Whitehouse (1955) have described 
dial-type spread calipers; Tanner and 
Whitehouse (1957) have described 
counter-type slide calipers; and Karpo- 
vich (1951) has described a modified 
tape for measuring trunk girths. 

Research pertaining to the selection 
of instruments has been reported by 
Whitacre (1934) and Gray (1935).
Two laboratory instruments commonly 
used in measuring stature are the fixed 
vertical rod and the fixed vertical board. 
Whitacre measured 400 children by 
each method in succession and 25 chil- 
dren by each method on two different 
days. All positioning was the same ex- 
cept that when the rod was used the 
child “stood free” and when the board
was used his heels, buttocks, and shoul-
ders were in contact with it. Compared
with the measures by rod, the measures 
by board were systematically larger 
and considerably more reliable. This 
study illustrates the principle that one 
way to increase reliability is by con- 
trolling the posture of each body seg- 
ment included in a dimension. 

Gray’s investigation shows how in- 
strument selection can be related to 
what is measured anatomically. Depth 
(anteroposterior diameter) of the chest 
was determined on 370 adult males by 
using both slide and outside-spread 
calipers.¹ The readings were larger for
the slide calipers than for the spread 
calipers by an average of 10 per cent. 
This systematic discrepancy was a 
function of the contour of the chest 
and the manner in which the instru- 
ments made contact with it. The ends 
of the spread calipers contacted points 
in the midsagittal plane; the arms of the 
slide calipers contacted points in more 
lateral planes on the posterior and an- 
terior chest walls. Succinctly, the one 
instrument measured chest depth in the 
mid-line and the other measured maxi- 
mum chest depth. 

Prior to research use, every piece of 
metric equipment should be sent to the 
Bureau of Standards for testing or 
checked against a scale that has been 
sent for testing. It is courting bias to 
assume that commercially available 

¹ Outside-spread calipers have arms that are
curved first outwardly and then inwardly to
facilitate contact with external points. There
are also inward-spread calipers that are curved
to facilitate contact with internal points and
slide calipers with special adaptations for
contacting both internal and external points
(Ashley-Montagu and Trevor, 1937).

products arrive at the laboratory in
correct calibration; this is especially 
true of linen tapes, wood meter sticks, 
and inexpensive metal calipers. Nor 
should the testing of instruments made 
or modified in the laboratory be over- 
looked. 

Instrument maintenance is no less im- 
portant than initial testing. This in- 
cludes careful handling, protective 
storage when not in use, and re-testing. 
In particular, there should be frequent 
checking of weighing scales, tapes, and 
instruments with slender arms or 
pointed ends. Meticulous attention to 
measuring equipment is one of the 
principal avenues by which investiga- 
tors can seek to exclude systematic er- 
ror from metric data. 

Diameters of body structures are 
obtained between specified termini, 
angles are determined by reference to 
selected lines, and girths are taken in 
chosen planes. The points that define 
these termini and lines and planes are 
called landmarks. 

The importance of landmarks is at- 
tested by the fact that they are given 
primary emphasis in two international 
agreements. These agreements, adopted 
at Monaco in 1906 and Geneva in 1912, 
list approximately 100 body dimensions 
and recommend landmarks for securing 
them (Hrdlicka, 1920). The Monaco 
document pertains to landmarks in the 
head region, the Geneva document to 
landmarks on the trunk and limbs.
Those who investigate anatomic onto- 
genesis are obligated to become familiar 
with these agreements and to follow 
their recommendations except as de- 
partures are indicated by methodologic 
advances and/or problem requirements. 
Landmarks are defined in several
ways. The landmark “vertex” is defined 
in terms of an orientation: it is the 
highest point of the skull when the 
lowest point on the external border of 
the left orbit and the highest points on 
the external borders of the right and 
left acoustic meatuses are in a plane at 
right angles to the long axis of the body 
(Tildesley, 1931). Many landmarks are 
defined in terms of skeletal projections, 
margins, or appositions; e.g., menton is 
the lowest point on the mandible in the 
mid-line, iliocristale is the most lateral 
point on the crest of an ilium of the 
pelvis, bregma is the point of juncture 
of the coronal and sagittal sutures on 
the external skull, and nasion is the ex- 
ternal point of junction of the nasal 
and nasofrontal skull sutures. These 
definitions serve useful purposes in that 
they (1) specify sites for instrument 
application and (2) lead to terse iden- 
tification of dimensions. Notice the 
abridgment, without loss of specificity, 
in stating that measures were made for 
vertex-menton diameter, bregma-nasion 
diameter, and bi-iliocristal diameter. 

Some measurement sites are defined 
instrumentally. In measuring leg girth 
(calf circumference) a tape is passed 
around the leg at right angles to its 
long axis and a number of applications 
are made at different levels until the 
largest girth is found. In measuring 
maximum head width, the ends of out- 
side-spread calipers are placed against 
the sides of the head and moved about 
horizontally until the greatest trans- 
verse diameter is found. To assign 
names to sites that are instrumentally 
located is needlessly encumbering; e.g., 
it is more parsimonious to state that
maximum head width was measured 
than to assign the name “euryon” to 
an end-point of this dimension (Martin, 
1928) and state that bi-euryon diameter 
was measured.

Nasion is one of the commonly used 
landmarks. It serves as a terminus for 
many vertical, anteroposterior, and
oblique dimensions obtained on the 
roentgenogram and skull. Although it 
cannot be directly located on the liv- 
ing child or photograph, its estimated 
site is employed in measuring nose 
height, face height, and other facial 
diameters. A highly valid procedure for 
estimating the location of nasion was 
discovered by Ashley-Montagu (1937):
preliminary experimental work sug- 
gested that nasion, considered with re- 
spect to its placement vertically and 
transversely, is situated at the point at 
which a line tangent to the highest 
points of the superior palpebral sulci 
intersects the midsagittal plane. This 
hypothesis was tested on ten adolescent 
boys and 115 adult cadavers of both 
sexes. The method with the cadavers 
was to bore a hole 0.2 centimeter in
diameter through the soft tissues at the 
point postulated, dissect the nasofrontal 
region, and observe whether the hole 
penetrated nasion. With the living sub- 
jects, a piece of wire was placed at the 
level stipulated, and a norma lateralis 
roentgenogram taken. Nasion was 
found to be located correctly on 98 
per cent of the cadavers and all of the 
living subjects. From studying soft tis- 
sue and skeletal relationships in the re- 
gion of the external ear, Ashley-Mon- 
tagu (1938) also provided experimental
support for a procedure used in esti-
mating the location of vertex on the
living subject.* Further research on the
determination of these and other land- 
marks is needed; for example, the as- 
sociations found by Ashley-Montagu 
should be investigated, using young 
children as subjects. 

Not infrequently bilateral structures 
are of unequal size. One shoulder may 
be higher than the other, one arm 
thinner than the other, one hand 
broader than the other, one leg shorter 
than the other, one ankle thicker than 
the other, one foot longer than the 
other (Davenport, Steggerda, and 
Drager, 1934). Studies of correspond- 
ing dimensions on the right and left 
limbs have shown both individual and 
group differences (Morton, 1886; 
Bjelke-Petersen, 1902; Van Dusen, 
1939). Consequently, it is necessary to 
identify the side on which a measure- 
ment is taken. The Geneva agreement
recommends that whenever plans are 
made to collect data on one side only 
the left side should be measured. 

In applying calipers and tapes to 
nonrigid structures, the anthropome- 
trist is confronted with the task of 
controlling the degree to which tissues 
are compressed. Hall (1895), in an 
early description of methods of meas- 
uring girths of the trunk and limbs on 
schoolboys, emphasized that special ef- 
fort must be made to avoid the intro- 
duction of error caused by variation in 
the amount of tension applied to the 
tape. Steggerda (1942), in a compara- 
tive investigation of the measurement 
techniques being used at ten labora- 

* This procedure is specified on page 205.
tories, found wide deviation in the pres-
sure exerted in measuring width of 
shoulders, width of hips, and other di- 
ameters of the body stem. 

The first step toward control of the 
compression variable is clarification of 
purpose. If the investigator’s purpose 
were to measure maximum width of 
hips at the bi-iliocristal level, the arms 
of the calipers would be brought into 
light contact with the skin; if his pur- 
pose were to approximate closely a 
skeletal measurement of bi-iliocristal 
diameter, firm pressure would be ap- 
plied in order to reduce the thickness 
of the overlying soft tissues as much as 
practicable. Once the purpose is clear,
the compression decided upon should
be specified as objectively as possible,
and the measurer(s) trained to the
criterion as defined. Illustrative defini-
tions are (1) the tape is brought into 
sufficiently light contact with the skin 
that there is no indentation of the tis- 
sues, (2) the headpiece is moved down 
until it crushes the hair and makes firm 
contact with the head at the vertex, and 
(3) heavy pressure is applied to the 
arms of the calipers in an attempt to 
approximate maximum compression of 
the skin and subcutaneous adipose tis- 
sue (Meredith, 1951b). Laboratory 
utilization of these definitions necessi- 
tates painstaking effort and care. For 
many dimensions on the living child, 
reduction of variation in application of 
the instrument is a large part of attain- 
ing technical competence. 

There are many sources of measure- 
ment error. In fact, each step in a meas- 
urement procedure involves one or 
more possibilities for the occurrence 
of uncontrolled variation. The investi-
gator has the responsibility of minimiz- 
ing these multiple sources of extraneous 
variation to the extent necessary for 
satisfactorily studying particular prob- 
lems. What must be considered? Posi- 
tion of the subject: is this described in 
sufficient detail, is the description clear 
to the measurer(s), and does each child 
appear to meet the position specifica- 
tions at the time of measurement? In- 
strument: is this suitably designed for 
the task, is it adequately engineered and 
calibrated, and have the measurers de- 
veloped proficiency in handling it? 
Landmarks: are these precisely defined, 
are the definitions clear to each meas- 
urer, and is each measurer skilled in 
locating the landmarks and applying 
the instrument to them? Other facets 
of method: does the pressure or ten- 
sion approximate that stipulated, is the 
correct limb being measured, and are 
there obstacles to correct measurement, 
e.g., hair fasteners or braids, a limb in-
flammation or scab, fatigue or lack of
cooperation? Reading and recording: 
is the scale of the instrument being read 
at the proper place, does the measurer 
exercise care in reading the scale, and 
does the recorder give close attention 
first to registering correctly the values 
that are called and then to announcing 
what has been recorded for approval 
by the measurer? 

It follows that measurement data for 
studies of anatomic ontogenesis should 
be collected only when the following 
criteria are satisfied: (1) a written 
statement specifying each technique 
employed is prepared; (2) those who
participate in amassing the data are 
trained to meet stipulated standards of 
dependability; and (3) there is fre-
quent checking to assure that these
standards are maintained. Meredith
and Goodman (1941) gathered neo-
natal stature measures under these con-
ditions. The written statement required 
(1) placing the nude infant in dorsal 
recumbency on a specially constructed 
metric board, (2) standardizing the po- 
sition of the head, (3) bringing the 
knees together and extending the lower 
limbs, and (4) setting the two uprights 
of the apparatus in firm contact with 
the head and soles.* A high level of
training and its sustenance were dem-
onstrated as follows: two measurers,
working independently over a period
of eight weeks, measured 100 full-term
neonates on the day of birth, obtaining
(1) a reliability coefficient of r = .97
and (2) agreement within 0.5 centi-
meter for 92 per cent of the infants.
Comparison was made of the stature 
distribution from these records, and the 
stature distribution from routine hos- 
pital records for the same 100 neonates. 
Although the two means were alike, the 
magnitude of chance error differed so 
markedly that the standard deviations 
were 1.6 and 2.4 centimeters, respec-
tively. This manifestation of different 
operational levels indicates how widely 
measurement efficiency can vary and 
sounds an alert on any tendency to un- 
derestimate the importance of proce- 


* An assistant maintained the head position 
while the measurer used one hand to main- 
tain the position of the knees and the other 
to bring the sliding upright of the instrument 
snugly against the heels. The indispensability 
of an assistant in securing this measurement 
is one example of the importance of the 
assistant’s role in many measuring situations 
(Knott, 1941). 
dural design, adequate training, and sus-
tained attention to method. 
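
The two indices of dependability reported for the neonatal stature data, a reliability coefficient and the percentage of subjects on whom two measurers agree within a stated tolerance, can be computed from any paired series. The Python sketch below is illustrative only: the function names and the five pairs of readings are hypothetical and are not taken from the records of Meredith and Goodman.

```python
from math import sqrt


def pearson_r(x, y):
    """Product-moment correlation between two paired series."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two paired series of equal length (n >= 2)")
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def percent_within(x, y, tolerance_cm):
    """Percentage of subjects on whom the two records differ by no more
    than tolerance_cm (a tiny epsilon absorbs floating-point noise)."""
    hits = sum(1 for a, b in zip(x, y) if abs(a - b) <= tolerance_cm + 1e-9)
    return 100.0 * hits / len(x)


# Hypothetical stature readings (cm) by two measurers on the same five neonates.
measurer_a = [49.8, 51.2, 50.5, 48.9, 52.0]
measurer_b = [50.0, 51.0, 50.9, 49.0, 52.8]

r = pearson_r(measurer_a, measurer_b)
agreement = percent_within(measurer_a, measurer_b, tolerance_cm=0.5)
print(f"r = {r:.2f}; agreement within 0.5 cm = {agreement:.0f} per cent")
```

In practice the series would hold one pair of readings per child, and both indices would be recomputed at intervals to confirm that the standard is being maintained.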

Knott (1941), in a major monograph 
on measurement of the child, (1) re- 
viewed reliability studies made prior to 
1940 on fetuses, infants, and older chil- 
dren and (2) investigated reliability for 
35 external body measurements on chil- 
dren 3 to 6 years of age. Other reports 
on the reliability of measurements taken 
on the living child were made by Good- 
man (1941), O’Brien, Girshick, and 
Hunt (1941), Newcomer and Meredith 
(1951), Stolz and Stolz (1951), and
Edwards et al. (1955). Collectively, 
these studies are based on samples of 
children at different ages, from birth 
to early adulthood, and encompass nu- 
merous lengths, widths, girths, and 
measures of thickness. They show that 
many traits of body size can be meas- 
ured by competent anthropometrists 
with high dependability, e.g., paired 
series of independent measures on chil- 
dren homogeneous in age, sex, race, 
region, and secular period yield reli- 
ability coefficients above .95 for about 
50 per cent of the dimensions studied 
and above .90 for over 80 per cent
of the dimensions studied. 

Reliability investigations were re- 
ported by Björk (1947) for measure- 
ments on roentgenograms, by Tanner 
and Weiner (1949) for measurements 
on photographs, and by Boyd (1935) 
and Meredith and Hopp (1956) for 
measurements on casts. Potter and 
Meredith (1948) compared reliabilities 
for measurements taken on the roent- 
genogram and the living subject; 
Geoghegan (1953) compared reliabili- 
ties for measurements taken on the pho- 
tograph and the living subject. Among 
other findings, these studies report
standard errors of measurements below
0.10 centimeter for several facial diame-
ters obtained from roentgenograms and 
below 0.02 centimeter for some dental 
diameters obtained from casts. 

The comparability of measurements 
obtained from the photograph and the 
living subject was studied by Tanner 
and Weiner (1949) and Gavan, Wash- 
burn, and Lewis (1952). Ritt and Saw- 
tell (1930), Clarke, Geser, and Hunsdon 
(1956), Garn (1956), and Newman and 
Meredith (1956) studied the compara- 
bility of measurements obtained from 
the roentgenogram and the living sub- 
ject. 

When different laboratories and in- 
dividual investigators employ the same 
measurement procedure in collecting 
data for a given trait, the procedure is 
referred to as “standardized.” The im- 
portance of standardization of method 
was recognized in drafting the inter- 
national agreements of 1906 and 1912, 
and it has been stressed less formally 
in papers of more recent date (Hrd- 
licka, 1936; Steggerda, 1942). 

Standardization is seen in correct 
perspective as an ongoing process with 
advantages to be sought and excesses 
to be avoided. It functions effectively 
in (1) facilitating comparative and 
composite use of findings on anatomic 
ontogenesis (Meredith, 1943), yet (2) 
must not stifle the development of new 
techniques and the adoption of proce- 
dural improvements. 

Standardization is not always a 
straightforward matter; there are on- 
togenetic, phylogenetic, methodologic, 
anatomic, and historic considerations 
to be weighed. Sometimes, as the fol- 
lowing questions indicate, these con-
siderations are difficult to harmonize 
and implement. Since stature must be 
obtained in the recumbent position on 
infants, should this position be con- 
tinued throughout ontogeny? Should 
the measurement of stature be replaced 
by stem length, since the latter is the 
longest diameter that can be measured 
in early prenatal life and on most mam- 
mals other than man? Is it preferable 
to adopt the hip width that can be de- 
termined most reliably on children (bi- 
iliocristal diameter) or a hip width that 
is obtainable on most young mammals 
(interspinous diameter or bitrochan- 
teric diameter)? Should lower face 
width be taken on the living child, con- 
tinuing historic practice, or should the 
greater precision of the radiographic 
approach outweigh precedent? 

Having commented on the intricacies 
and tentativeness of standardization, it 
remains to give due emphasis to its re- 
search significance. For any selected 
trait, the aggregate knowledge acces- 
sible from a group of studies is directly 
related to the comparability of the 
measurement procedures utilized. This 
is illustrated in syntheses of research 
for head width (Meredith, 1953a) and
upper-limb length (Meredith, 1947). 
The standardization of head width has 
led to a substantial body of knowledge, 
whereas the unnecessary variation in 
method of measuring upper-limb 
length has produced a relatively small 
scientific yield. In 12 investigations of
upper-limb length use was made of 
three different limb orientations, three 
proximal landmarks, and two distal 
landmarks. 

Derived Measures. A derived meas- 
ure is a datum obtained from joint
utilization of two or more measure- 
ments. Examples of data of this type 
are a weight increment for the first 
postnatal year, derived by subtracting 
a weight measurement secured at birth 
from a weight measurement secured on 
the same child one year later (Simmons 
and Todd, 1938); a facial index at age 
7 years, derived from dividing a meas- 
urement for face length by a measure- 
ment for face width (Goldstein, 1936); 
and a measure of length of the head 
and neck at age 17 years, derived by 
subtracting a measurement for trunk 
length from a measurement for stem 
length (Meredith, 1939a). 

Before use is made of derived meas- 
ures, it should be shown that they meet 
high reliability standards. The special 
need for this becomes apparent on rec- 
ognizing an important difference be- 
tween measurements and derived meas- 
ures. Associated with any measurement 
there is probability of chance error; 
associated with any derived measure 
there is an error probability arising 
from no less than two measurements. 
The way in which these probabilities 
express themselves in reliability coeffi- 
cients is indicated by the following ex- 
ample. On a sample of 100 boys 15 
years of age, two sets of data were ob- 
tained for chest girth, abdomen girth, 
and the ratio of chest girth to abdomen 
girth. The coefficients of reliability 
were r = .98 for each of the measure-
ments (Newcomer and Meredith, 1951) 
and r=.85 for the derived measure 
(Trim and Meredith, 1952). 

A procedure for increasing the de- 
pendability of ratio data was described 
by Trim and Meredith (1952). Each 
subject was measured by four anthro-
pometrists designated T, N, S and P. 
The “single record” method of data 
collection was compared with a “team 
record” method. The dependability of 
the former was symbolized by corre- 
lating series of ratios derived from the 
measurements of T and N; that of the 
latter by correlating ratios derived from 
averaging the measurements of T and 
S and those of N and P. For the ratio 
of chest girth to abdomen girth, the 
obtained r’s were .85 for the “single
record” method and .96 for the “team
record” method. Corresponding r’s for
the ratio of shoulder width to hip 
width were .91 and .98, respectively. 
In each instance the change represents 
a 20 per cent improvement in relia- 
bility. For the study of many problems 
pertaining to the body form of indi- 
viduals, a procedure whereby the con- 
sistency of ratio data can be increased 
by 20 per cent merits adoption. 
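
The comparison of the “single record” and “team record” methods can be sketched in Python as follows. The sketch assumes only what is stated above (ratios from T are correlated with ratios from N, and ratios from the T-S average with ratios from the N-P average). The function names and the data are hypothetical, and the data are deliberately contrived so that the errors of paired measurers cancel exactly; real records would show a smaller gain.

```python
from math import sqrt


def pearson_r(x, y):
    """Product-moment correlation between two paired series."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def ratios(numerators, denominators):
    return [n / d for n, d in zip(numerators, denominators)]


def averaged(first, second):
    """Element-wise mean of two measurers' records."""
    return [(a + b) / 2 for a, b in zip(first, second)]


def single_record_r(num, den):
    """Correlate ratios from measurer T with ratios from measurer N."""
    return pearson_r(ratios(num["T"], den["T"]), ratios(num["N"], den["N"]))


def team_record_r(num, den):
    """Correlate ratios from the T-S averages with those from the N-P averages."""
    team_ts = ratios(averaged(num["T"], num["S"]), averaged(den["T"], den["S"]))
    team_np = ratios(averaged(num["N"], num["P"]), averaged(den["N"], den["P"]))
    return pearson_r(team_ts, team_np)


# Hypothetical chest and abdomen girths (cm) for five boys, four measurers each.
chest = {
    "T": [71, 71, 75, 75, 79],
    "S": [69, 73, 73, 77, 77],
    "N": [69, 73, 75, 75, 77],
    "P": [71, 71, 73, 77, 79],
}
abdomen = {m: [62, 66, 63, 68, 64] for m in "TSNP"}

single_r = single_record_r(chest, abdomen)
team_r = team_record_r(chest, abdomen)
print(f"single record r = {single_r:.2f}; team record r = {team_r:.2f}")
```

Averaging before forming the ratio is the essential step: the chance errors of two independent measurers tend to offset one another, so the averaged numerator and denominator carry less error into the ratio.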

There is no more demanding task in 
the collection of data for studying ana- 
tomic ontogenesis than that of amassing 
adequate increment data. The validity 
and research usefulness of increment 
data are related to the growth rate for 
the trait under investigation, the length 
of the ontogenetic interval covered, 
and the magnitude of the error com- 
ponent for the data. 

The validity of an increment datum 
varies directly with the reduction of 
chance error in the measurements used 
to derive it. For a given period of 
childhood, it is more difficult to attain 
adequate reduction of error in an in- 
crement of chest girth than head girth, 
stem length than stature, arm girth than 
leg girth, shoulder width than hip 
width, hip width than dental arch width
(Knott, 1941; Meredith and Cox, 1954). 
Even so, the error component of incre- 
ment data cannot be controlled satis- 
factorily for any trait except through 
rigorous procedures and constant vigi- 
lance (Meredith, 1955b). In all incre- 
ment studies made in the writer’s labo- 
ratory over the past twenty years the 
basic measurements have been obtained 
by two anthropometrists and required 
to meet explicit standards, i.e., when-
ever the two records fail to satisfy an 
adopted criterion for “close agree- 
ment,” each anthropometrist takes an- 
other measurement. Examples of the 
criteria for “close agreement” are as 
follows: agreement within 0.02 centi- 
meter on skeletal nose height, skeletal 
width of the lower face, and dental 
arch widths; 0.1 centimeter on head 
width, upper face width, bi-iliocristal 
width of hips, and leg girth; 0.2 centi- 
meter on stature, arm girth, and foot 
length; 0.4 centimeter on biacromial 
width of shoulders, stem length, upper- 
limb length, and chest girth. 
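
A rule of this kind lends itself to a simple check at the time of recording. The Python sketch below transcribes the tolerances listed above; the dictionary keys, the function name, and the sample readings are hypothetical conveniences, not part of the laboratory's written procedure.

```python
# Tolerances (cm) for "close agreement", transcribed from the criteria above.
CLOSE_AGREEMENT_CM = {
    "skeletal nose height": 0.02,
    "skeletal width of lower face": 0.02,
    "dental arch width": 0.02,
    "head width": 0.1,
    "upper face width": 0.1,
    "bi-iliocristal width of hips": 0.1,
    "leg girth": 0.1,
    "stature": 0.2,
    "arm girth": 0.2,
    "foot length": 0.2,
    "biacromial width of shoulders": 0.4,
    "stem length": 0.4,
    "upper-limb length": 0.4,
    "chest girth": 0.4,
}


def needs_remeasurement(trait, first_cm, second_cm):
    """True when the two anthropometrists' records fail the criterion,
    in which case each takes another measurement."""
    tolerance = CLOSE_AGREEMENT_CM[trait]
    # The small epsilon keeps floating-point noise from rejecting
    # a pair that differs by exactly the tolerance.
    return abs(first_cm - second_cm) > tolerance + 1e-9


# Hypothetical pairs of readings.
print(needs_remeasurement("stature", 120.4, 120.5))
print(needs_remeasurement("head width", 13.2, 13.5))
```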

Given a procedure for which the 
probability of error is known, and spe- 
cifying an ontogenetic interval, the 
validity of an increment datum varies 
directly with the velocity at which a 
trait is increasing or decreasing. Since 
many dimensions change more rapidly 
during early postnatal life than later, 
it is often practicable to use smaller 
increment periods in infancy (Kasius 
et al., 1957) than in childhood (Mere- 
dith and Meredith, 1950). Again, since 
over the childhood years some dimen- 
sions change more slowly than others 
(e.g., head width more slowly than
lower limb length), increment analyses 
sometimes are tenable for periods of
shorter duration on one trait (Mere-
dith, 1955a) than another (Holcomb
and Meredith, 1956). 

Given a trait that is changing at a 
constant velocity and a measurement 
procedure for which the probability of 
error is known, the validity of an in- 
crement datum varies directly with the 
length of the temporal period. To il- 
lustrate: the same amount of error may 
be 40 per cent of an obtained increment 
for one month, 5 per cent of an annual 
increment, and 1 per cent of a quin- 
quennial increment. Frequently, be- 
cause numerous anatomic traits do not 
change at a constant velocity, the use 
of long periods is contraindicated. 
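
The arithmetic behind this illustration is brief: at constant velocity the increment grows in proportion to the interval, so a fixed error shrinks as a share of it. The Python sketch below assumes a hypothetical velocity and error chosen so that the error is 40 per cent of a one-month increment. Under strictly constant velocity the share then falls to about 3 per cent for a year and under 1 per cent for five years, the same order as the rounded figures quoted above.

```python
def error_share_percent(error_cm, velocity_cm_per_month, months):
    """Measurement error as a percentage of the increment accumulated
    over `months` at a constant velocity."""
    increment_cm = velocity_cm_per_month * months
    return 100.0 * error_cm / increment_cm


velocity = 0.5  # hypothetical constant growth rate, cm per month
error = 0.2     # hypothetical measurement error, cm

monthly = error_share_percent(error, velocity, 1)
annual = error_share_percent(error, velocity, 12)
quinquennial = error_share_percent(error, velocity, 60)
print(f"{monthly:.1f}, {annual:.1f}, {quinquennial:.1f} per cent")
```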

Efficient procedure in the accumu- 
lation of increment data requires close 
correspondence between the ontogene- 
tic interval selected for study and that 
studied; i.e., when a problem calls for
increments representing the period be- 
tween the fourth and fifth birth anni- 
versaries, or between menarche and the 
first anniversary of menarche, the ob- 
tained data become less valid as the 
examination ages (or stages) vary for 
different subjects. In ongoing research 
programs the desideratum of regularity 
is almost attainable by a person dele- 
gated to maintain an appointment reg- 
ister and to strive for precise timing of 
examinations. Under the writer’s direc-
tion, increment studies have been car-
ried out to cover quarterly, semiannual,
and annual intervals, all defined with
reference to date of birth. Sustained 
effort has been made to schedule ex- 
aminations within three days of the ex- 
act age termini decided upon. This 
standard has been found practicable; 
deviations have been few, except for
occasions of illness or travel. 
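
An appointment register of the kind described can be checked mechanically. The Python sketch below is one possible arrangement, not the writer's own procedure: it computes the date on which a child reaches a stated age in months and tests whether an examination falls within three days of it. The function names and dates are hypothetical, and the treatment of short months (a child born on the 31st is assigned the last day of a shorter month) is an assumption.

```python
import calendar
from datetime import date


def target_date(birth, months):
    """Date on which the child is exactly `months` months old."""
    total = birth.month - 1 + months
    year = birth.year + total // 12
    month = total % 12 + 1
    # Clamp the day for short months (assumed convention).
    last_day = calendar.monthrange(year, month)[1]
    return date(year, month, min(birth.day, last_day))


def within_window(birth, exam, months, window_days=3):
    """True when the examination falls within window_days of the age terminus."""
    return abs((exam - target_date(birth, months)).days) <= window_days


# Hypothetical child examined near the fourth birth anniversary.
birth = date(1950, 3, 15)
print(target_date(birth, 48))
print(within_window(birth, date(1954, 3, 17), 48))
print(within_window(birth, date(1954, 3, 20), 48))
```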

It follows from the contents of the 
foregoing paragraphs that the investi- 
gator’s opportunities for amassing 
highly valid increment data lie in trait 
selection, interval selection, interval 
precision, and the reduction of meas- 
urement error.

Derived measures have been used in 
investigating variability of aspects of 
body form at different ages (Freeman, 
1933), changes in body form with age 
(Meredith and Sherbina, 1951), asso- 
ciations between components of body 
form (Trim and Meredith, 1952), in- 
tragroup variation in index trends 
(Kelly, 1937), interindividual variation 
in index trends (Meredith, 1955a), and
the influence of environmental 
on index trends (Meredith, Knott, and 
Hixon, 1958). They also have been 
employed in studies of relationship be- 
tween size at a given age and growth 
rate over an ensuing period (Meredith 
and Meredith, 1958), relationship be- 
tween growth rate and illness history 
(Evans, 1944), associations between age 
of menarche and growth rate (Shuttle- 
worth, 1937), variability of absolute 
and percentage increments for different 
ontogenetic periods (Meredith and 
Carl, 1946), interindividual variation in 
absolute increment trends (Meredith, 
1939b), intraindividual variation in per- 
centage increment trends (Nanda, 
1955), subgroup intercomparison of 
absolute increment trends (Shuttle- 
worth, 1939), subgroup intercompari-
son of percentage increment trends
(Meredith, 1958), relationship between 
increments for adjacent age intervals 
(Simmons and Todd, 1938), associa- 
tions among increments for different
traits during a given period (Newman
and Meredith, 1956), and construction
of increment norms for infancy (Ka-
sius et al., 1957) and childhood (Mere-
dith and Meredith, 1950). 


SUBDIVISION OF DATA 


After the data for a study of ana- 
tomic ontogenesis have been accumu-
lated, they are usually subdivided in
some manner prior to statistical analy- 
sis. For any given trait, the data may 
be classified ontogenetically and/or 
subgrouped with respect to one or 
more other variables. The purpose of
ontogenetic classification is to partition
ontogenesis into segments appropriate
for determining sequential modifica-
tions in the distribution of a trait. The
purpose of subgrouping in terms of ad-
ditional variables is to discover, or con-
trol for, differences in trait distribution
associated with these variables.

The writer of a research report who 
omits specification of the basis of data 
subdivision, or describes subgroup for- 
mation ambiguously, forfeits part of the 
potential value of his investigation. Cor- 
respondingly, the reader of a research
report who fails to give close attention
to subgrouping procedures forgoes
background that is pertinent to an un- 
derstanding of the findings of the study 
and to making comparative use of them. 
There is ambiguity when, in the ab- 
sence of reference to the particular 
methods used in subgroup formation, 
findings are presented for subgroups 
designated “malnourished” and “well 
nourished” or “underweight” and 
“overweight” (Jenss and Souther, 
1940). Apparent contradictions arise
from detaching findings on “prepubes- 
cent” and “postpubescent” subgroups 
from explicit notations on whether 
these subgroups were formed in terms 
of age of menarche, growth of pubic 
hair, breast development, or other cri- 
teria (Meredith, 1939b). Misinterpre- 
tation is fostered when a subgroup is 
given an evaluation caption such as 
“retarded” and nothing is presented to 
support the tenability of regarding the 
subgroup as anything other than a de- 
scriptive class fitly captioned “small” 
(Meredith, 1955c). 

Ontogenetic Grouping. Studying 
change during ontogenesis in the dis- 
tribution of anatomic traits has been a 
major interest of investigators of hu- 
man physical development. In the re- 
search literature this is expressed by 
the frequency with which data are sub-
grouped in order to describe the cen- 
tral tendency and variability of a trait 
at consecutive stages or in successive 
periods. The stages or periods employed 
are chosen with reference to events that 
occur during ontogenesis or are re- 
lated to ontogenesis.

Classification of data on fetuses ac- 
cording to “menstrual age” illustrates 
grouping in terms of an event related 
to ontogenesis (Streeter, 1920). An- 
other example is placing data on new- 
born infants into “premature” and 
“full-term” categories, using the men- 
strual age 37 weeks (true age ca. 35
weeks) as the lower limit of the older 
group (Meredith, 1943). 

Grouping on the basis of ontogenetic 
events includes subdivision of data with 
regard to birth, menarche, and struc- 
tural attainments. Under structural at- 
tainments, previous reference has been
made to the grouping of prenatal data 
for dimensions of the head, trunk, and 
limbs in terms of stature classes (Scam- 
mon and Calkins, 1929). Data on new- 
born infants have been grouped in “pre- 
mature” and “full-term” categories on 
the basis of attained body weight and/ 
or stature; common partitioning values 
have been 2.0 and 2.5 kilograms for 
weight and 45.0 and 47.0 centimeters 
for stature (Meredith, 1943). In child-
hood, ontogenetic grouping has been
based on attained stage of tooth erup-
tion. Hellman (1932) and Krogman
(1951) subdivided data for facial di- 
mensions with respect to a series of ten 
selected dental stages beginning with 
eruption of the deciduous teeth and 
extending to full eruption of the per- 
manent dentition. A similar grouping 
procedure, using seven odontic classes, 
was utilized by Zuckerman (1955). 

It will be recognized that grouping 
according to attained stature, weight,
or dental stage is classification by the
use of different anatomic scales. Al- 
though each scale provides groups that 
are ontogenetically progressive, the 
scales are neither perfectly intercorre- 
lated nor rectilinearly related to chron- 
ologic age. 

Several ontogenetic events have been 
used as reference points in grouping to 
obtain statistics representing equally 
spaced time intervals. Shuttleworth 
(1937) chose the occurrence of men- 
arche as the focal event and studied 
increments in a variety of dimensions 
during several years prior to and fol- 
lowing it. Burson (1952) chose erup- 
tion of a tooth and its antimere (the 
permanent mandibular central incisors) 
as the point of reference and studied
changes in dental-arch width during 
several years immediately preceding 
and subsequent to it. 

The reference event most commonly 
utilized is birth. Before 1890 it was 
customary to form postnatal age groups 
by the “last birthday” method: (1) each 
child was asked to state his age, and 
(2) those replying “6” constituted one 
group, those replying “7” another, and 
so forth (Meredith, 1936). In contem- 
porary research it is commoner to form 
annual age groups by the “nearest birth- 
day” method: (1) the age of each child 
is calculated from records for date of 
birth and date of examination, and (2) 
children are grouped according to their 
closest birth anniversary. The means for 
data subdivided in these ways differ 
systematically; e.g., on a sample of boys 
each within six months of his eighth 
birth anniversary the age mid-point is 
8 years; on a sample of boys each giving 
his age as “8” the mid-point is 8.5 years. 
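
The systematic difference between the two methods is easily seen when both are applied to the same records. The Python sketch below is illustrative: the function names and dates are hypothetical, and decimal age is approximated with a 365.25-day year.

```python
from datetime import date


def age_in_years(birth, exam):
    """Chronologic age in decimal years (365.25-day year; an approximation)."""
    return (exam - birth).days / 365.25


def last_birthday_group(birth, exam):
    """Completed years of age, i.e., the age a child states when asked."""
    years = exam.year - birth.year
    if (exam.month, exam.day) < (birth.month, birth.day):
        years -= 1
    return years


def nearest_birthday_group(birth, exam):
    """Age in years at the closest birth anniversary."""
    return int(age_in_years(birth, exam) + 0.5)


# Hypothetical child, born 10 January 1950, examined at about 8 years 8 months.
birth = date(1950, 1, 10)
exam = date(1958, 9, 10)
print(last_birthday_group(birth, exam))     # falls in the "8" group (ages 8.0 to 9.0)
print(nearest_birthday_group(birth, exam))  # falls in the "9" group (ages 8.5 to 9.5)
```

The same child is thus reported at “8 years” under one method and at “9 years” under the other, which is why the grouping method must accompany every tabled age.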

Every investigator should define his 
ontogenetic grouping precisely. This is 
necessary for reasons pertaining to the 
clear communication of findings and 
the synthesis of findings from different 
studies. The means for an anatomic 
trait listed in the tables of two investiga- 
tions at age “10 years” are not directly
comparable when in the one investiga- 
tion this figure symbolizes the lower 
limit of a “last birthday” age group 
(O’Brien, Girshick, and Hunt, 1941) 
and in the other it symbolizes the mid- 
point of a “nearest birthday” age group 
(Gray and Ayres, 1931). Measures of 
variability, for any trait that is chang- 
ing progressively with age, increase 
with increase in the distance between 
the upper and lower limits of age
groups. There is also a relationship be- 
tween the range for ontogenetic groups 
and the magnitude of correlation coef-
ficients (Demisch and Wartmann,
1956).

Bivariate Subgrouping. Another
broad interest of investigators of human
physical growth has been the study of 
the extent to which the distribution of 
anatomic traits varies conjointly with 
ontogenetic time and a second variable. 
This interest is expressed in the research 
literature by the extensive assortment 
of investigations employing subgroups 
for birth order and chronologic age 
(Meredith and Brown, 1939), birth 
weight and chronologic age (Hammett, 
1918), diet and chronologic age (Slyker
et al., 1937), chronologic age and ethnic 
group (Lloyd-Jones, 1941), chronologic 
age and secular period (Meredith, 
1941), chronologic age and appearance 
of axillary hair (Richey, 1937), dental 
stage and orthodontic class (Hellman, 
1931), osseous stage and dental stage 
(Demisch and Wartmann, 1956), and 
so forth. 

There is considerable variation in the 
way nonontogenetic variables are sub- 
divided. Menarcheal groups have been 
formed by sectioning the continuum 
for age of menarche into three seg- 
ments, the middle one including first 
menses between 12.5 and 13.5 years 
(Goldstein, 1939); three segments, the 
middle one including first menses be- 
tween 13.0 and 14.0 years (Richey, 
1937); seven segments, five consecutive 
half years between two open-end tails 
(Greulich and Pyle, 1950); and eight 
segments, six consecutive half years 
between two open-end tails (Shuttle- 
worth, 1937). Socioeconomic groups 
have been formed by drawing from 
schools in the “wealthy” and “poor” 
sections of cities (Hopkins, 1947), 
dividing into two parts the distribution 
for “occupation of the principal wage 
earner of the family” (O'Brien, Gir- 
shick, and Hunt, 1941), excluding the 
middle portion of the occupational dis- 
tribution and comparing the “profes- 
sional and managerial” section with the 
“semiskilled and unskilled” section 
(Meredith, 1951a), and selecting for 
comparison the “professional” and “un- 
skilled” segments only (Bowditch, 
1879). “Tall” and “short” groups have 
been constituted at a particular age, us- 
ing the upper and lower halves of the 
stature distribution (Baldwin, 1921), the 
upper and lower quarters (Van Dyke, 
1930), the upper and lower fifths 
(Meredith, 1939b), and the upper and 
lower tenths (Meredith, 1949). When 
the trait under study varies with the 
nonontogenetic trait utilized for group- 
ing, it must be expected that the magni- 
tude of the difference between sub- 
group means for a given age, or stage, 
will increase as one progresses from 
comparison of the two halves of a 
continuum to comparison of its terminal 
segments. 
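
This expectation can be checked on a small constructed example. The following sketch uses hypothetical data in which the trait rises steadily with the grouping variable; the function name and all values are illustrative assumptions, not figures from the studies cited.

```python
def extreme_group_difference(pairs, fraction):
    """Difference between the mean trait value of the top and bottom
    `fraction` of subjects, ranked on the grouping variable.

    pairs: list of (grouping_value, trait_value) tuples.
    fraction: 0.5 for halves, 0.25 for quarters, 0.1 for tenths, etc.
    """
    ranked = sorted(pairs, key=lambda p: p[0])
    k = max(1, round(len(ranked) * fraction))
    low = [trait for _, trait in ranked[:k]]
    high = [trait for _, trait in ranked[-k:]]
    return sum(high) / len(high) - sum(low) / len(low)


# Hypothetical sample: 20 children; the trait rises 0.5 unit per cm of stature.
sample = [(120 + i, 22.0 + 0.5 * i) for i in range(20)]

for name, frac in [("halves", 0.5), ("quarters", 0.25),
                   ("fifths", 0.2), ("tenths", 0.1)]:
    print(name, extreme_group_difference(sample, frac))
```

In this constructed sample the difference between subgroup means grows from 5.0 for halves to 9.0 for tenths, so results from studies using different segmentations are not directly comparable.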

In several studies “tall” and “short” 
groups have been formed at selected 
ages and then followed for a number 
of years. From one trio of studies it has 
been found that tall girls, in comparison 
with short girls, attain menarche earlier 
(Shuttleworth, 1934), later (Van Dyke, 
1930), and at the same time (Barker 
and Stone, 1936). These seeming dis- 
crepancies are a function of the onto- 
genetic variable; the tall and short 
groups were constituted, using stature 
distributions at age 13 years, two years 
before menarche, and at age 17 years, 
respectively. 

From the foregoing introductory 
treatment of data subdivision, it be- 
comes apparent that (1) there are in- 
timate connections between the findings 
of a study and the procedures used in 
subgrouping, and (2) as subgrouping 
practices become standardized the in- 
tercomparison of different studies on a 
given topic is facilitated. 

It is beyond the scope of this 
chapter to discuss the investigation of 
anatomic ontogenesis with respect to 
the statistical methods employed in 
analyzing data or the tabular, graphic 
and other methods utilized in reporting 
findings. Notwithstanding, attention is 
directed to the desirability of comple- 
menting textbook readings on these 
topics with readings of illustrative ap- 
plications in research and proposed 
applications in the further study of 
growth problems. Selected references 
are as follows: Boucher (1957), Boyd 
(1955), Deming (1957), Garn (1955), 
Geoghegan (1953), Hunt, Cocke, and 
Gallagher (1958), Kavanagh and 
Richards (1942), Kihlberg and Koski 
(1956), Koch and Kaplan (1950), Main- 
land (1955), Meredith (1935, 1943, 
1955a), Newman and Meredith (1956), 
Quo (1953), Sholl (1954), Tanner 
(1951, 1952), Tyler (1954), Wilmer 
and Scammon (1945), and Yates 
(1950). 

One of the great mysteries of life is the 
power of growth, that harmonious develop- 
ment of composite organs and tissues from 
simple protoplasmic cells, with the ultimate 
formation of a complex organism with its 
orderly adjustment of structure and function. 
Equally mysterious is that wonderful power 
of rehabilitation by which the cells of the 
body are able to renew their living substance 
and to maintain their ceaseless activity 
through a period, it may be of fourscore 
years, before succumbing to the inevitable 
fate that awaits all organic structure. This 
bodily activity, visible and invisible, is the 
result of a third mysterious process, more or 
less continuous as long as life endures, of 
chemical disintegration, decomposition, and 
oxidation, by which arises the evolution of 
energy to maintain the heat of the body and 
the power of mental and physical work. These 
three main functions constitute the purpose 
of nutrition. . . . Development, growth, and 
vital activity all depend upon the availability 
of food in proper amounts and proper quality. 

Chittenden (1907) 

Life is a chemical function. Lavoisier, 
the pioneering chemist, brought to- 
gether the isolated chemical, biologic, 
and physical discoveries leading to this 
concept more than a century ago. The 
chemistry of life and growth may be 
studied in the laboratory or in living 
organisms. Both methods are of funda- 
mental significance to the study of 
biologic growth of man. 

Child development includes two 
types of growth, the one being visible, 
the other invisible. Visible growth is 
characterized by increase in the body 
size and change in its shape and propor- 
tions. Chemical and physiologic growth, 
on the other hand, is described as in- 
visible growth, since it results from a 
sequence of biochemical and phys- 
iologic adjustments involving integrated 
chemical, metabolic, and functional re- 
actions by which the incorporation of 
foodstuff as new tissues, or as renewal 
of the body, may take place. Invisible 
growth, therefore, is concerned with 
the subtler chemical, metabolic, and 
physiologic functions and adaptations 
of the enlarging organs and tissues and 
fundamental changes in body structure 
and composition—it is the bulwark of 
physical growth. Visible and invisible 
growth coexist; neither can exist in 
isolation. 

In spite of the great complexities of 
chemical and physiologic growth, 
many aspects of which are only par- 
tially understood, it is a basic and in- 
tegral part of child development and a 
necessary function to a rational program 
of improvement of health, growth, and 
development. Methods of assessment, 
therefore, are of great consequence and 
significance. The purpose of this chapter 
is to present a brief consideration of 
the importance of methods in the study 
of child development concerned with 
chemical and physiologic growth to- 
gether with some illustrative chemical, 
metabolic, and functional reactions in- 
volved therein and factors affecting 
them. 

The massive scientific literature 
records results from carefully designed 
research endeavors embracing a major- 
ity of the chemical, biologic, phys- 
iologic, and medical fields and is 
available to those desiring all the facts; 
therefore, only a few references per- 
tinent to this discussion are cited here. 
Our insight into the processes of life 
may be enriched by many noteworthy 
communications pertaining to the chem- 
ical aspects of life (Hopkins, 1933; 
Mendel, 1923; Needham, 1942), to 
growth (Brody, 1945; Forbes, 1952; 
Macy, 1942, 1946, 1951; Moulton, 1923) 
and development (Willier, Weiss, and 
Hamburger, 1955), to the physiologic 
adjustment of the body to inherent 
biologic function (Adolph, 1943; Bar- 
croft, 1934; Benedict, 1938; Bernard, 
1865; Cannon, 1932; Haldane, 1931), to 
biochemical individuality (Loeb, 1947; 
Williams, 1956), to the dynamic 
hormonal, nutritional, and biochemical 
changes characterizing the progressive 
stages in the life cycle (Macy and 
Kelly, 1957; Mitchell et al., 1945; Spray 
and Widdowson, 1950; Widdowson 
et al., 1951), and to body composition 
(Behnke, 1941-1942, 1953; Brozek, 
Mori, and Keys, 1958; Cowgill, 1958; 
Friis-Hansen et al., 1951; Hunt and 
Giles, 1956; Keys and Brozek, 1953; 
Keys et al., 1950; McCance and Wid- 
dowson, 1951) and the structure of 
the body (Reynolds, 1950; Stuart and 
Sobel, 1946; Tanner, 1953, 1955; Todd, 
1937) as it grows and matures. 


CHEMICAL BASIS 
OF LIVING GROWTH 


Living growth is a process charac- 
terized by the utilization of nutrient 
substances differing more or less from 
those composing the body. The or- 
ganism must be capable of taking the 
relatively complex components of 
foodstuffs, converting them into smaller 
and simpler molecules, selecting those 
required, and then synthesizing or re- 
fashioning the simpler molecules into 
complex chemical units that are char- 
acteristic of the organism itself. By 
the processes of digestion, food is first 
broken down and changed to a suitable 
form for active participation in body 
chemistry and is then assimilated. Each 
of the nutrients absorbed, retained, and 
utilized contributes to the formation, 
maintenance, and function of parts of 
the body or the whole of the organism. 
Living growth constitutes the increase 
in mass of communities of cells in the 
tissues and organs that form the mosaic 
of the different metabolizing systems 
by which the intricate biologic proc- 
esses take place. 

The last century was enriched by 
many epoch-making discoveries leading 
to the energy concept of foods, to the 
evaluation of different foods on the 
basis of their chemical and biological 
properties, and to the modern theory 
that foodstuffs must furnish not one 
but more than 60 different kinds of 
nutrient substances known to be present 
in the body and necessary for the main- 
tenance of life and support of growth 
and health. Concurrently, biochemical 
discoveries led to a new concept, which 
envisions a dynamic state of the body 
constituents (replacing the classic pic- 
ture of a fixed system), to extensive 
knowledge concerning increased dietary 
requirements and adjustment of body 
composition with growth, and to the 
metabolic mechanisms involved in 
changing physiologic functions, adapta- 
tions, and needs during succeeding 
phases in the life cycle. 

Modern biology envisions the living 
tissues to be in a dynamic state of 
building and tearing down (Schoen- 
heimer, 1942). This interchanging of 
elements is an essential feature of 
physiologic and chemical growth, since 
it permits the young organism to change 
in biochemical complexity and organiza- 
tion and to progress towards chemical 
maturity of body composition (Moul- 
ton, 1923) and maturity of structure 
and function. The degree of difference 
in the quality and intensity of the 
irreproducible biologic reactions due to 
environmental change, such as diet, 
however, depends in large measure on 
the inheritance, the physiologic and bi- 
ochemical state, and the nutritive back- 
ground of the individual. 

The proper application of existing 
methods and the development of new 
and more precise ones for fundamental 
studies of chemical and physiologic 
growth and nutritional adequacy is of 
public health concern and the basis for 
extending knowledge of the interrelated 
processes of growth. The interpretation 
of the data collected in such method- 
ologic studies must take into considera- 
tion many factors, such as nutritive 
growth requirements, food intake, 
nutritional chemistry, hormonal and 
other factors. 

Growth Requirements. World-wide 
observations have established the fact 
that inadequate diets and malnutrition 
may warp body and mind and retard 
growth. Growth increases nutritive 
requirements, and its progress is in- 
fluenced by nutritional state, hormonal 
activities, and other factors. 

Food factors are basic to growth and 
developmental progress. The first req- 
uisite for optimal inherent growth is an 
adequate food supply. A nutritionally 
adequate diet provides the essential ele- 
ments and compounds, in proper pro- 
portion, for the manufacture of new 
body tissues, for maintenance of nutri- 
tional stability with a “steady state” of 
internal environment, for rebuilding 
existing body structures, and for the 
energy requirements of basic metabolic 
processes and activity. 

Nutritional factors determine how 
the body uses the individual nutrients 
in the construction of new organs and 
body tissues and at the same time main- 
tains the physiologic functions com- 
mensurate with biologic age and in- 
creasing body size. Hence inquiries into 
physical, chemical, and physiologic 
growth cannot be satisfied alone by the 
study of the food eaten but must take 
into consideration nutritional factors. 

The development and maintenance of 
healthy body structures depend on the 
supply of many food substances and the 
fate of the individual nutrients as they 
pass through the metabolic processes of 
digestion, absorption, and excretion or 
utilization. In fulfillment of growth and 
developmental needs, there must be 
sufficient energy present in the diet to 
meet the requirements of activity, of 
maintenance of body temperature and 
basal heat production, and of full utili- 
zation of the essential amino acids, 
vitamins, and minerals in the metabolic 
enzyme systems for the synthesis of 
protoplasm—true living growth. If the 
diet is short in supply of energy, the 
protein or its constituent amino acids 
cannot be utilized to the fullest extent, 
and this condition may lead to the sacri- 
fice of protein of the body for fuel to 
provide for activity requirements, thus 
depriving the enzyme, hormonal, and 
other essential metabolic activities of 
essential protein. 

Prolonged use of marginal diets may 
force the body into a lowered nutri- 
tional and functional state. Depressed 
nutritional states may develop during 
any epoch in the life cycle and influence 
the health, growth, and dietary require- 
ments in subsequent periods. Low- 
ered levels of metabolic performance 
in the body involve food intake, the 
processes of utilization, body functions, 
endocrine and vitamin balances, physical 
environment, attitudes, and social 
conditions. A relatively slight change in 
the nature of one of many chemical 
reactions proceeding in the body may 
result in an alteration of digestion, ab- 
sorption, or utilization of the food 
factors and thereby change the course 
and intensity of inherent growth by 
failure to nourish the body adequately. 
Each individual, therefore, has a bio- 
chemical individuality with a charac- 
teristic physiologic capacity to utilize 
and store chemical substances from in- 
gested foodstuff determined by he- 
redity, nutritional state, eating and 
elimination habits, and physical and 
mental state; and these factors must be 
considered in the interpretation of data 
obtained for the assessment of chemical 
and physiologic growth. 

Hormonal factors influence different 
steps in metabolism either directly or 
indirectly by enzyme production and 
interaction of individual vitamins, 
amino acids, and minerals. The growth 
hormone produced during infancy and 
childhood increases nitrogen metabolism 
in the body in the form of protein 
and is an integral part of growth and 
synthesis of protoplasm. It aids in nitro- 
gen or protein conservation when 
dietary protein and carbohydrates are 
in short supply. The basal metabolic 
rate and protein synthesis (Sokoloff 
and Kaufman, 1959) are influenced by 
thyroid function. Other hormones play 
equally essential roles in nutrition and 
growth. It is significant that vitamins 
and amino acids are used as precursors 
of the hormones. 

Other factors influence the require- 
ments for growth. The chemical com- 
position of the body changes from rich 
protoplasmic growth during infancy to 
relatively more equalized skeletal and 
protoplasmic growth with changing fat 
deposition during childhood. Young in- 
fants retain a higher percentage of pro- 
tein fed and are more efficient in their 
utilization of it than older infants and 
children. New physiologic mechanisms 
are continually coming into activity, 
while others are receding, so that the 
child may cope with changing internal 
and external conditions as develop- 
mental alterations take place. As the 
child grows, his organs and body 
systems are unremitting in their progress 
toward full performance of physiologic 
activities peculiar to them in mature or 
adult life, some developing at a faster 
rate than others (Scammon, 1930). The 
composition of protoplasmic growth 
reaches chemical maturity (Moulton, 
1923) and mature hydration (Hunt and 
Giles, 1956) during childhood and re- 
mains more or less constant thereafter. 

Increase in visible size is not self- 
contained but occurs amid a multitude 
of chemical changes and additions for 
enlargement of tissues, organs, and the 
body structure as a whole. These are 
accompanied by modifications in func- 
tional capacity and alterations in in- 
ternal environment that permit full 
realization of the inherent capacity of 
the child for growth and development 
commensurate with his nutritional 
status and stage of development in the 
life cycle. If the nutritive requirements 
for growth and activity are not met in 
full measure, the link between harmoni- 
ous blending of chemical growth and 
physiologic functioning is disturbed, 
and the inherent healthy pattern of 
growth and body composition cannot 
be maintained. Excess of some nutrients 
may also cause untoward results. Bio- 
chemical individuality is exhibited in 
food intake and nutritional needs, in 
blood composition, body structure and 
composition, in hormonal activity and 
enzyme levels, and in digestive, absorp- 
tive, and excretory activities. 

As the methodologies for assessing 
chemical and physiologic growth reach 
greater precision and scope, cumulative 
evidence is emerging to explain in part 
some of the underlying individual dif- 
ferences that anthropologists, physi- 
ologists, and social scientists have 
observed to exist among boys and girls 
of the same race, of comparable 
chronologic and anatomic age, with 
similar activity, and mental and in- 
tellectual acumen, and living in similar 
economic and social conditions or cir- 
cumstances. 

Relation between Internal and Ex- 
ternal Environment. The organism is 
ultimately dependent on external con- 
ditions and on the integrative processes 
mediating the relationship between the 
organism and external environment. Al- 
though the blood provides a home- 
ostatic internal environment, it bears 
to actual cell environment a relation 
similar to that of the external environ- 
ment but of a much closer and well- 
defined sort. 

The cells within the body are bathed 
by fluids that constitute an internal en- 
vironment, the milieu interieur of 
Claude Bernard (1865). Life and 
growth of cells are possible only if the 
composition of these fluids varies within 
narrow limits. Haldane (1931) em- 
phasized the maintenance of body 
structure and the fact that structure 
and activity cannot be separated, a 
phenomenon that is the active mani- 
festation of the persistent whole life of 
the organism. Barcroft (1934) observed 
that gross variations in internal en- 
vironment do not result in devastating 
disturbances in such body functions as 
heart action, muscular efficiency, and 
kidney function; rather, variations in 
the internal environment may result 
in subtle mental disturbances and lack 
of ability to concentrate and to think 
logically. The implications are that we 
can expect to find normal physiologic 
function and intellectual development 
only in an organism whose blood and 
lymph, bathing all the living cells in 
the body, are able through vital 
biological mechanisms to preserve 
constant the conditions of life in the 
internal environment and a high level 
of nutritional state. 


Methodologies 


Knowledge of the chemistry of the 
body rests on measurements of the con- 
centration of organic and inorganic 
components in the body fluids: the food 
intake, the absorption and retention of 
nutrients in the body, primarily by the 
metabolic balance procedure, and the 
major chemical components of the body 
(protoplasm, fat, and water). Some 
procedures (Brozek, 1956, 1957; Jolliffe 
et al., 1950; Sinclair, 1948; Spector 
et al., 1954) used to study nutritional 
status and chemical and physiological 
growth in relation to physical growth 
and health are presented in Table 6-1. 
These procedures are based on the ap- 
plication of clinical, dietary, biochemi- 
cal, physiologic, and metabolic 
techniques. Some are direct and others 
are indirect measurements, which, 
either singly or in combination, give an 
evaluation of nutriture, invisible 
growth, and physiologic function. 


RESEARCH DESIGN 


Purpose and Degree of Accuracy 
Desired. There are many phases of 
chemical and physiologic growth re- 
search, each of which may be ap- 
proached by a variety of different 
scientific routes, by procedures and 
methods of varying degrees of ac- 
curacy, and by involving a few or large 
number of subjects for short or long 
periods of time. The first requisite, 
therefore, for designing an investigation 
is to establish a clearly delineated pur- 
pose and the degree of accuracy neces- 
sary to accomplish the objectives. The 
requirements must be reconciled with 
proper application within the frame- 
work of the available resources of time, 
finances, space, trained personnel, 
facilities, and equipment. As a matter of 
fact, this initial state is the most chal- 
lenging one in research, inasmuch as it 
requires an appraisal of existing attacks 
and current procedures in use, an 
evaluation of available facilities and re- 
sources, and the development of new, 
more specific, and accurate methods 
that will accomplish an unequivocal 
solution to the problem undertaken 
with the least expenditure of time, 
effort, and cost. 


CHEMICAL AND PHYSIOLOGICAL 
INVISIBLE GROWTH 


The metabolic balance method is the 
core procedure in a comprehensive 
study of nutrition and chemical growth. 
The description of the method and the 
use of the knowledge gained (Macy, 
1942, pp. 5-6) is presented thus: 


The sole means of determining total 
nutritive requirement of the vital func- 
tions of a living body is the metabolic 
balance, in which are determined the 
actual amounts of nitrogen, calcium, 

TABLE 6-1. METHODOLOGIES FOR ASSESSING INVISIBLE GROWTH AND NUTRITURE 

I. Assessment of Chemical and Physiological Growth 
   A. Cumulative chemical [...] determined by the metabolic balance method. 
      1. Determination of difference between analyzed nutrients in the food 
         consumed and their outgo in urine and feces over successive five-day 
         balance periods. These may include proteins (nitrogen or amino 
         acids), heat of combustion of carbohydrates, fats, and total energy; 
         total electropositive minerals (calcium, magnesium, sodium, and 
         potassium); total electronegative minerals (phosphorus, chlorine, 
         and sulfur); and any one of a number of vitamins. 
      2. Relationship of nutrients in metabolism: ratios of calcium to 
         phosphorus, nitrogen to phosphorus, nitrogen to potassium, nitrogen 
         to sulfur, phosphorus to potassium, excess anions or cations stored; 
         protein as related to energy, or energy from carbohydrate and fat. 
   B. Physi[...] 
      1. Basal oxygen consumption and basal heat production. 
      2. [...] distribution in twenty-four hour samples of urine. 
         a. Urinary nitrogen partition: total nitrogen, urea nitrogen, 
            creatinine nitrogen, creatine nitrogen, uric acid nitrogen, 
            ammonia nitrogen, and estimated [...] nitrogen. 
         b. Relationship of total nitrogen excretion to creatinine, to the 
            creatinine coefficient, to sulfur, and to carbon. 

II. Assessment of Body Composition 
   A. In vivo analysis. 
      1. Total body water and its component parts, intracellular and 
         extracellular water; [...] mass; and total body fat. 
      2. Evaluation from basal heat production, specific gravity, creatinine 
         excretion, nitrogen and mineral metabolic balances; nitrogen and 
         mineral content of blood serum and eryth[...] 
   B. In vitro [...] 
      Direct chemical analysis of whole cadavers or parts. 

III. Assessments of Nutritional Adequacy and Status (Nutriture) 
   A. Nutritional status surveys: application of integrated clinical, 
      dietary, and biochemical observations. 
   B. Nutritional anthropology: application of body measurements with 
      emphasis on body [...] 
   C. Chemical anthropology: evaluation of nutrient metabolism, absorption, 
      retention, and metabolite excretion in relation to physical growth and 
      physiological function. 
   D. Nutritional status as it relates [...] to fundamental problems of 
      behavior. 

IV. [...] 
   A. Anthropometric measurements including body weight, length, width, 
      circumference, and subcutaneous tissue. 
   B. Estimation of body build. 

phosphorus, and other equally essential 
dietary constituents retained in the body. 
[...] between the intake 
and egesta, over a definite period 
of time, is the quantity actually 
retained in the body for its [...] 
activity. The mineral metabolic 
balance, [...], is not a true bal- 
ance, inasmuch as losses through the skin 
are unmeasurable over long periods of 
time or under usual environmental condi- 
tions. Tears, sweat, oral and nasal secre- 
tions constitute an additional potential 
source of mineral losses, mainly of chlo- 
rine, sodium, potassium, and sulfur. From 
the relatively more accurate retention 
values for calcium, phosphorus, and mag- 
nesium, the excretion of chlorine, sodium, 
and potassium through the skin can be 
approximated. 

Further it is stated: 


[...] the 
knowledge obtained about relative 
proportions of the different substances 
incorporated into the body, and the 
[...]ter the opportunity for determining 
controlling factors in the mechanisms 
by which the materials are being used. It 
is possible to predict, from a knowledge 
of the amount of sodium, potassium, and 
chloride retained in a given time, the 
distribution of body water between its 
two constituent forms, extracellular and 
intracellular water, from the amount or 
[...] 

[...]try or physics may have, yet it furnishes 
significant information on current 
metabolic performance, efficiency of 
food utilization, and biochemical and 
metabolic responses to the physiologic 
stress of [...] with its [...]ing 
nutritive needs. A detailed study of the 
paths of excretion provides additional 
information concerning the utilization 
of several chemical elements and serves 
as an integral part of the study of 
nutrition and growth. An investigation 
may involve three or four nutrients, or 
in a more comprehensive study of 
growth there may be included the 
study of the intake and excretion of as 
many as a dozen or more chemical sub- 
stances by a number of children, 
supplemented by observations and de- 
terminations that may be classified as 
pediatric, anatomic, psychometric, 
physiologic, anthropometric, 
roentgen[...], and chemical (Macy, 1942; 
Macy and Kelly, 1957). 


Isotopically and radioactively labeled 
elements are being used in biochemical 
tracer research to extend biological 
knowledge and to define more clearly 
the metabolic pathways in the body. 
Isotopes may exist in nature or may be 
created artificially by splitting the atom 
by physical force and thereby [...] 
the nucleus does not 
affect the chemical [...] of the 
[...] 
differences in physical [...] that 
serve as the basis of [...] or 
determination of these substances in 
metabolic materials, such as food, urine, 
and feces. By separation in pure form 
the isotopically labeled portions of the 
material may be determined quantita- 
tively, whereas the radioactive isotopes 
may be determined in smaller quantities 
by measuring their radiations by actual 
counting devices. Radioactive sub- 
stances, however, must be used with 
caution. 

As an illustration of the use of the 
metabolic balance method in the study 
of growth, let us consider nitrogen 
balance. The retention of nitrogen by 
a child who is provided with adequate 
energy to meet the demands of activity 
and growth is evidence of increasing 
protoplasmic tissue, the center of 
metabolically active living growth. The 
construction of protoplasmic tissue in- 
volves particularly the assimilation of 
nitrogen, sulfur, phosphorus, calcium, 
magnesium, potassium, sodium, and 
chlorine. The nitrogen, the sulfur, and 
part of the phosphorus are used largely 
as structural components in the protein 
combinations of body cells, whereas 
potassium, sodium, chlorine, and some 
of the phosphorus serve a functional 
purpose in maintaining the electro- 
neutrality of the tissues and an equitable 
osmotic environment for the proper 
distribution of intracellular and ex- 
tracellular fluids. On the other hand, 
calcium, magnesium, and the remaining 
phosphorus are associated with other 
elements, functioning largely in the 
growth and maintenance of the skele- 
ton. Beyond a small amount of essential 
lipids for cellular structure and func- 
tion, the accumulation of fat in the body 
is an additive but does not constitute 
metabolically active true living growth. 
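
The bookkeeping of a nitrogen balance can be sketched briefly in Python. The sketch assumes a five-day balance period, as listed in Table 6-1, and uses hypothetical totals; the function names are illustrative and do not come from the procedures cited. Because losses through the skin are not measured, the result is an apparent rather than a true retention.

```python
def retention(intake, urine, feces):
    """Apparent retention = intake - (urinary + fecal outgo), same units.

    Losses through the skin are not measured, so this is an apparent
    rather than a true balance.
    """
    return intake - (urine + feces)


def daily_retention(period_totals, days=5):
    """Average daily retention over a balance period of `days` days."""
    return retention(**period_totals) / days


# Hypothetical nitrogen totals (grams) for one five-day balance period.
nitrogen = {"intake": 50.0, "urine": 36.0, "feces": 5.0}

print(retention(**nitrogen))      # 9.0 g retained over the period
print(daily_retention(nitrogen))  # 1.8 g per day
```

A positive value indicates storage of nitrogen during the period; a negative value indicates that outgo exceeded intake.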

In a consideration of physical growth, 
it has been stated that the study of 
growth is a study of a moving point, 
and no measurement which does not 
allow us to measure the movement in 
time in relation to other values can be 
thoroughly successful (White House 
Conference, 1932). A study of growth, 
therefore, is enhanced by multiple 
periodic measures. This holds true also 
in the chemistry of growth, in which 
single and multiple measurements of 
several elements at succeeding levels 
of growth provide results which permit 
the study of the movement of growth 
through the relationship of one ele- 
ment to another, thus revealing more 
clearly the changes taking place (Alli- 
son, 1958; Macy and Kelly, 1957). 
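
The relationship of one element to another over successive periods can be tabulated as a series of retention ratios. The sketch below is a minimal illustration with hypothetical retentions for three balance periods; the function name and data layout are assumptions made for the example.

```python
def retention_ratios(periods, numerator, denominator):
    """Ratio of retained `numerator` to retained `denominator` for each
    balance period, in order. Periods with no retention of the
    denominator element yield None instead of a ratio."""
    ratios = []
    for retained in periods:
        denom = retained[denominator]
        ratios.append(retained[numerator] / denom if denom else None)
    return ratios


# Hypothetical retentions (grams per period) for three successive periods.
periods = [
    {"calcium": 1.0, "phosphorus": 0.5, "nitrogen": 8.0},
    {"calcium": 1.5, "phosphorus": 0.75, "nitrogen": 9.0},
    {"calcium": 1.5, "phosphorus": 1.0, "nitrogen": 9.0},
]

print(retention_ratios(periods, "calcium", "phosphorus"))
print(retention_ratios(periods, "nitrogen", "phosphorus"))
```

A shift in such ratios from one period to the next points to a change in the kind of tissue being formed, which a single measurement of one element would not reveal.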

In children there is a delicate balance 
between the dynamics of energy and 
protein metabolism during protoplasmic 
growth as demonstrated by the meta- 
bolic balance method of nitrogen reten- 
tion, by the twenty-four hour urinary 
excretion of total nitrogen and crea- 
tinine, and by the determination of the 
basal metabolic rate. Basal metabolism 
and urinary creatinine have been as- 
sociated in one way or another with the 
protoplasmic mass contained in the 
body. Records of oxygen consumption, 
urinary creatinine excretion, and nitro- 
gen storage, singly or together, are in- 
tegral parts in the evaluation of growth 
and nutrition; they serve to validate 
observations on the movement of proto- 
plasm during growth. By combining 
these methods, it has been demonstrated 
that as few as ten calories per kilogram 
of body weight per day may determine 
success or failure in making satisfactory 
growth progress (Macy, 1952). The in- 
dividual food components that serve 
as structural and functional constituents 
of invisible growth of protoplasmic
tissue are measured by the metabolic
balance. The difference between the
total food intake and total outgo of
major nutrients, such as calcium and
nitrogen, over a given period of time
represents the amount that the body
retains for its structural and functional
purposes.
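
The balance arithmetic described above can be sketched in a few lines. This is a minimal illustration only: the class and function names are assumptions introduced here, and the figures in the usage note are invented to show the calculation, not data from any study.

```python
from dataclasses import dataclass


@dataclass
class BalancePeriod:
    """Totals for one nutrient over one collection period (one unit throughout)."""
    days: int
    intake: float  # nutrient in the food actually eaten
    urine: float   # nutrient excreted in the urine
    feces: float   # nutrient excreted in the feces


def retention(period: BalancePeriod) -> float:
    """Retention = total intake minus total outgo (urine plus feces)."""
    return period.intake - (period.urine + period.feces)


def daily_retention(period: BalancePeriod) -> float:
    """Average retention per day of the collection period."""
    return retention(period) / period.days


# Hypothetical five-day calcium balance, all quantities in grams.
example = BalancePeriod(days=5, intake=4.5, urine=0.5, feces=3.0)
print(retention(example), daily_retention(example))
```

A negative result from `retention` would indicate that outgo exceeded intake for the period. The sketch ignores other routes of loss, such as the skin, which a real study would have to consider.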

The metabolic balance method pro- 
vides valuable information on the nu- 
tritional status of the individual. This 
nutritional status markedly influences 
the amount and rate at which nutri- 
ments are used or rejected by the body. 
When the plane of nutrition is lowered, 
the malnourished individual will tend
to store larger amounts of nutriments
when his diet is improved. When a
surplus of nutrients is supplied to body
cells and tissues which have been de-
prived of adequate protein or calcium,
for instance, the cells gradually improve 
their nutritive condition until they 
reach an optimum, at which point they 
lose their power to attract additional 
protein and calcium. There is provision 
in the tissues for deposit of certain 
elements in excess of current needs, a 
bodily reserve to meet the stress of 
sudden increased demands. The meta- 
bolic balance method is useful in a wide 
variety of studies of both visible and 
invisible growth and nutrition. The 
various types of results obtained by the 
metabolic balance procedure, including 
the evaluation of nutrient metabolism, 
absorption, retention, and metabolite 
excretion in relation to physical growth 
and physiologic function, contribute 
basic data to the development of nutri- 
tional anthropology (Brozek, 1956) 
and chemical anthropology (Macy and 
Kelly, 1957).

The value of the metabolic balance
method depends on the length of the 
study interval, the accuracy with which 
representative homogeneous samples of 
food, urine, and feces are collected, and 
the precision of the chemical determina- 
tions. Existing chemical methods for 
some elements have greater sensitivity 
than others; some procedures are more 
satisfactorily applied than others. One 
fact cannot be too strongly emphasized 
in the pursuit of knowledge of invisible 
growth, namely, that a few data from 
coordinated procedures carefully ex- 
ecuted, recorded, interpreted, and 
clearly presented are of far greater value 
than a large number of less reliable 
figures (Macy, 1942). 

An adequate diet is the first requisite 
in the study of the normal processes of 
nutrition and of chemical and phys-
iologic growth and function. Such a
diet contains the essential nutrients in 
the desired amounts and proportions. 
It furnishes a generous supply of protein 
with its components of essential and 
other amino acids and of the various 
minerals and vitamins for the synthesis 
and maintenance of protoplasmic and 
skeletal tissues. To suppress growth in 
childhood by depleted or minimal food 
intake, judged essential by some in- 
vestigators for determining food re- 
quirements, may be unwise. The normal 
pattern of growth may be disturbed or 
even thwarted, since the body receiving 
an inadequate food supply is required 
to make other physiologic adjustments 
to attain homeostasis at a lower plane 
of nutrition. 

Criteria for Selecting Experimental 
Subjects. It is difficult to make detailed 
observations on a child, especially when
experimental control and conditions re-
quire regulation of his existence and
environment beyond a few days. A 
study of the chemistry of tissue re- 
placement and growth, and the es- 
tablishment of chemicophysiologic 
criteria for identifying and evaluating 
progress in the child, requires the 
maintenance and observation of the 
subjects under consistent environmental 
conditions at regular intervals; at the 
same time, a battery of biochemical, 
physical, clinical, and nutritional tests 
should be given (Nutrition for Na- 
tional Defense, 1957; Jolliffe, Tisdall, 
and Cannon, 1950; Lowry, 1952; Wohl 
and Goodhart, 1955). 

In selecting experimental subjects, 
we cannot separate physical and mental 
characteristics of an individual and at- 
tribute them to their respective causa- 
tions; therefore, it is necessary to accept 
normal individuals who conform to 
accepted standards of growth, develop- 
ment, nutrition, and behavior and who 
possess no detectable manifestation 
commonly considered abnormal.

By repeated observations and meas- 
urements under controlled conditions 
favorable to growth and development, 
an estimation may be made of the sub- 
ject’s progress in biologic time, though 
we cannot determine the relation be- 
tween the actual change and the 
capacity for change. 

Consistent Control of Experimental 
Subjects. Consistent control is important 
in a study of the intricate chemical and 
physiologic processes of invisible 
growth, and equal consideration should 
be given to emotional and physiologic 
factors, for they are deeply intertwined 
and each is directly influenced by the
other. Cardinal requirements include
wholesome diet, good habits of hygiene,
regular periods of sleep and exercise,
wholesome companionship and con-
structive entertainment, and sympa-
thetic understanding and security, thus
inducing joy and happiness in living.
These factors contribute to the interest
and well-being of the subject and the
successful organization and conclusion
of the study. An adequate dietary in-
take is basic to the study of normal
growth processes. The Recommended
Dietary Allowances of the Food and
Nutrition Board of the National Re-
search Council (1958) is a useful guide
in planning a satisfactory intake for
children of different age levels (Table
6-2).

Cooperation of Experimental Sub- 
jects. Cooperation is essential in all 
metabolic procedures and observations. 
The children should be protected from 
fear, for emotional stress may, in it- 
self, disturb natural chemicophysiologic 
functioning. It is desirable to try out 
each possible subject with a few simple 
tests to determine his qualifications for, 
and adaptability to, intensive training in 
metabolic balance procedures. Should 
the subject prove unsuitable, large ex- 
penditures of time, effort, and money in 
collecting unreliable data can be 
avoided. Each study should be con- 
tinued as long as possible, unless ex- 
tended observation is not compatible 
with unhampered physiologic and psy-
chologic development.

Nutrients Studied. Some sixty nu- 
trients participate in the structure and 
functioning of the human body. Many 
are known to be essential for life and 
health and must be obtained directly
from the food eaten. Each essential nu-
trient plays a specific role in metabolic
and enzymatic processes and phys-
iologic function. Because the oppor-
tunity for study is so great, an in-
vestigator must choose carefully the
area within which to work. For detailed
methods of chemical analysis and the
collecting and handling of the meta-
bolic materials, specific texts should be
consulted (Macy, 1942, 1946, 1951).
New chemical procedures should also
be sought out.


TABLE 6-2. RECOMMENDED DIETARY ALLOWANCES FOR CHILDREN, FOOD AND NUTRITION BOARD OF THE NATIONAL RESEARCH COUNCIL

Designed for the Maintenance of Good Nutrition of Healthy Persons in the United States
(Allowances are intended for persons normally active in a temperate climate)

Columns: Age (years); Weight (kg., lb.); Height (cm., in.); Calories; Protein (gm.); Calcium (gm.); Iron (mg.); Vitamin A (I.U.); Thiamine (mg.); Riboflavin (mg.); Niacin (mg. equiv.); Ascorbic acid (mg.); Vitamin D (I.U.).
Rows: Infants, 2-6 mo. and 7-12 mo.; Children, 1-3, 4-6, 7-9, and 10-12 years; Boys, 13-15 and 16-19 years; Girls, 13-15 and 16-19 years.
[Numerical entries are not legible in this copy.]

Adapted from the Food and Nutrition Board, National Research Council, "Recommended Dietary Allowances," Revised 1958.

The allowance levels are intended to cover individual variations among most normal persons as they live in the United States under usual environmental stresses. The recommended allowances can be attained with a variety of common foods, providing other nutrients for which human requirements have been less well defined.

Niacin equivalents include dietary sources of the preformed vitamin and the precursor, tryptophan; 60 milligrams tryptophan equals 1 milligram niacin.

The Board recognizes that human milk is the natural food for infants and feels that breast feeding is the best and desired procedure for meeting nutrient requirements in the first months of life. No allowances are stated for the first month of life. Breast feeding is particularly indicated during the first month when infants show handicaps in homeostasis due to different rates of maturation of digestive, excretory, and endocrine functions. Recommendations as listed pertain to nutrient intake as afforded by cow's milk formulas and supplementary foods given the infant when breast feeding is terminated. Allowances are not given for protein during infancy.

Subjects of metabolic balance studies 
can be standardized to some extent, 
preliminary to actual study of chemical 
and physiologic growth. They can be 
established in the environment in which
the observations are to be carried out,
the routine of the study can be initiated,
and diets can be given which will insure
ample quantities of all essential food
constituents. Even after all these efforts,
further standardization may occur dur-
ing the course of metabolic studies over
extended periods of time. Figure 6-1
illustrates the use of the metabolic
balance method in studying calcium
nutriture and growth and progressive
standardization of children, 4 to 8 years
of age, over six successive months.
The children received diets composed
of the same foods, in daily quantities
considered adequate, and containing an
average of 0.9 gm. of calcium per day.
As the study proceeded, the children,
regardless of age or size, exhibited less
variation from each other, and they be-
came more alike in the amount of
calcium they were able to retain.

Fig. 6-1. Cumulated calcium retentions for six consecutive months (Macy, 1942, p. 40).

The accretions of the chemical units
of nitrogen and calcium determined by
the metabolic balance procedure have
been used as a measure of cellular and
skeletal formation and as a means of
observing the metabolic patterns of
protoplasmic and skeletal activities as
the body adapts to growth and devel-
opment or reacts to injury or processes
of regeneration.


PHYSIOLOGIC METHODS


Growth and nutrition may be studied
by physiologic methods arising from
different sources and following different
metabolic routes. They are equally use-
ful, accurate, and readily applied. These
methods include measurement of oxy-
gen consumption and urinary creatinine
excretion. Since the results of these
methods have long been associated with 
the amount of metabolically active 
protoplasmic tissue present in the body 
at a given time, they contribute sup- 
plementary and related data to those 
obtained by the nitrogen metabolic 
balance method. 

Determination of Oxygen Consump- 
tion. The rate at which an individual 
is utilizing oxygen without the influence 
of food or muscular work is the basis 
for computing the basal metabolic rate 
(BMR), a measure of the energy re- 
quired for the fundamental processes 
which must go on in the body to main- 
tain life. 

The assessment of oxygen consump-
tion and basal heat production measures
the metabolically active protoplasmic 
tissue in the body and therefore is an 
integral part of the problem of the 
evaluation of growth and nutrition 
(Lewis, Kinsman, and Iliff, 1937; 
Nylin, 1929). The determination of 
the basal energy expended by the proto- 
plasmic tissues when the modifying fac- 
tors of activity, food intake, and 
environmental temperature are produc- 
ing their minimal effect gives the 
minimal heat production of the energy 
metabolism of muscles and internal 
organs of the body, including the in- 
voluntary muscular activity of breath- 
ing, of the heart in the resting state, and 
of the smooth muscles of the alimentary 
canal. The test records the performance 
of the body at a specific time under a 
given set of conditions. The relative 
proportion of active and inactive tissue 
varies greatly with different individuals 
and may vary considerably in any one 
individual at different times. The proto- 
plasmic tissue is metabolically active 
and influences oxygen consumption, 
whereas the fat (neutral) and water of 
the body composition are inactive. 
The Benedict-Roth closed-circuit 
portable apparatus (Kleiner and Orten, 
1958) or a comparable instrument may 
be used. The subject breathes pure oxy- 
gen from a closed system through a face 
mask or some similar device, and the air 
returned by the patient passes through 
soda-lime, which removes the carbon 
dioxide produced. The apparatus con- 
sists of a device that contains oxygen 
and records changes in its volume. From 
the rate of oxygen consumption, the 
rate of heat production is computed. 
The test is made in the postabsorptive 
state, no food having been consumed
since the evening before. For accurate
results, the child should be carefully 
trained in the technique until he is un- 
afraid, is emotionally stable, and will 
remain quiet and relaxed during the test. 
He should be kept warm and com- 
fortable but awake and should rest a 
half hour before the test is made. 
Coincident with the determination of 
basal metabolic rate, records of pulse 
rate, body temperature, respiratory 
rate, and blood pressure are tabulated. 
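
The step from oxygen consumption to heat production can be illustrated with a short sketch. It assumes that the measured oxygen volume has already been corrected to standard temperature and pressure (dry), and it assumes a calorific equivalent of 4.825 kcal per liter of oxygen, a value conventionally used with closed-circuit apparatus for a postabsorptive subject. Both assumptions, the function name, and the example figures are introduced here for illustration and are not taken from the text.

```python
# Assumed calorific equivalent of oxygen in the postabsorptive state
# (kcal per liter of oxygen consumed); not a value given in the text.
KCAL_PER_LITER_O2 = 4.825


def basal_heat_production(o2_liters: float, minutes: float,
                          kcal_per_liter: float = KCAL_PER_LITER_O2) -> float:
    """Estimate basal heat production in kcal per 24 hours.

    o2_liters is the oxygen consumed during the test, already corrected
    to standard conditions; minutes is the length of the test.
    """
    if minutes <= 0:
        raise ValueError("test duration must be positive")
    liters_per_minute = o2_liters / minutes
    return liters_per_minute * 60 * 24 * kcal_per_liter


# Hypothetical test: 1.2 liters of oxygen consumed in 8 minutes.
print(round(basal_heat_production(1.2, 8.0), 1))
```

Dividing the result by body weight or by surface area gives the rate on whichever standard the investigator has chosen.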

Determination of Urinary Creatinine 
Excretion. Urinary creatinine excretion 
per twenty-four hours is considered an 
index of the amount of active proto- 
plasmic tissue in the body. Early in- 
vestigators showed that endogenous 
protein metabolism, which results in
the constant, even production of crea-
tinine, is a manifestation of cell meta-
bolism (Folin, 1905) or an index of
muscular mass (Shaffer, 1908). There is
an increase in both the absolute and
relative excretion of creatinine from
birth to puberty (Clark and Thompson,
1949; Stearns et al., 1958). Creatine,
likewise, increases in early childhood
but levels off at about 4 years of age
and decreases during puberty. Figure
6-2, illustrating the relationship of
urinary creatinine and creatine excre-
tion of children from birth onward,
has been furnished through the courtesy
of Dr. Genevieve Stearns.

Fig. 6-2. Urinary creatinine and creatine excretion of children with age (courtesy of Dr. Genevieve Stearns).

Fig. 6-3. Creatinine excretion versus basal heat production (Macy, 1942, p. 146).

Urinary creatinine and basal heat
production are associated with the
amount and activity of the protoplasmic
tissue present in the body and with the
rapidity of growth and maintenance of 
protoplasm. Basal heat production and 
creatinine excretion are related and in- 
crease with body size and age (Fig. 
6-3). Both result from the metabolic 
activity of all the individual body 
organs and tissues. The fact that meas- 
urements of oxygen consumption and 
creatinine excretion are derived from 
different sources may account for the 
various patterns of growth when con- 
sidered on the basis of different stand- 
ards, such as height, weight, and body 
surface area. 

The protoplasmic cellular component, 
which is an essential element of all 
normal body mass of a given species, is 
the center of metabolic activity. The 
oxygen consumption and the urinary
creatinine excretion are directly as-
sociated with the amount of active
protoplasmic tissue present in the body 
and also may be greatly influenced by 
the rapidity with which protoplasm is 
forming. Although it increases with 
growth in physical size, metabolic 
activity varies also with the amount, 
stage, and rapidity of development of 
various organs and tissues, since on a 
unit basis some tissues have greater 
metabolic activity than others, as shown 
by the studies of oxygen consumption 
of isolated organs and tissues. These 
methods are useful in assessing chemical 
growth and physiologic function and 
nutritional adequacy and status (Macy 
and Kelly, 1957). 

In the assessment of nutritional ade-
quacy and chemical and physiologic
growth of children, the metabolic
balance and the physiologic procedures 
of determining oxygen consumption 
and urinary creatinine excretion can 
be used successfully in evaluation of 
chemical growth. 


COMPOSITION 
OF THE HUMAN BODY 


There are two methods of acquiring 
knowledge on the composition of the 
human body with respect to its major 
components of total water, fat, and 
solids-not-fat. The direct in vitro 
method is based on the chemical 
analysis of whole corpses or parts of 
bodies obtained at autopsy. The in vivo 
analysis of the living human being is 
concerned with various procedures, 
used singly or together, in breaking 
down the body composition into water, 
fat, and protoplasmic mass of nonfat 
solids and assessing the relative 
quantities of each. The concentrations
of the major constituents change with 
age and nutritive state. Therefore, the 
assessment of the composition of the 
body and of the gain or loss in body 
weight has special significance during 
growth and development. 

In vitro chemical analysis has been 
applied to a few human bodies. The in- 
dividuals reaching autopsy and made 
available to the biochemist for chemical 
analysis too frequently have had ques- 
tionable health records, and conse- 
quently the data resulting from the 
determinations may not represent nor- 
mal body composition values. They are 
helpful, however, as guides and for 
general interest. 

More data have been accumulated on
the composition of fetuses and young
infants than on adults. Only one male
child, aged 4½ years, possessing a body
weight of 14.0 kilograms and height of
107 centimeters, has been analyzed.
Figure 6-4, adapted from the data of
Widdowson, McCance, and Spray
(1951), illustrates the changes that may
occur in body composition with age.
These investigators state that “many
more bodies will have to be analyzed to
establish reliable figures for the normal
average and its variation.”

Allometry is a mathematical tech- 
nique for characterizing the relation- 
ships of size or concentration of two 
parts or constituents of the body. Hunt 
and Giles (1956), using selected chemi-
cal analyses in the literature, made a
valuable study of the allometric growth 
of body composition in man and other 
mammals. Describing in detail the 
various allometric equations by which 
developmental trends in body composi- 
tion of man were determined, they ap- 
praised the normative composition of a 
10-year-old boy to be about 24 per cent 
fat, 55 per cent water, and 21 per cent 
nonfat solids, as compared with 14 per 
cent fat, 61 per cent water, and 25 per 
cent nonfat solids in the adult reference 
man. 
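
As an illustration of the technique, the simple allometric relation is usually written y = b·x^k, which becomes a straight line, log y = log b + k·log x, when both measurements are expressed as logarithms. The sketch below fits b and k by least squares on the logarithms. It is a generic example of the method, not a reproduction of the equations of Hunt and Giles; the function name and the data are hypothetical.

```python
import math


def fit_allometric(xs, ys):
    """Fit y = b * x**k by least squares on log-transformed values.

    Returns (b, k). All x and y values must be positive.
    """
    log_x = [math.log(x) for x in xs]
    log_y = [math.log(y) for y in ys]
    n = len(log_x)
    mean_x = sum(log_x) / n
    mean_y = sum(log_y) / n
    sxx = sum((u - mean_x) ** 2 for u in log_x)
    sxy = sum((u - mean_x) * (v - mean_y) for u, v in zip(log_x, log_y))
    k = sxy / sxx                       # slope of the log-log line
    b = math.exp(mean_y - k * mean_x)   # intercept, back-transformed
    return b, k


# Hypothetical data generated from y = 0.2 * x**1.1, for illustration only:
# x might be body weight and y the weight of one constituent.
weights = [3.0, 10.0, 30.0, 70.0]
parts = [0.2 * w ** 1.1 for w in weights]
b, k = fit_allometric(weights, parts)
print(round(b, 3), round(k, 3))
```

An exponent k greater than 1 means the part grows faster than the whole, and k less than 1 means it grows more slowly.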

In vivo assessments of body composi- 
tion are indirect estimates. Although 
estimates are based on certain assump- 
tions, if a number of assessments made 
independently display the same trend, 
it would justify the use of these esti- 
mates for observing growth and de- 
velopment. A major advantage of in 
vivo assessments, other than their 
feasibility, is that they can be used for 
longitudinal studies.

Fig. 6-4. Protein, water, and fat composition of the body during infancy, childhood, and adulthood, as adapted from the data of Widdowson et al. (Macy and Kelly, 1957, p. 71).


Fat. Physical anthropologists estimate
over- and underweight with varying
degrees of control; for example, in ad-
dition to sex, a single factor, such as age
or height; two factors, such as age and
height; three factors, such as age,
height, and hip width; four factors,
such as age, height, width of hip, and
chest breadth (Pryor, 1943); five
factors, such as age, height, width of
hip and knee, and chest circumference,
with chest and hip measurements cor-
rected for fat (McCloy, 1938). In each
case the standard weight used as a
reference point is an average of a group
of individuals similar in one or more
factors. The addition of some measure
of body type to that of age and height
increases the value of the estimated
standard weight. The ratio of observed
to standard weight is a measure of the
amount of weight over or under the
reference point, but it may be the
result of fat, muscle, or water.

Other kinds of fat assessments have 
been used, as from skin fold or roent- 
genographic measurements (Reynolds, 
1950; Stuart and Sobel, 1956). With the 
roentgenographic technique, it has been 
observed that the thickness of sub- 
cutaneous fat (calf breadth) shows a 
rise in infancy, some decrement in 
childhood, and another rise in adoles-
cence and that early-maturing children
show a greater mean relative thick-
ness of fat (calf breadth) than late-
maturing children. These estimates are
for specific areas and are not over-all
estimates.

Specific Gravity. Two substances of 
equal size may have very different 
weights. For example, a bar of alumi- 
num weighs less than a bar of equal 
size made of iron. Similarly equal por- 
tions of fat, water, and body solids 
differ in weight. Equal portions of fat 
weigh less than water, which in turn 
weighs less than other substances, such 
as muscle, bone, and minerals. Density 
or the ratio of mass (or weight) to 
volume serves as a measure for com- 
paring weight on a unit volume basis. 
Specific gravity or the ratio of the 
density of a substance to the density of 
water at a specified temperature is a 
more refined measure for comparing 
substances of equal size on a unit weight 
basis as it is corrected for temperature 
changes. 

On the assumption that the specific 
gravity of the nonfat portion of the 
body remains fairly constant, the person 
with the lowest specific gravity would 
have the greatest percentage of fat. 
However, there are exceptions. Edema 
would alter the rating, but corrections, 
when edema is suspected, could be 
made. Other factors, such as the chang- 
ing specific gravity of the skeleton due 
to ossification during growth, may in- 
fluence the estimate. In order to deter- 
mine the specific gravity of a person, 
the only difficult measure to obtain is 
the volume. The weight can be obtained 
by the usual methods under controlled 
conditions, such as nude values for a 
specified room temperature, and the 
density of water for specified tempera-
tures has been recorded in chemistry
and physics handbooks. The volume of
a person can be obtained by means of
water displacement; that is, a body
submerged in a tank of water displaces
a portion of water equal to the volume
of the body. It is necessary, however,
to have a tank constructed in such a 
manner that water displacement can 
be accurately measured under con- 
trolled temperature conditions by 
trained personnel. Zook (1932) used the 
method of submersion of the body to 
determine the specific gravity as a 
measure of body composition. Boyd 
(1933) has reported formulas for esti- 
mating specific gravity from height and 
surface area that check the specific 
gravity values for children obtained by 
Zook with the submersion technique. 
Most values reported in the literature 
have been for adults. 

Formulas have been derived for es- 
timating body fat by specific gravity, 
but at the present time the available 
values for the specific gravity of children 
when applied to this formula give un- 
usually high results. The specific gravity 
values of Zook and Boyd give fat 
percentage values of 38 to 51 per cent 
for children 4 to 12 years of age; 
these values are based on the fat-predic-
tion equations of either Pace and Rath-
bun (1945) or Keys and Brozek (1953). 
The latter investigators suggested that 
the reported submersion values for 
children may not have been corrected 
for air in the lungs and respiratory pas- 
sages and for gas in the gastro-intestinal 
tract. 
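
The effect of the air correction can be seen in a small worked sketch. The constants below are those commonly quoted for the Rathbun and Pace equation (per cent fat = 100 × (5.548 / specific gravity − 5.044)) and the Keys and Brozek equation (fat fraction = 4.201 / density − 3.813); they are given from general knowledge and should be checked against the original papers. The function names and the child's measurements are hypothetical.

```python
def specific_gravity(weight_kg: float, volume_liters: float,
                     water_density_kg_per_liter: float = 1.0) -> float:
    """Body density divided by the density of water at the tank temperature."""
    return (weight_kg / volume_liters) / water_density_kg_per_liter


def percent_fat_rathbun_pace(sp_gr: float) -> float:
    """Per cent body fat from specific gravity (constants as commonly quoted)."""
    return 100.0 * (5.548 / sp_gr - 5.044)


def percent_fat_keys_brozek(density: float) -> float:
    """Per cent body fat from body density (constants as commonly quoted)."""
    return 100.0 * (4.201 / density - 3.813)


# Hypothetical child: 25 kg, displacing 24.9 liters of water.
weight = 25.0
displaced = 24.9
air = 0.7  # assumed liters of air in lungs and gut at the time of submersion

uncorrected = specific_gravity(weight, displaced)
corrected = specific_gravity(weight, displaced - air)

print(round(percent_fat_rathbun_pace(uncorrected), 1))
print(round(percent_fat_rathbun_pace(corrected), 1))
```

Because trapped air adds volume without adding weight, an uncorrected submersion value lowers the apparent specific gravity and raises the estimated fat, which is the direction of error suggested for the children's values.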

Body water can be estimated by the 
dilution method. A substance of known 
quantity is injected into the blood 
stream, and by the process of osmosis it
permeates the body fluids. After allow-
ing sufficient time for the concentrations
of the substance to equalize throughout
the body fluids, the concentration level
of the substance is determined in a
blood sample and from it the total
volume of body fluid can be cal-
culated. The substances that have
proved most satisfactory have been
deuterium (heavy) water, antipyrine,
and urea. Friis-Hansen et al. (1951)
used the deuterium oxide and anti-
pyrine dilution methods for determining
the body water for children up to 11
years of age. Using total water es-
timates and physical measurements, they
derived formulas for estimating body 
water from weight and also from sur- 
face area. Studies on animals on different 
planes of nutrition support the theory 
that the fat-free body has a fixed com- 
position. On the basis of this assump- 
tion, it would be possible to estimate the 
fat-free portion of the body from body 
water by simply dividing the body 
water value by the percentage of water 
in the lean body mass (fat-free). We 
have used the value 73.2 as the per- 
centage of water in the lean body mass 
for children. This value is slightly 
higher than that of adults as a result of 
analysis made on young and adult 
bodies (Macy and Kelly, 1956, 1957). 
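
The chain of calculations in this paragraph can be sketched as follows. The 73.2 per cent figure is the one quoted above for children; the function names and the example values are hypothetical, and the dilution step is simplified in that it assumes none of the test substance is excreted or metabolized before the sample is drawn.

```python
WATER_PERCENT_OF_LEAN = 73.2  # per cent water in the lean body mass (from the text)


def body_water_liters(dose_mg: float, concentration_mg_per_liter: float) -> float:
    """Dilution principle: volume = amount given / equilibrium concentration."""
    return dose_mg / concentration_mg_per_liter


def lean_body_mass_kg(body_water_kg: float,
                      water_percent: float = WATER_PERCENT_OF_LEAN) -> float:
    """Fat-free mass, assuming the lean body has a fixed water content."""
    return body_water_kg / (water_percent / 100.0)


def body_fat_kg(weight_kg: float, body_water_kg: float) -> float:
    """Fat is taken as body weight minus the estimated lean body mass."""
    return weight_kg - lean_body_mass_kg(body_water_kg)


# Hypothetical child weighing 30 kg with 18.3 kg (about 18.3 liters) of body water.
lean = lean_body_mass_kg(18.3)
fat = body_fat_kg(30.0, 18.3)
print(round(lean, 1), round(fat, 1))
```

Note that the division is by the fraction 0.732, not by the number 73.2; the estimate of fat is only as good as the assumption of a fixed water content in the lean body.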

Water can be separated into ex- 
tracellular and intracellular water. Ex- 
tracellular fluid volume (McCance and 
Widdowson, 1951) can be obtained 
from a measure of the thiocyanate 
space. The subject, having emptied his 
bladder, is given intravenously a solu- 
tion of sodium thiocyanate. After a 
period of time, during which the subject
moves about the room, about 10 milli-
liters of venous blood, as well as the
urine excreted during the period, is 
collected. The concentration of thio- 
cyanate in the serum and in the urine 
is measured, and the thiocyanate space 
is determined. The measure of thio- 
cyanate space should be corrected for 
the water in the red cells. Although the 
thiocyanate space may be larger than 
the true volume of extracellular fluid, 
at present it provides the most practical 
way of measuring extracellular fluid. 
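
A minimal sketch of the calculation, under the assumption that the space is taken as the amount of thiocyanate remaining in the body divided by its serum concentration, is given below. The function name and the figures are hypothetical, and the correction for red-cell water mentioned above is deliberately left out.

```python
def thiocyanate_space_liters(injected_mg: float, urine_mg: float,
                             serum_mg_per_liter: float) -> float:
    """Apparent volume of distribution of thiocyanate.

    The amount lost in the urine during the equilibration period is
    subtracted from the amount injected before dividing by the serum
    concentration. No red-cell correction is applied here.
    """
    if serum_mg_per_liter <= 0:
        raise ValueError("serum concentration must be positive")
    return (injected_mg - urine_mg) / serum_mg_per_liter


# Hypothetical values: 500 mg injected, 20 mg recovered in urine,
# serum concentration 60 mg per liter.
print(thiocyanate_space_liters(500.0, 20.0, 60.0))
```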

Basal metabolism and creatinine ex- 
cretion have long been associated with 
active protoplasmic tissue. Behnke 
(1953) and Miller and Blyth (1952), 
using the specific gravity value ob- 
tained from underwater weighings to 
calculate lean body mass, derived 
formulas for estimating the lean body 
weight of adults from basal oxygen 
consumption. Miller also derived for-
mulas for estimating lean body mass
from urinary creatinine. The formulas
are for adults. No similar studies have
been made for children, though for- 
mulas can be adapted for children on the 
assumption that the relative lean body 
mass is on the average the same during 
childhood and adulthood (Macy and 
Kelly, 1956, 1957). 


NUTRITIONAL ADEQUACY 
AND STATUS 


Although we have no objective 
method of estimating the influence of 
an individual’s genetic inheritance, nu- 
tritional investigations during and after 
World War II produced ample experi- 
mental evidence to demonstrate that 
the food eaten may enhance or distort 
the growth potentialities conferred by
heredity (Toverud, Stearns, and Macy,
1950). It is believed that biochemical, 
nutritional, and pathologic studies 
focused on cells, organs, and tissue 
systems, as well as on the organism as 
a whole, will detect some nutritional 
deficiencies at their source (Jolliffe, 
Tisdall, and Cannon, 1950; Wohl and 
Goodhart, 1955). 

Nutritional adequacy and status (nu- 
triture) may be studied in one of 
several ways: by the nutritional-status 
survey technique of applying integrated 
clinical, dietary, and biochemical pro- 
cedures (Nutrition for National De- 
fense, 1957; Spector, Peterson, and 
Friedemann, 1954), by nutritional an- 
thropology (Brozek, 1956), in which 
body measurements with emphasis on 
body composition are applied, and by 
chemical anthropology (Macy and 
Kelly, 1957), in which chemical analy- 
sis applied to food eaten and meta- 
bolites excreted yields an evaluation of 
the nutrients absorbed and retained in 


relation to physical growth and 
physiologic function. 
Nutritional status assessments have 


varying degrees of accuracy and value, 
depending on the methods used in ob- 
taining the results and the precision 
with which they are applied. They 
range from qualitative to quantitative, 
used singly or in combination. Some 
measurements have the advantage of 
being direct determinations; others are 
indirect. 

Not all measurements of nutritional 
status are equally practicable or equally 
informative at all ages from birth 
through maturity. Some determinations 
require little equipment; some require 
expensive equipment less readily avail-
able and skilled personnel; others are
practical in the laboratory but rarely
practical in the field. Nutritional status
may be determined in different ways
within the framework and scope de-
termined by the objectives of each in-
vestigation.

Good nutrition depends on selection 
of a balanced diet and a knowledge of 
nutritive value of foods. It does not 
necessarily depend on economic factors,
although a higher economic status may
permit a higher nutritional level. The
serious effects of nutritional deficiencies
may be greatly increased by growth
needs. The nutritional status of the in-
dividual markedly influences the amount
and the rate at which nutrients are
used by the body. Tissue nutrition de-
pends on the relation between the
supply of nutrients to the tissue and the
tissue requirement. The tissue essential
and the dietary essential are different. 
Kruse (1955) comments that the dietary 
essential must be present in the diet if 
nutrition is to be successful—a tissue es- 
sential is a nutrient that a tissue must 
receive for its nutrition. 

Field nutritional surveys, which are 
organized to indicate the presence or 
absence of relationships between nu- 
tritional status and health, employ a 
tripart combination of clinical inspec- 
tion and medical history, dietary his- 
tory, and biochemical determination of 
sera blood components (Nutrition for 
National Defense, 1957; Jolliffe, Tisdall, 
and Cannon, 1950; Spector, Peterson, 
and Friedemann, 1954). In other studies 
the nutritional spectrum for any single 
nutrient may be based on response to 
dietary therapy and urinary excretion 
of the metabolite of that nutrient. 
Assessment of nutritional status is a
separate problem for each nutrient. A
test that is appropriate for one nutrient
may be of no value for another. Each
nutrient has a specific type of
measurement suited to one purpose and not
to another. The essential components
determined in the blood may represent
various developmental, functional, and
metabolic processes and involve several
body systems.

The clinical and physical examinations
include signs of malnutrition and
consist in a thorough medical examina- 
tion to detect other diseases or con- 
ditioning factors that may influence 
nutritional status and special examina- 
tion to detect clinical signs in identifica- 
tion of deficiency diseases. 

The purpose of the dietary history is 
to detect unsatisfactory dietary patterns 
that may cause deficiency diseases or 
malnutrition. If a deficiency disease 
exists, an indication of its etiology may
be discovered; if disease is not detected, 
the connection with an unsatisfactory 
nutritional state may give direction for 
improvement of the diet. Dietary his- 
tories may be qualitative or quantitative, 
depending on the purpose of the study. 

Biochemistry is an aid in the study 
of nutritional status. Information now 
available concerning the chemical struc- 
ture and function of essential nutritive 
factors permits the application of bio- 
chemical procedures for measuring the 
levels of these substances in the blood 
and urine, and for detecting metabolic 
abnormalities that occur when the 
supply of these nutrients is too low for
health. With an adequate intake of a
particular nutrient, it is possible to
establish the corresponding parameters
for tissue, blood, and urine, which may
be used in nutritional assessment. When 
the food intake or tissue level is in- 
adequate, the enzymatic capacity of that 
tissue or the nutritional adequacy of the 
whole body is diminished; under these 
conditions, it becomes easier to assess 
that intake or tissue level. 

Specific nutrients studied with the 
aid of biochemical methods are
hemoglobin, serum total protein, ascorbic acid,
vitamin A, carotenoids, thiamine, ribo- 
flavin, and niacin. Others depend on the 
worker's interest. Many recent publica- 
tions are ready and reliable sources of 
helpful information on methods of 
sampling, on clinical appraisal of nutri- 
ture, on biochemical procedures for 
collecting, storing, and analyzing sam- 
ples of blood and urine, on dietary 
evaluations, and on suggested methods 
of interpretation. For the investigator 
who wishes specific details, the original 
publications should be consulted. The 
integrated results of the clinical, dietary, 
and biochemical surveys indicate the 
presence or absence of relationships be- 
tween nutritional status and health. 
The method can be used in the study of 
population groups or as a supple- 
mentary aid in other types of studies, 
such as growth. 

Dietary intake studies may be qualita- 
tive or quantitative and of varying dura- 
tion, depending on the requirement of 
the investigation. Records taken for 
seven successive days or longer on the 
quality of food eaten at intervals of 
time are informative. When supple- 
mented by carefully taken dietary his- 
tories and twenty-four-hour recall 
records of the food intake, a broad 
qualitative estimate of the dietary 
pattern may be obtained for comparison
with the Recommended Dietary Al- 
lowance of the Food and Nutrition 
Board, National Research Council (Na- 
tional Academy of Sciences, 1958). Re- 
gardless of the limitations attributed to 
evaluation methods based on dietary his- 
tories and records, carefully taken and 
skillfully evaluated qualitative estima- 
tions for large population groups 
provide information on food intake 
and dietary habits that otherwise could 
not be recorded quantitatively (Burke, 
1947; Hunscher and Macy, 1951). Ap- 
proximations of nutrient intake for 
groups possessing racial, socioeconomic, 
and other differentiating characteristics 
indicate the need for public health 
measures and individual instruction 
aimed at desirable improvements in diet. 
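
The comparison of a dietary record with an allowance can be sketched in
a few lines. The following is only an illustration: the seven-day
record, the nutrient names, and the allowance figures are hypothetical
placeholders, not survey data and not the values of the Recommended
Dietary Allowance, and the function name is invented for the example.

```python
from statistics import mean

# Hypothetical seven-day record for one child (one dict per day).
# Every figure here is an illustrative placeholder, not survey data.
daily_records = [
    {"ascorbic_acid_mg": 30, "protein_g": 50},
    {"ascorbic_acid_mg": 45, "protein_g": 60},
    {"ascorbic_acid_mg": 60, "protein_g": 70},
    {"ascorbic_acid_mg": 30, "protein_g": 50},
    {"ascorbic_acid_mg": 45, "protein_g": 60},
    {"ascorbic_acid_mg": 60, "protein_g": 70},
    {"ascorbic_acid_mg": 45, "protein_g": 60},
]

# Placeholder allowances; substitute the figures from the allowance
# table actually used in the investigation.
allowance = {"ascorbic_acid_mg": 75, "protein_g": 60}


def percent_of_allowance(records, allowance):
    """Mean daily intake of each nutrient as a percentage of its allowance."""
    result = {}
    for nutrient, target in allowance.items():
        average = mean(day[nutrient] for day in records)
        result[nutrient] = round(100 * average / target, 1)
    return result


for nutrient, pct in percent_of_allowance(daily_records, allowance).items():
    print(f"{nutrient}: {pct} per cent of allowance")
```

A percentage well below 100 for a group would point to the kind of
dietary improvement discussed above; the averaging period (seven days
here) follows the record length mentioned in the text.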

Biochemical studies of the blood and 
urine are helpful in the measurement of 
levels of essential nutrients for detecting 
metabolic abnormalities. Compounds 
in serum vary in their behavior: some, 
such as carotenoids and ascorbic acid 
(vitamin C), tend to reflect the im- 
mediate past dietary intake; others, such 
as proteins, may remain at concentra- 
tions that are within the statistically 
normal range, even though clinical mal- 
nutrition may be present. 

Blood Methods. Peripheral blood 
samples may be collected and analyzed 
by microchemical methods developed 
by Bessey and Lowry (Lowry, 1952; 
Moyer et al., 1948). Approximately
0.3 milliliter of blood is drawn from the 
child by finger puncture after cleansing 
the skin with alcohol. Hemoglobin and
total serum protein are determined
immediately after collection and
preparation of the samples. Samples and
protein-free filtrates of serum are prepared
and measured into microtubes and
stored in deep freeze for later
determinations of ascorbic acid, alkaline
phosphatase, vitamin A, and carote- 
noids. Accuracy of the results may be 
attained by taking the sample in the 
morning, preferably representative of 
the fasting state. Variations caused by 
anxiety, fatigue, recent food consump- 
tion, and diurnal influences should be 
decreased to a minimum. Some
investigators may prefer to collect
samples of blood by venipuncture for
use in macrochemical determinations.
Venous samples should be collected
with the minimum stasis, and capillary
samples should be obtained from a
free-flowing puncture. Each investigator
may choose the methods best suited to 
the fulfillment of the objectives of the 
investigation that can be carried out 
with available personnel, facilities, and 
funds. A large number of analyses may 
be made in a routine manner with 
reasonably simple equipment and a high 
degree of accuracy. 

An illustrative application of the 
tripart clinical, biochemical, and dietary 
intake evaluation of the ascorbic-acid 
nutritional status of underprivileged 
children during a stay of eight weeks 
in a health camp is presented in Fig. 6-5. 
Successive determinations of ascorbic 
acid in the blood serum were made 
during the first week, in the third and 
fourth weeks, and after four weeks of 
residence in the camp (Cooperstock 
et al., 1948). Clinical and dietary as- 
sessments accompanied the biochemical 
determinations of blood vitamin C. The
results demonstrate the beneficial
physiologic effect of an adequate diet
and healthful environment on improved
health and vitamin C nutriture. They
demonstrate also that nutritional
conditioning may be rapidly achieved by
the majority of the poorly nourished
children. Clinically, the 99 children,
aged 6 to 16 years, improved and
increased their body weights 0.1 to 25.1
per cent during the camp experience.
Vitamin C is one of the nutrients most
likely to be low in the diets of
underprivileged populations.

Fig. 6-5. Fasting blood serum vitamin C levels of children during first week, in third and
fourth weeks, and after four weeks residence in a health camp (Cooperstock et al., p. 207).
[Scatter plot, "Blood Serum Vitamin C"; panels: July 8-July 12, July 18-August 2,
August 3-August 13; vertical scale 0.0 to 2.2.]

Ascorbic acid is important in the
regulation of normal cell function, in
the formation of intercellular cement 
substances, and in the formation of the 
steroid hormones of the adrenal cortex 
under the stimulus of the adrenotropic 
hormone of the pituitary gland. It is 
especially necessary for the growing 
child because young tissues are richer 
in vitamin C than adult tissues, and the 
tissues of highest metabolic activities are 
characterized by high concentration of 
this nutrient. Since blood levels of 
ascorbic acid reflect the immediate past 
dietary intake, nutritional reconditioning
of the children at the health camp
would be expected in this group living 
under improved diet, medical care, and 
living conditions. 

Nutritional status surveys are useful
as a guide in the improvement of the
environment of so-called healthy
children, for such children are often
subject to the untutored whims of adults
in whose care they fall. For example,
children living in a modern child-caring
institution with devoted housemothers
may be subjected unwittingly to health
hazards caused by misconceptions in
food values (Harrington et al., 1950).
The same type of experience may arise
in the home. Table 6-3 shows the
average blood levels of ascorbic acid of ten
children in each of two cottages on
three successive days. Each cottage was
provided with a generous supply of
carefully selected foods from which
adequate diets could be formulated and
served by the housemothers. In one
cottage the well-meaning housemother
knew that milk was good for growing
children, so she did not serve the citrus
fruit provided by the institution
because of her impression that its bulk
prevented the consumption of a more
desirable and larger intake of milk. By
failing to give the citrus fruit, which
provided for the largest source of
ascorbic acid in the daily diet, she forced
the children to a lowered level of
vitamin C nutriture.


TABLE 6-3. FASTING BLOOD SERUM VITAMIN C LEVELS
FOR TWO GROUPS OF 10 CHILDREN EACH LIVING IN
AN INDIVIDUAL COTTAGE, ON THREE SUCCESSIVE
DAYS

Values in milligrams per 100 milliliters

          Cottage A              Cottage B
Day    Mean    Range          Mean    Range
1      1.13    0.92-1.33      0.12    0.02-0.28
2      1.06    0.54-1.52      0.17    0.00-0.34
3      1.05    0.66-1.51      0.12    0.05-0.23


In comparing the children of two
cottages, the highest and lowest
consumptions of ascorbic acid are
associated, respectively, with maximum and
minimum ascorbic acid concentrations
in the blood. The normal level in
healthy children is 0.7 milligram
or more of ascorbic acid per
100 milliliters of blood, whereas 0.4 to
0.6 milligram per 100 milliliters is the
range in borderline cases and below
0.4 milligram per 100 milliliters in
deficient subjects. The children
unwittingly deprived of an adequate dietary
intake had deficient blood levels;
indeed, in some cases the vitamin C blood
levels were comparable to those in
children with scurvy. Although the
clinical examinations revealed no obvious
disease, the discovery of consistent low
blood levels led to an investigation of
the diet, which revealed the cause.
Immediate improvement of the diet by
restoring the citrus fruit brought the
blood levels back to normal, thus
improving ascorbic acid nutriture rapidly
and preventing malnutrition and
suppression of growth.
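
The classification just described can be applied mechanically to the
cottage means of Table 6-3. The sketch below is illustrative only: the
function name is invented, and treating every value from 0.4 up to (but
not including) 0.7 milligram per 100 milliliters as borderline is an
assumption, since the text gives the borderline range as 0.4 to 0.6 and
does not say how intermediate values are to be rounded.

```python
def classify_vitamin_c(mg_per_100_ml):
    """Label a serum ascorbic acid level using the cut-offs quoted in the text.

    Assumption: values of at least 0.4 and below 0.7 mg per 100 ml are
    all treated as borderline.
    """
    if mg_per_100_ml >= 0.7:
        return "normal"
    if mg_per_100_ml >= 0.4:
        return "borderline"
    return "deficient"


# Daily mean values from Table 6-3 (milligrams per 100 milliliters).
cottage_means = {
    "Cottage A": [1.13, 1.06, 1.05],
    "Cottage B": [0.12, 0.17, 0.12],
}

classified = {
    cottage: [classify_vitamin_c(value) for value in values]
    for cottage, values in cottage_means.items()
}

for cottage, labels in classified.items():
    print(cottage, labels)
```

On these means every day in Cottage A falls in the normal class and
every day in Cottage B in the deficient class, which agrees with the
description of the deprived group above.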

The scatter of the results of blood 
vitamin C shown in Fig. 6-5 demon- 
strates that the individual child has a 
characteristic physiologic capacity to 
respond to improved dietary intake, 
predetermined by his health and the 
extent of his nutritional impairment. 
With but few exceptions, normal vita- 
min C nutriture was attained by most 
underprivileged children within six 
weeks. The so-called healthy children
within the cottage receiving inadequate 
intake of this essential nutrient re- 
sponded even more quickly. Other 
types of nutritional deficiencies, how- 
ever, may respond more gradually over 
a longer period of time. 


OTHER SURVEYS 


Electrophoretic studies for separat- 
ing the plasma proteins into their true 
chemical entities constitute one of the 
major advances of modern biochemis- 
try. The method is based on the migra- 
tion of charged colloidal particles in a 
solution or suspension when electromo- 
tive force is applied. In electrophoresis 
all negatively charged particles migrate 
toward the positive electrode; positively 
charged particles move in the opposite 
direction to the negative electrode. The 
particles migrate with different speeds 
dependent on the number of charges 
each carries—the greater the number of 
charges, the faster the migration. 
Blood proteins in health and disease
have primary significance in the
metabolism of the organism. The amount
and distribution of the blood proteins 
at any time may reflect diverse proc- 
esses concerned with the synthesis, 
maintenance, and dissolution of differ- 
ent body constituents; and the protein 
structure of this fluid, which nurtures 
all tissues, seems to depend more on the 
physiologic state of the individual as 
a whole than does any other tissue.
Changes in the blood proteins with 
age are of great interest and should be 
investigated thoroughly. Electropho- 
retic analysis of the proteins of the 
serum or plasma provides a more de- 
tailed and reliable picture of this labile 
biologic system than other methods. It 
permits accurate estimation of the ratio 
of albumin to globulin, the two major 
protein fractions, and also provides 
quantitative data for the alpha1, alpha2,
beta, and gamma components of plasma 
globulin, in addition to fibrinogen (phi 
globulin). Albumin normally represents 
the largest fraction of the total blood 
(plasma) protein and has three impor- 
tant functions: (1) maintenance and 
stabilization of blood volume and equi- 
librium of fluid exchange between the 
intravascular and extravascular fluid 
compartments of the body, (2) carry- 
ing of organic and inorganic substances, 
and (3) providing nutrients for the tis- 
sues and possibly supplies for the for- 
mation of blood (plasma) globulins. 
An outstanding characteristic of the 
globulin group is the ability to act as 
carriers for other substances, notably 
lipids, lipoproteins, and metals such as 
copper and iron. Fibrinogen is impor- 
tant for its well-known role in blood 
coagulation and in the phenomenon of
erythrocyte sedimentation. Gamma 
globulins are the carriers of immune 
bodies so important in the defense of 
the body against infectious disease. The 
breakdown of alpha into the alpha1 and
alpha2 fractions and the fractionation of
blood proteins into albumin, alpha, 
beta, gamma, and fibrinogen portions 
are especially useful in disease and dur- 
ing the reproductive period. 

Twenty milliliters of blood are 
needed for the chemical analysis of 
total protein and for electrophoresis;
this amount is drawn from a vein in the 
antecubital fossa (depression or pit in 
front of the elbow) with a minimum of
stasis, which can affect the proteins. Each
sample must be handled carefully and 
determined with the greatest care by 
the most accurate procedures and 
equipment described in the latest sci- 
entific literature. It is beyond the scope 
of this chapter to discuss in detail pro- 
cedures and application of results. 

Hemoglobin and other blood pro- 
teins are of primary significance in 
maintaining health, and dietary pro- 
teins must be relied on to furnish the 
essential amino acids of which blood 
proteins are composed. Concentrations
of hemoglobin and total serum protein 
are among the most commonly used 
measures of nutritional status. Many 
types of hemoglobin exist (National 
Academy of Sciences, 1958), however, 
and they, too, may be separated and 
studied electrophoretically. 

Other nutritional status studies re- 
lated to growth may consider the min- 
eral nutrients and lipid distribution of 
blood cells and plasma or be based on 
quantitatively analyzed diets for a
number of nutrients, to name but a few
types of study that give valuable in- 
formation on the relation of nutritional 
status to growth. 

At a Conference on the Role of Body
Measurements in Evaluation of Human
Nutrition, nutritional anthropometry
came into being (Brozek, 1956). It
refers to the


. . . application of body measurements
for the purpose of characterizing man’s
nutritional status (nutriture), with
emphasis on body composition. Somatic
measurements, properly made and
properly interpreted, are useful in several
nutritional contexts. These include
evaluation of caloric requirements, assessment of
nutriture of a given school child or patient,
description of the present nutritional status 
of different populations and the dem- 
onstration of improvement resulting from 
better economic, agricultural or dietary 
practices, and a continuous assessment of 
the biological adequacy of the caloric 
supply during periods of food restriction 
and food shortages. While starvation is a 
continuous threat in some parts of the 
world, elsewhere obesity has been con- 
sidered as a serious health hazard. For 
the study of this problem a clearer dif- 
ferentiation between gross underweight- 
overweight and leanness-fatness is de- 
sirable. In the description of man’s body 
build the separation of the contribution 
of the skeletal frame from the amount and 
distribution of specific soft tissues will in- 
crease the precision of the portrait (p. iii). 


Further, it was stated: 


Through contact with the science of 
human nutrition, the trends toward a more 
functional, “dynamic” orientation in 
physical anthropology are strengthened.
The biological significance of human
somatic variations at a given time is
measured in terms of association with 
physiological functions and biochemical 
characteristics, while morbidity and 
mortality constitute the long-range criteria
of the importance of differences in human 
physique (p. iv). 

Chemical anthropology, likewise, has 
arisen in the biochemical field (Macy 
and Kelly, 1957). It is concerned with 
the evaluation of nutrient intake and 
of absorption and retention and meta- 
bolite excretion in relation to physical 
growth and physiologic function, body 
metabolism (Williams, 1956), and phys- 
iologic efficiency. Nutritional status is 
being explored as one of the fundamen- 
tal processes of behavior. In discussing 
the impact of diet on behavior, Mac- 
Leod (Brozek, 1957), speaking for the 
psychologists present at a Symposium 
on Nutrition and Behavior held in 1957, 
said, 


. . . this symposium is a welcome
indicator of the growing realization not only that
psychologic research has something to
contribute to the science of nutrition but
also, and more important for us, that
nutritional studies can help to clarify some
of the fundamental problems of
behavior (p. 5).


Body weight, so often used as an 
index of nutrition and growth, may not 
be so reliable as a criterion of satisfac- 
tory growth conditions for children as 
a thorough-going evaluation of nutri- 
tional status, for, even with food intake 
sufficient to allay hunger and produce 
weight gain, the diet may not contain 
essential substances in amounts suffi- 
cient to maintain in the blood and body 
tissues the levels requisite to healthy 
body composition. If the body has be- 
come conditioned to an inadequate diet 
and a lowered nutritional state exists, 
weeks may be required for nutritional 
reconditioning on an adequate diet with 
the necessary physiologic adjustment of
the body tissues and fluids. 

Quality of Survival. Constitutional 
and environmental factors that arrest or 
distort development have significant 
implications in the rate and quality of 
growth taking place at any period in 
infancy and childhood. The tempo and 
dynamic quality of growth and devel- 
opment are influenced by the interplay 
of chemical processes and their response 
to different internal and external en- 
vironments, to the metabolic perform- 
ance, and to the rate of excretion of 
nutrients and waste. When unham- 
pered, all the physiologic and meta- 
bolic processes concerned with growth 
flow in an orderly fashion in physio- 
logic time toward maturity of struc- 
ture and function. If interrupted by an 
inadequate diet and a lowered plane of 
nutritional state for a time, growth will 
return in its original proportions or 
in some altered form of progression and 
symmetry when an adequate diet is 
provided. Maintenance of a healthy 
chemical structure depends on the pres- 
ence in the food of many nutrients and 
on an environment favoring full utili- 
zation. 

Evidence indicates that many chil- 
dren do not receive enough of the es- 
sential nutrients to satisfy their usual 
body needs, to acquire reserves suffi- 
cient to meet sudden and unexpected 
periods of physiologic and pathologic 
stress arising from ineffective fulfill- 
ment of biologic demands, and to meet 
growth and other structural and func- 
tional requirements. Children whose 
plane of nutrition has been lowered 
because of insufficient calories in the 
diet will respond by a slower growth 
rate. Children who are malnourished
because of disease or are undernour- 
ished because of chronic inadequate 
food consumption will require an en- 
larged nutrient intake to make up the 
deficits and to condition or recondition 
their bodies before a normal growth 
rate can be attained. Stearns et al. 
(1958) observed that children whose 
plane of nutrition is always high and 
who have been largely free from illness 
tend to be taller and heavier for their 
age than children whose diet and en- 
vironment are poor. 

There is a basic progression and sym- 
metry of life and growth in accordance 
with the individual’s biologic time that 
links the many physical, chemical, meta- 
bolic, and developmental processes into 
a functioning unit. The inherent physi- 
cal, chemical, and physiologic growth 
changes taking place as the body ma- 
tures are accomplished by an increase 
in body size and changes in visceral 
organs and body composition.
Functional changes accompanying
dimensional and ponderal growth are attained
by the enlargement and functional mat- 
uration of the energy-releasing organs 
(i.e. the nervous, endocrine, and di- 
gestive systems) and the cardiac, res- 
piratory, and renal systems. All these 
components are integral parts of growth 
and differentiational processes, each 
advancing in its own way and at a 
different rate. Cellular changes, synergy 
of growth rates for all systems, and 
proportionate functioning among them 
permit developmental differentiation to 
proceed in a normal fashion. 

In summarizing the epoch-making 
experimental chemical and biological 
enlightenment of the last half century, 
Sherman (1952), the renowned chemical
nutritionist, says,


. . . the activities upon which the life
of the body depends involve a continuous
expenditure of energy and also a
never-ceasing exchange of material. Body
substances may be, and often are, drawn upon
to meet temporary deficiencies of the
nutrient intake; but ultimately the body
is dependent upon food for the fuel
substances (oxidizable organic nutrients)
which supply energy; for the substances
which make good the exchange of body
material; and (either by supplying actual
substances or their precursors) for all
those regulatory factors which keep the
reactions running at the right rates, and
keep the internal conditions of the body
within the zones of delicate adjustment
which the life processes require (p. 7).


The body is an organized, coordinated
whole, which may behave as
something more than a mere summa- 
tion of its parts. 

The new and increasing numbers of 
methodologies for more precise study of 
chemical and physiological growth will 
continue to add new knowledge to the 
chemistry of life. Challenging and fertile 
opportunities await those with imagina- 
tive and creative minds who apply these 
methods and develop other procedures 
to broaden the scope and contribute 
greater depth and meaning to child 
development. New methods may arise 
in any field of science, for there are no 
real boundaries between the increasing 
numbers of branches of science that 
have to do with man. Each overlaps 
with the other. 


Receptor Functions


When the functions of receptors are
studied in the intact organism, the ef- 
fects of stimulation are typically meas- 
ured after neural mediation and by 
means of some response indicator. An 
isolated sensory cell of the skin, cochlea, 
or retina may be activated by energy 
from the external environment, yet 
contribute nothing to neural processes 
or behavior. When it does, its contri- 
bution is relative to that of other re- 
ceptor cells and highly dependent upon 
a background of concurrent neural ac- 
tivity. Hence, the effect of an afferent 
process may range from a relatively 
constant to a highly variable one, de- 
pending in part on the extent of inter- 
vening neural mediation that is in- 
volved. To understand the conse- 
quences of stimulation for behavior of 
the living organism, the investigator 
measures physiological or behavioral 
response at the level or levels appro- 
priate to the particular problem of his 
interest. In developmental studies spe- 
cial care is necessary to avoid equating 
or confusing the different levels. 


The present chapter is aimed at a 
review of those methods of
investigation that are useful for the
measurement of sensory capacities by behavioral
effects of stimulation. When it is clear 
that data are relevant to a restricted 
behavioral class or level of response, 
such restricted applicability is pointed 
out. For example, when the human in- 
fant moves its eyes laterally in response 
to the sweeping motion of a series of 
vertical stripes across the field of vi- 
sion, the observer concludes rightly 
that receptors of the eye are activated 
and that a complex neuromuscular
system is brought into action by the visual
stimulation. With only this much
information, however, the psychologist
exercises caution in drawing
conclusions about visual acuity or the general
adequacy of form vision. He knows
that in mammalian phylogeny this
response occurs in the absence of the
visual cortex (Smith, 1937) and
conceivably reflects strictly midbrain
integrations when observed in the newborn
infant.

For developmental psychology, the 
problems of distinguishing receptor 
from neural developmental stages, and 
of separating levels within the nervous 
system, are no less significant than the 
problems of diagnosing sensory and 
neural disorders selectively. The initial 
discovery of a deficiency is often only 
the beginning of a prolonged series of 
diagnostic observations. An apparent 
deafness may turn out to be unrelated 
to the organ of hearing, except insofar 
as a cortical integrating process, which 
may not be functioning effectively 
(Myklebust, 1954), is involved. Pre- 
sumably, both auditory and extra-audi- 
tory cortical regions would be under 
suspicion. A demonstration that
auditory reflexes can be elicited at normal
thresholds places the site of impairment 
or developmental failure farther along 
in the neural network. 

The use of so-called “objective” 
methods (e.g., the optokinetic reflex,
galvanic skin response, and tympanic
reflex) for testing receptor functions 
typically involves responses that are in- 
tegrated at lower levels than the “sub- 
jective” methods, which call for the 
voluntary cooperation of the subject. 
In work with infants and children ad- 
vantages of involuntary behavioral in- 
dicators are often very great, but there 
is also danger in overlooking possible 
absence of correlation between these 
and other standard tests. More “objec- 
tive” methods are needed, nevertheless. 
Developmental norms can be greatly 
improved by their use, for their repro- 
ducibility is a distinct gain. Relations 
between various methods will also have 
to be worked out before interpretations 
based on the use of some of the newer 
methods can rest on firm ground. Re- 
gardless of the level or objectivity of 
the methods, certain principles of sys- 
tematic experimental investigation must 
be observed if data of permanent value 
in a science of behavioral development 
are to be obtained. 


Quantification in Receptor 
Sensitivity 


THE DETERMINATION 
OF THRESHOLDS 


Psychophysical Experimentation. 
Much of what an investigator does 
in sensory psychophysiology depends
upon his ability to control and repro- 
duce the physical stimuli to which his 
subject may respond. This problem 
will vary with the sense mode and is 
solved differently for each dimension 
of stimulation and for each technique 
within a dimension. In the present sec- 
tion of this chapter the more general 
question of procedural standardization 
is treated; in later sections the instru- 
mentation necessary for controlled 
presentation of the stimulus is discussed 
in specific terms for each type of
energy and its appropriate receptor.

Psychophysics deals with the experi- 
mental and statistical methods needed 
for obtaining quantitative and repro- 
ducible sets of responses to standard 
stimuli. Both average response values 
and variability are determined from any 
systematic study of absolute and differ- 
ential thresholds. The number and spe- 
cific quantitative values of stimulation 
to be used in a study are selected after 
appropriate preliminary investigation. 
A sequence of trials is planned, and the 
basic numerical computations follow 
logically from the set of values that a 
method produces. This set of values 
will reflect a stimulus dimension and 
two or more categories of responses to 
each value along that stimulus dimen- 
sion. For the reader with little experi- 
ence in psychophysical theory and ex- 
perimentation, an excellent discussion 
of basic principles, with examples of 
appropriate methods of tabulation and 
statistical treatment, is available in 
Chapters 8 and 9 of Woodworth and
Schlosberg (1954). More advanced 
presentations are available in textbooks 
on psychometrics. 
Sequences of Stimulus-Presentation.
Three standard methods for ordering 
the stimuli and then arriving at values 
of the physical stimulus that are cor- 
related with a change in response 
(threshold values) have long been in 
standard use. The final statistical state- 
ment will vary in detail with the 
method, as is true for measurements in 
science generally. Fundamentally, there 
is good agreement between the various 
methods in the quantitative relation- 
ships expressed. The method of choice 
is often indicated by the nature of the 
stimuli to be presented in conjunction 
with the response required of the
subject. Since responses may shift
somewhat as a consequence of a sequence
of stimulation, these effects themselves 
are sometimes made the central inter- 
est of experimental investigation. 

The most straightforward approach 
to the determination of a threshold, one 
which gives an initial approximation on 
the first set of observations, is known 
as the method of limits, or method of 
serial exploration. For obtaining a lower 
threshold (RL, or Reiz Limen), the intensity 
of the stimulus may be increased 
by small steps until the first response 
is obtained, or decreased until the response 
is first absent, or both. Exploratory 
work with animal or infant 
human subjects is often done by the 
descending method of limits if the re- 
sponse has been developed through 
training (to provide some assurance 
that the response is still possible) or 
by the ascending method of limits if 
an innate response to the stimulus is 
utilized (to minimize negative adapta- 
tion or habituation). Many investiga- 
tors prefer to obtain an average value 
compounded from both ascending and 
descending series of trials in order to 
reduce the effects of constant errors 
that come from a unidirectional ap- 
proach to the threshold. Whichever 
approach is used, psychologists under- 
stand the result to be an approximation 
or calculated value. They accept the 
concept of limen as a statistical entity, 
varying somewhat with the method of 
determination and not to be regarded 
as an absolute. 
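
The averaging of ascending and descending series can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the data are hypothetical, and the convention of taking the midpoint between the last stimulus before the response changed and the first stimulus after it is a common one that the text itself does not prescribe.

```python
from statistics import mean

def crossing_point(intensities, responses):
    """Midpoint between the two adjacent stimulus values at which the
    response first changes within one series of trials."""
    for i in range(1, len(responses)):
        if responses[i] != responses[i - 1]:
            return (intensities[i - 1] + intensities[i]) / 2
    raise ValueError("response never changed in this series")

def limits_threshold(series):
    """Average the crossing points of all series, ascending and descending."""
    return mean([crossing_point(x, r) for x, r in series])

# Hypothetical series: intensity in arbitrary units, True = response present.
ascending = ([1, 2, 3, 4, 5, 6], [False, False, False, False, False, True])
descending = ([8, 7, 6, 5, 4, 3], [True, True, True, True, True, False])

rl = limits_threshold([ascending, descending])
print(rl)
```

In this made-up example the ascending series gives a higher crossing point than the descending series, which is the kind of unidirectional constant error that the averaged value is meant to reduce.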

For the measurement of transition 
zones, through which a subject varies 
in his response to fixed values of the 
stimulus, two additional methods are 
available and are more satisfactory sta- 
tistically than the method of limits. 
These methods may also be more sat- 
isfying to the subject if he is being 
asked to make judgments of which he 
feels unsure. Guilford (1954) may 
profitably be consulted by the student 
who is seriously engaged in designing 
an experimental investigation of this 
nature. 

The first of these methods, the 
method of average error, is peculiarly 
suited for use with an apparatus that 
can be constructed to allow the sub- 
ject to vary the stimulus values himself 
until he becomes aware of the stipu- 
lated change (presence, absence, just 
noticeable difference, or just not no- 
ticeable difference). This method has 
proved highly satisfactory for the 
measurement of loudness and brightness 
discriminations. The PSE (point of 
subjective equality) or the RL is simply 
an average value of the settings 
obtained. Although an experimenter 
may in the method of average error 
move the stimulus value until the subject 
gives a response, this destroys one 
of its main advantages and essentially 
converts it into a method of limits. In 
fact, this is just what such a procedure 
does when the lower threshold is under 
investigation. 

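
As a minimal sketch of the computation in the method of average error, the following Python fragment averages a set of settings made by the subject. The settings and the value of the standard are hypothetical, and the "constant error" (the PSE minus the standard) and the standard deviation are conventional summary figures added here for illustration; they are not specified in the text.

```python
from statistics import mean, stdev

def average_error_summary(settings, standard=None):
    """Summarize the subject's own settings: the mean is the PSE (or the RL),
    and the standard deviation describes their variability."""
    pse = mean(settings)
    summary = {"pse": pse, "sd": stdev(settings)}
    if standard is not None:
        # Constant error: how far the average setting lies from the standard.
        summary["constant_error"] = pse - standard
    return summary

# Hypothetical settings of a comparison stimulus against a standard of 10.0.
settings = [10.4, 9.8, 10.6, 10.1, 9.9, 10.4]
result = average_error_summary(settings, standard=10.0)
print(result)
```
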
The most refined procedures in 
psychophysics, as well as the most 
generally applicable, are the constant 
methods. These share the advantage of 
eliminating constant errors that stem 
from series effects. They are applied 
to lower, upper, and difference limens 
(DL’s) and to interval scaling problems. 
Variations in these methods have 
been devised to fit special procedural or 
computing requirements, but the essen- 
tial approach is that the experimenter 
selects a limited number of stimulus 
values for each of which he then ob- 
tains a frequency of response. To do 
this E presents the selected stimuli in 
a random or prearranged mixed order 
until each stimulus has been presented 
an equal and reasonably large number 
of times. A plotting is thus obtained of 
the transition between values which 
give very few responses of a given 
category to those for which response 
approaches 100 per cent. The threshold 
is typically taken as the point (deter- 
mined by interpolation or curve fitting) 
at which the function passes through 
the 50 per cent frequency of response 
value. Two curves rather than one are 
obtained in studies of the DL when 
“equals” judgments are permitted. 
Their separation at the 50 per cent fre- 
quency level is referred to as the in- 
terval of uncertainty, which is taken 
as twice the DL. 
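
The interpolation step of the constant methods can be illustrated as follows. The sketch assumes straight-line interpolation between adjacent stimulus values (curve fitting would be the alternative mentioned above), and all response proportions are invented for the example.

```python
def crossing(stimuli, proportions, level=0.5):
    """Stimulus value at which an observed response curve passes through
    `level`, by linear interpolation between adjacent stimulus values."""
    points = list(zip(stimuli, proportions))
    for (x0, p0), (x1, p1) in zip(points, points[1:]):
        if p0 != p1 and min(p0, p1) <= level <= max(p0, p1):
            return x0 + (level - p0) * (x1 - x0) / (p1 - p0)
    raise ValueError("the curve never crosses the requested level")

# Hypothetical absolute-threshold data: proportion of "present" responses.
intensities = [1, 2, 3, 4, 5]
p_detect = [0.05, 0.20, 0.40, 0.80, 0.95]
rl = crossing(intensities, p_detect)

# Hypothetical DL data with "equals" judgments permitted: two curves.
comparisons = [96, 98, 100, 102, 104]
p_less = [0.90, 0.70, 0.30, 0.10, 0.02]
p_greater = [0.02, 0.10, 0.30, 0.70, 0.90]
lower = crossing(comparisons, p_less)
upper = crossing(comparisons, p_greater)
interval_of_uncertainty = upper - lower
dl = interval_of_uncertainty / 2  # the interval is taken as twice the DL

print(rl, interval_of_uncertainty, dl)
```

With these figures the 50 per cent point for the absolute threshold falls between the third and fourth stimulus values, and the two DL curves cross the 50 per cent level about two units apart.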

Many exploratory experiments with 
infants and children end at the point 
of giving preliminary values for absolute 
thresholds, the RL’s. The experimenters 
could have proceeded from the 
base of information so provided to plan 
a psychophysical experiment. This 
knowledge is needed before the selec- 
tion of stimulus values is attempted for 
use with a constant method, and, with 
this much accomplished, psychophysi- 
cal studies can proceed. 


Measuring Visual Functions 


THRESHOLDS OF SENSITIVITY 


Sensitivity Thresholds and Innate 
Response Indicators. In the newborn 
infant visual responsiveness is unmis- 
takably present but limited. Neither 
end-organ nor central nervous path- 
ways are sufficiently developed for 
more than rudimentary activity. The 
nature and exact locus of functional 
deficiencies are only partially under- 
stood. By an electrophysiological tech- 
nique limited responsiveness of the re- 
tina can be measured. This technique, 
the electroretinographic, gives a record 
of the changes in potential at the eye 
as measured in relation to a neutral 
electrode placed on a convenient skin 
surface. Zetterström (1955-1956) used 
flicker electroretinography to show a 
maturation of function during the first 
eight weeks after birth, which corre- 
lates well in time with the histological 
maturation of the macular portion of 
the retina. Cortical on- and off-re- 
sponses (Heck and Zetterström, 1958) 
to single light stimuli, when measured 
electroencephalographically through 
scalp and bone over occipital cortex, 
were first clearly evident only after the 
age of 6 months. Ellingson (1958) reports 
an earlier appearance of evoked 
potentials, with latencies that were 
much longer than in adults. 

The pupillary reflex is functional be- 
fore birth and has been used to meas- 
ure increasing sensitivity with age 
(Sherman and Sherman, 1925). Sudden 
increases in illumination also produce 
innate responses of the neck and head. 
A light of sufficient intensity produces 
a dorsal head movement. This response, 
the “eye reflex on the neck,” has been 
used by a number of investigators 
(Sherman and Sherman, 1925, Peiper, 
1926, Trincker and Trincker, 1955) to 
study the relative sensitivity of the 
newborn infant to light from different 
spectral regions and for determining 
increasing sensitivity with age. It is use- 
ful only during the early months, since 
the response weakens and dies away or 
becomes suppressed between the second 
and the sixth month after birth 
(Trincker and Trincker, 1955). 
Thresholds are measured with light 
from narrow wave-band filters (1) for 
various selected regions of the spec- 
trum and (2) for the light-adapted as 
compared to the dark-adapted eye. In 
premature and newborn infants the 
sensitivity maximum lies close to that 
of the light-adapted adult eye. By the 
age of 1 month the curve of relative 
sensitivity under dark adaptation has 
shifted to give the typical scotopic (rod 
receptor) curve. 

Another response that has found use 
is the reduction in general motility induced 
by increasing the level of illumination 
(Smith, 1936). Automatic recording 
by a device known as a stabilimeter 
is employed to obtain quantitative 
measurements. This method is 
heavily dependent upon a proper con- 
trol or evaluation of the level of ac- 
tivity of the subject when the visual 
change is introduced. Methods employing 
pupillary reflexes or head movement 
are somewhat less variable, but even 
these are not free of this problem. 

Missing from the literature thus far 
is a study of the differential thresholds 
for young infants. The objective methods 
described above are either poorly 
adapted to this task, or, what seems 
more likely, no investigator has applied 
them with sufficient skill and patience 
to work out a psychophysical function. 
Even for older children, from 
whom sufficient comprehension of the 
task and cooperation for its accomplishment 
might be expected, this research 
has not been completed. 

An innate response to movement of 
a large portion of the visual field, the 
optokinetic nystagmus, forms the basis 
of a method that we shall consider 
further in connection with techniques 
for measuring acuity. The eyes are induced 
to move horizontally in an involuntary 
pursuit of the moving field, 
wherein each slower pursuit phase is 
interrupted by a quick return in the 
opposite direction. This method has 
possibilities for the measurement of 
absolute and differential intensity 
thresholds. It has not been systemati- 
cally exploited for this purpose nor for 
the purpose of studying the develop- 
ment of color vision. Theoretically, it 
would be suited to these purposes if 
the problems of simultaneously con- 
trolling hue and brightness could be 
solved in conjunction with the requirement 
that successive bands of light of 
uniform width be made to move at 
some optimal speed across the field of 
vision. 

An apparatus that proved quite sat- 
isfactory for these purposes was de- 
scribed by Meesman (1944). The im- 
age of a small sector of a rotating wheel 
bearing a translucent spiral design was 
projected onto a screen, and the lumi- 
nous intensity of the projected image 
was controlled by means of light filters 
and a variable lamp voltage. The image, 
projected in a field of total darkness, 
was 56 by 34 centimeters, viewing dis- 
tance, 30 centimeters, and the widths 
of the moving stripes, 6 centimeters. 
These values were justified by Mees- 
man on theoretical grounds and are said 
to have worked out well in practice. 
Data are given in his report to show 
the correspondence between curves of 
dark adaptation obtained by this 
method and by subjective (judgmental) 
responses. Average absolute values and 
variability are closely parallel over the 
course of progressive increase in sen- 
sitivity, but the “objective” threshold 
is consistently above that measured by 
the subjective method. 

Measuring Thresholds in Older 
Children. Methods for testing visual 
sensitivity in older children and adults 
employ careful psychophysical proce- 
dures with accurately calibrated instru- 
ments. The best of these give valid and 
highly reliable measurements when cali- 
bration requirements and procedural 
standards are carefully adhered to. An 
adaptometer designed by Hecht and 
Shlaer (1938) for systematically varying 
the intensity of a test figure was 
used by Hunt (1941) with children 12 
to 19 years of age. A violet test figure, 
3 degrees in diameter, is viewed with 
off-center fixation 7 degrees toward the 
nasal side of the visual field. Exposure 
is for 4 second and S must report pres- 
ence or absence of the figure. This in- 
strument and its procedural require- 
ments have been given careful evalua- 
tion. Hunt and Palmer (1940) found 
some variation between three commer- 
cial units of the adaptometer and made 
recommendations for their calibration. 
They also evaluated the degree and 
sources of variation in data obtained 
from a representative group of inexpe- 
rienced subjects. They concluded that 
even with this highly standardized procedure 
and well-calibrated instrument, 
“Variation in the subjective criteria of 
perceptible light flashes may be of suffi- 
cient magnitude to account for a con- 
siderable proportion of the variation 
among different individuals in final 
threshold values obtained after 30 min- 
utes of dark adaptation” (p. 424). The 
method was sufficiently reliable, never- 
theless, to demonstrate differences be- 
tween groups whose nutritional require- 
ments were being met with different 
degrees of adequacy (Hunt, 1941). 
An earlier instrument described by 
Ferree and Rand (1931) has been used 
with children 9 years old to adulthood. 
Some have taken the data from the 
Hunt study and from one by Ferree, 
Rand, and Stoll (1934) to indicate a 
lesser sensitivity after dark adaptation 
in preadolescents than in young adults 
(Zubek and Solberg, p. 179). The data 
are not extensive enough to justify 
confidence in this conclusion in view 
of the variation found and in consid- 
eration of the statement quoted con- 
cerning possible differences in the cri- 
teria of judgment. Given a group of 
subjects with somewhat more experi- 
ence as observers and carefully stand- 
ardized instrumentation, an investigator 
might obtain data with a minimum of 
these sources of error, which would 
settle the question of possible age dif- 
ferences among older children. 

In the hands of an expert the instru- 
ment of Ferree and Rand has certain 
advantages. It provides greater flexibility 
than the Hecht-Shlaer adaptometer 
for exploratory studies with 
varied sizes of test fields, largely elimi- 
nates effects of the size of the subject's 
pupil, and minimizes effects of ocular 
refractive errors and accommodation. 
An advantage of simplicity of opera- 
tion and procedure, once calibrations 
have been made, is held by the Hecht 
and Shlaer apparatus. 

Critical Flicker Frequency (CFF). 
The frequency at which a flashing light 
can just be discriminated from a steady 
light of apparently equal luminosity 
constitutes a threshold of great useful- 
ness in psychophysiological studies of 
the eye and the visual system. A stand- 
ard procedure employs a single small 
luminous area upon which intermittent 
illumination can be altered in a con- 
trolled cycle of light versus dark at a 
frequency that can be varied continu- 
ously over a range above and below 
the fusion threshold. Subjects are asked 
to judge when flashes appear to give 
way to steady light and when steady 
luminosity changes to flicker. These 
judgments are not difficult for older 
children, but they seem not to have 
been used at younger ages when the 
method should be distinctly useful. 

For age groups beginning at 13-19, 
compared with those for each decade 
thereafter, up to 80-89 years, McFar- 
land et al. (1958) have shown steadily 
decreasing average CFF’s. This study 
is a distinct contribution to the devel- 
opment of a method that is sensitive to 
the effects of aging. The authors dem- 
onstrated that light flashes should be 
of short duration, i.e., under 10 per 
cent of the light-dark cycle, if the 
method is to be best suited to the meas- 
urement of loss of discriminating power 
as a function of age. McFarland and 
co-workers used an instrument designed 
by W. B. Fisher, which produces light 
flashes of a square waveform. Flash rate 
could be varied 1 to 125 per second, 
and the duty cycle (ratio of light to 
dark) could be varied between 2 and 
98 per cent. This instrument is a dis- 
tinct advance over the episcotister and 
earlier electronic methods for present- 
ing controlled variations in the param- 
eters known to influence the CFF. 
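
The relation between flash rate, duty cycle, and flash duration is simple arithmetic and can be sketched as below. The sketch assumes that the duty cycle is expressed as the light portion of the whole light-dark cycle, in line with the phrase "under 10 per cent of the light-dark cycle"; the function name and the example rate of 40 flashes per second are illustrative only.

```python
def flash_duration_ms(rate_per_second, light_fraction):
    """Duration of one light flash in milliseconds, given the flash rate and
    the fraction of each light-dark cycle during which the light is on."""
    if rate_per_second <= 0:
        raise ValueError("flash rate must be positive")
    if not 0 < light_fraction < 1:
        raise ValueError("light fraction must lie between 0 and 1")
    cycle_ms = 1000.0 / rate_per_second
    return cycle_ms * light_fraction

# Illustrative values: 40 flashes per second with the light on for
# 10 per cent of each cycle gives a 25 ms cycle and a 2.5 ms flash.
example = flash_duration_ms(40, 0.10)
print(example)
```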

For studying the development of 
visual sensitivity through thresholds for 
flicker, a method employing the opto- 
kinetic response should be useful in in- 
fants and young children. Its chief 
drawbacks would include some lack of 
direct comparability with standard 
CFF procedures and other problems 
common to the use of innate response 
indicators. The essential measure would 
have to be in terms of the speed at 
which moving striations could no 
longer be followed, as very effectively 
employed in studies of the visual neurophysiology 
of animals (Crozier and 
Wolf, 1941). 


THE DEVELOPMENT 
OF COLOR DISCRIMINATION 


Differential Responses to Wavelength. 
The studies of Peiper and of 
Trincker and Trincker, discussed 
above, were conceived in terms of the 
duplicity theory of retinal sensitivity. 
They were designed to ascertain 
whether both photopic and scotopic 
luminosity curves could be shown to 
exist for the newborn infant, and there- 
fore the results have significance for 
both the achromatic and the chromatic 
aspects of visual function. The results 
imply that the newborn infant has re- 
ceptors that are ready to mediate cone- 
like activity in the retina when the eye 
is light-adapted. Together, these studies 
show that the neck reflex provides an 
index of the relative luminosity curve 
of infants, which corresponds to that 
of the photopic curve of the adult eye 
for wavelengths between 430 and 680 
millimicrons. This reflex gives results 
that appear to be more valid than those 
obtained from a general measure of 
body motility and its inhibition (Smith, 
1936). 

Hue Discrimination. To ask whether 
infants discriminate one color from an- 
other is to pose a more difficult ques- 
tion than the one just considered. At 
five spectral regions Trincker and 
Trincker (1955) measured the relative 
energy required to produce a neck re- 
flex, confirming Peiper’s earlier finding 
that the newborn infant exhibits the 
luminosity curve of cone vision. The 
result also tells us that the intensity 
dimension alone is effective in eliciting 
the neck reflex. Some other technique 
is required to ascertain whether hues 
are discriminable. 

Chase (1937) used a method whose 
validity is now well supported by the 
evidence that under conditions of light 
adaptation brightness values can be 
equated for infants when we use the 
relative sensitivity to different wave- 
lengths of the adult eye. By moving a 
small area of colored light within a 
colorless field of equivalent brightness 
or within a field of a contrasting hue, 
Chase was able to obtain visual pursuit 
in a high percentage of trials. The 
method served well for infants ranging 
from 15 to 70 days of age. Control 
trials in which the hue of figure and 
ground was the same but brightness 
differed by as much as 25 per cent gave 
negative results. The study indicates 
that each of four hues was discriminated 
from the others. 

To equate luminosity over the en- 
tire screen, an area of 24 by 17% inches, 
Chase projected light through Wratten 
filters. A square hole was punched 
through the center of the gelatin of 
the surround, and the test hue was inserted. 
The projected light flooded the 
entire translucent screen, and a Bodine 
variable speed motor served to drive 
the filter holder back and forth on a 
small track. Two observers were used 
to obtain independent records of the 
infant’s eye movements. 

For infants of 60 days to 2 years, 
Staples (1932) found a preference 
method for testing hue discrimination 
[...] increasing age. Again equating for brightness, 
she presented a pair of colored 
disks or a colored versus a gray disk 
and observed fixation times. In the 
youngest group (60 to 143 days) only 
yellow attracted significantly longer 
fixation times than grays, although for 
other colors the direction of difference 
was always in favor of the colored over 
the colorless disk. For older groups of 
children, there were reliable prefer- 
ences for some colors over others and 
some shifts of preferences with age. 
Red was markedly preferred by pre- 
school children but later gave way to 
blue. 

Other techniques that give evidence 
of color discrimination require match- 
ing of objects on the basis of color, the 
tracing of colored forms, or the nam- 
ing of forms whose outline is imbedded 
in discontinuous background figures. 
Munn (1955, pp. 242-243) has reviewed 
these studies and comments on the 
marked improvement found with in- 
creasing age of the subject. Negative 
results with a younger child may not 
represent an absence of color sensi- 
tivity: “All that can be said with as- 
surance is that, as they become older, 
children improve in color naming and 
in their discrimination of the figures in 
pseudo-isochromatic charts like those 
in the Ishihara and Dvorine tests.” 


THE DEVELOPMENT 
OF VISUAL ACUITY 


Measurement of the resolving power 
of the visual system requires accurate 
specification both of the nature of the 
test figure and of the background conditions 
under which it is used. The several 
classes of acuity can be compared 
if each is expressed in terms of the 
visual angle subtended by an object or 
the visual angle of a gap that can just 
be perceived. Any test of a threshold 
of acuity shows some change in the 
minimum resolvable visual angle with 
variation in illumination or brightness. 


For a bright line or a dot that is made 
narrower and narrower within a dark 
field, the effect of luminosity is so 
strong that it can always be made to 
compensate for a reduction in size of 
the test object. Control of luminosity 
and of the adaptation of the eye are 
so critical as to be impractical for rou- 
tine or large scale testing. Measures of 
acuity therefore usually rely on a de- 
termination of the minimum separable, 
as distinguished from the minimum 
visible. When the minimum separable 
is expressed in seconds or minutes, a 
direct measure is given of the arc of 
the visual angle subtended by the dis- 
tance between test bars at the thresh- 
old. Visual acuity is defined as the re- 
ciprocal of this angle, the angle being 
measured in minutes. Under good con- 
ditions of testing, acuity reaches a value 
of 2.0, corresponding to a visual angle 
of one half minute. The conventional 
Snellen fraction represents “normal” 
acuity as 20/20, which corresponds to 
a visual acuity of 1 or a resolution of 
one minute of visual angle (Bartley, 
1958, Ch. 7). 
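
The definitions above reduce to a pair of reciprocal relations, sketched here in Python. The function names are illustrative, and the sketch assumes the usual reading of the Snellen fraction, in which the denominator divided by the numerator gives the minimum resolvable angle in minutes of arc.

```python
def acuity_from_angle(angle_minutes):
    """Visual acuity as the reciprocal of the minimum separable angle,
    the angle being measured in minutes of arc."""
    if angle_minutes <= 0:
        raise ValueError("visual angle must be positive")
    return 1.0 / angle_minutes

def snellen_to_angle(numerator, denominator):
    """Minimum resolvable angle, in minutes of arc, implied by a Snellen
    fraction such as 20/20 (assumed convention: denominator / numerator)."""
    return denominator / numerator

print(acuity_from_angle(0.5))                          # 2.0
print(snellen_to_angle(20, 20))                        # 1.0
print(acuity_from_angle(snellen_to_angle(20, 20)))     # 1.0
```

On the same reading, a Snellen fraction of 20/10 would correspond to the acuity of 2.0 reached under good conditions of testing.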

Innate (“Objective”) Indicators of 
Acuity. Ohm, who in 1922 originally 
described the use of optokinetic nys- 
tagmus as a method of examining the 
excitability of the human visual system, 
has more recently (Ohm, 1952) commented 
on several European variations 
of this “objective” indicator. He also 
reiterates a warning against the uncritical 
application of this technique. One 
limitation of considerable importance 
lies in the relative contribution of the 
peripheral retina, which is more active 
in mediating the response to motion of 
a large portion of the visual field than 
the fovea. This means that optokinetic 
nystagmus measures something less than 
maximum resolving power when the 
striations have been reduced in width 
to the point at which eye movements 
are no longer elicited. All words of 
caution notwithstanding, the promise of 
this method for studying visual devel- 
opment during infancy and early child- 
hood should have been recognized 
sooner and exploited more extensively 
than is yet the case. 

McGinnis (1930) demonstrated that 
the response can be elicited on the day 
of birth. A proper level of alertness is 
necessary, as shown by the fact that 
the eye movements do not occur when 
the eyes of a sleeping infant are gently 
separated during movement of the op- 
tokinetic drum. Gorman, Cogan, and 
Gellis (1957) have applied a simplified 
version of the McGinnis technique to 
the measurement of 100 infants under 
5 days of age. They used a field that 
moved at a rate of 8.5 degrees per sec- 
ond, with either of two widths for the 
alternating lines and spaces. When each 
line and space subtended 33.5 minutes, 
they found nystagmus occurred in 93 
of the infants, but with widths of 11.1 
minutes of visual angle the response 
generally failed. The larger width is 
equivalent to a Snellen notation of 
20/670. Without intervening steps for 
testing purposes, these authors were 
unable to find any change in acuity 
between days 1 and 5 and concluded 
that more work with this method is 
needed to determine the acuity of 
young infants. Warkentin and Smith 
(1937) have given us a more precise 
developmental study, using this tech- 
nique with kittens between birth and 
30 days of age, and report that the first 
response to stripe widths of eleven 
minutes occurs between days 23 and 
30. 

Using children as subjects, Reinecke 
and Cogan (1958) compared acuities 
determined by the optokinetic method 
with those obtained from Snellen 
charts. Snellen acuity was frequently 
higher than the index obtained with 
the optokinetic drum. The correlation 
between the measures was +.66 for 
100 subjects. These experimenters used 
lines and spaces that were equal in 
width, and they had series of widths 
ranging from 0.333 to 0.028 inch, pre- 
sented at 1 meter from the eye. For 
those subjects who still showed re- 
sponse to the narrowest stripes at 1 
meter, the distance was increased to 2.5 
meters. 
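
Stripe widths given in inches can be converted to visual angle with elementary trigonometry, which makes them comparable with acuity values expressed in minutes of arc. The following sketch is an illustration only; the conversion to Snellen notation is not attempted, and the function name is an assumption.

```python
import math

INCH_TO_METER = 0.0254

def visual_angle_minutes(width_m, distance_m):
    """Visual angle, in minutes of arc, subtended by an object of the given
    width viewed squarely at the given distance."""
    radians = 2 * math.atan(width_m / (2 * distance_m))
    return math.degrees(radians) * 60

widest = visual_angle_minutes(0.333 * INCH_TO_METER, 1.0)
narrowest = visual_angle_minutes(0.028 * INCH_TO_METER, 1.0)
narrowest_far = visual_angle_minutes(0.028 * INCH_TO_METER, 2.5)

# Roughly 29 minutes, 2.4 minutes, and just under 1 minute respectively.
print(round(widest, 2), round(narrowest, 2), round(narrowest_far, 2))
```

On this computation, moving the narrowest stripes from 1 meter to 2.5 meters brings each stripe to a little under one minute of visual angle.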

For infants older than 6 weeks, 
McGinnis (1930) reported that visual 
pursuit of a single line moving slowly 
across the visual field could be elicited 
reliably, and the response was easily 
recorded by him on motion-picture 
film. If we assume that fixation and 
pursuit of such a visual stimulus depend, 
as they almost certainly do, on 
foveal vision, this method could furnish 
the basis for a developmental study 
of foveal acuity. 

Judgmental Tests of Acuity. For 
[...]tion and compliance with instructions, 
a more complex form of visual stimulus 
is used for the acuity target. The 
Snellen illiterate E test is such a stand- 
ard instrument for use with preschool 
children. Stern (1953) suggests that in-
stead of relying on the chart and a 
verbal description by the child of the 
position of the E it helps to use a series 
of solid plastic E’s. The child is given 
such an E and is shown how he can 
turn it to match the position of the 
examiner’s E. Then the examiner places 
smaller E’s in varying orientation 
against a viewing screen and the child 
continues to turn his E to correspond. 
“Children as young as 2 years old have 
been tested to 20/30 vision in each eye 
by this method.” 

Allen (1957) offers a new picture 
test, which he claims often works when 
the illiterate E test is unsatisfactory, as 
is sometimes the case with 3- and 
4-year-olds. Eight broken-line black- 
on-white pictures are used in this test. 
Most normal 3-year-olds recognized 
the pictures at 15 feet and 4-year-olds, 
at 20 feet. Using the illiterate E test 
for validation, Allen found that 20/40 
vision did not permit recognition of 
the pictures at 20 feet. Approximate 
norms and Snellen equivalents are in- 
dicated, as is also the manufacturer of 
the test equipment, Ophthalmix, Inc., 
Brookfield, Illinois. Comparing the var- 
iability of five different types of chart 
figures in common use (letters, figures, 
pictures, Landolt rings, and E-hooks), 
Jonkers (1958) reports standard devia-
tions between .04 (for letters) and
.15 (for pictures). His data are derived
from children aged 5 years and over.

For children of school age, several
commercially available tests of acuity 
are in use. These are offered in com- 
bination with tests for stereopsis or 
binocular fusion and for measures of 
the degree to which the eyes orient in 
parallel along the line of sight. Such 
combined “screening” tests are much 
used in schools and in industrial estab- 
lishments. The Massachusetts Vision
Test was given by Leverett (1957) to 
children from kindergarten age through 
the twelfth grade. A consistent increase
in the proportion of children showing
less than 20/20 vision was found with 
increase in grade level. The phoria test 
showed that there was a similar trend 
toward deviating positions of the eyes. 
In one study of children from grades
1 through 12 (Kelley, 1957) the Key-
stone Telebinocular was selected “be- 
cause it proved better adapted to test- 
ing very young children, and it was 
desired that the same instrument be 
used with all grades.” The basic prin- 
ciple of the Keystone instrument, like 
that of the Bausch and Lomb Ortho- 
Rater, is that of the older and familiar 
Brewster stereoscope. Tests for acuity 
at near and at far (about 20 to 25 feet), 
for monocular and binocular perform- 
ance, fusion, and degrees of phoria are 
provided in the Ortho-Rater or the 
Keystone. One compact instrument 
with a series of precision-constructed 
test plates can be used for the battery 
of interrelated tests (Robinson, 1953). 

Pattern Discrimination. The prob- 
lem of more complex visual pattern 
discrimination is beyond the scope of 
this chapter. However, the transition 
between acuity measurements and pat- 
tern vision is not one with a sharp cut- 
off. Pattern preferences in infants un-
der 6 months of age have been demon- 
strated by Fantz (1958), who proposes 
also to use such preferences as a method 
for measuring acuity. The technique 
involves a comparison between relative 
fixation times exhibited by the infant 
to one of two simultaneously presented 
visual patterns. Horizontal stripes were
found to be preferred over concentric
circles. As in the training technique
used by Riesen (1958) with infant
chimpanzees, the evaluation of acuity
can be made by using stripes of vary-
ing widths.

Ling (1941) found a training method 
satisfactory for the study of learned 
form discriminations in babies of 6 to 
12 months. The technique is reported 
to give an initial discrimination between 
forms such as circle versus cross or 
circle versus triangle in 40 training
trials to a criterion of four out of five 
correct responses. The method Ling 
used was to fasten one form down and 
leave the other free to be picked up 
and brought to the mouth, whereupon 
the child found it sweet to the taste. 
The circle versus oval discrimination
was most difficult, which suggests im-
mediately that it would lend itself to 
a measurement of differential thresh- 
olds for form acuity. 


Measuring Auditory Receptivity 


For detecting a sensitivity of the ear 
in newborn infants, some reflex re- 
sponses are useful but not so readily 
observed as in vision. Pure tones pro- 
duce little or no response of any kind. 
High intensity noise is quite often re- 
ported to startle or to produce an in- 
crease in general bodily activity. For
anatomical reasons, it is usually pre- 
sumed that hearing in the newborn is 
interfered with by external and middle- 
ear impediments, especially by amnionic 
fluid remaining in the middle ear, where 
it impedes motion of the ossicles. Au- 
thors have sometimes erroneously as- 
serted that newborn babies hear noth-
ing.


SENSITIVITY TO NOISE 
IN THE FETUS AND NEWBORN 


Sound Induced Reflexes. The gen- 
eralized muscular contraction often re- 
ferred to as the “startle response” has 
been used to test the sensitivity of the 
fetus and newborn infant. Forbes and 
Forbes (1927) reported observations of 
this response through the mother’s 
abdominal wall thirty-one days before 
term. Froeschels and Beebe (1946) 
looked for any or all movements that 
infants might make to the sound of 
tuning forks or whistles. Tuning forks
proved ineffective in 33 newborn to
-day-old infants, but the noisy onset 
of whistles produced responses in 29 of 
these. An acoustopalpebral reflex was 
the most frequent positive indicator 
observed. 

Whereas newborn chimpanzees, some 
other infrahuman primates, and many 
laboratory mammals reliably exhibit a 
pinna reflex to a bell or whistle, this 
response is only occasionally reported 
by those who study human infants. 
Forbes and Forbes (1927, pp. 354-355)
cite their observation of an 8-day-old 
infant who was stimulated with “soft 
notes from a gong coming from be- 
hind. . . . The baby’s ear at once flat- 
tened against the side of its head and
for a few seconds it stopped nursing, 
then the ear relaxed again and nursing 
continued.” What may be the same re- 
sponse and was called “ad concham” 
is reported for four of the 33 subjects 
of Froeschels and Beebe (1946). A re- 
flex of the tensor tympani has also been 
used to indicate the presence of hear- 
ing. Kobrak (1957) describes in detail 
a technique for making this response 
visible through a system of mirrors and 
a reflected light beam that moves with 
motion of the eardrum. 

The galvanic skin response of young 
adults has been shown to vary directly 
in magnitude with the intensity of a 
pure tone (Hovland and Riesen, 1940). 
Stimuli near the lower limits of audi- 
bility were not used in this study. The 
use of sound-induced reflexes shares the 
common disadvantage that useful data 
on thresholds are seldom obtained be- 
cause all of these indicators require at 
least moderately intense stimulation if 
the reflex mechanism is to be activated. 
Lower limits of stimulus intensity and 
response could, however, prove diag- 
nostic if careful quantification on nor- 
mal populations were first carried 
out. 

For evaluating the hearing of babies 
between 3 and 5 weeks of age, Haller 
(1932) made observations of general 
somatic responses to pure tones. She 
varied these along both intensity and 
pitch dimensions. This represents an 
attempt to do a very rough kind of 
audiometry and might be improved 
upon somewhat by the use of better 
controlled conditions. The Dockeray
and Valentine (1939) isolation cabinet
was proposed for such purposes. The
main conclusion of Haller’s study is
that tones of high frequency and high
intensity produce restlessness. There
was also the suggestion that low-fre-
quency tones have a quieting effect.

Electrophysiological Methods. The
electrical response at the cortex to 
clicks of varying intensity can give an 
estimate of hearing impairment (Perl, 
Galambos, and Glorig, 1953). As com- 
pared with subjective thresholds, the 
EEG threshold determined by visual 
inspection of the EEG records is 
higher. Newer electronic summators 
could be employed to reduce the 
amount of discrepancy between psy- 
chophysical limens and those taken by 
electroencephalography. With pure 
tone audiometry and evaluations of the 
effects of pure tones on EEG patterns 
with respect to onset, continuation, 
termination and delayed arousal reac- 
tions, Derbyshire et al. (1956) re-
ported statistical comparisons of thresh-
olds obtained from 22 children (aged 4 
to 13 years) with normal and impaired 
hearing. Average differences between 
standard audiometric and EEG thresh- 
olds ranged from 4 to 40 decibels. The 
EEG responses were interpreted as re- 
flecting activation of a diffuse arousal 
system. This conclusion was based on
the authors’ experience with a much 
larger sample of subjects for whom no 
effect of age from 3 months to 27 years 
was seen in the records and in whom 
similar responses were obtained by 
stimulation of other sensory modalities. 
Derbyshire and co-authors state that 
they obtained audiometric and EEG 
records in which these two sources of 
information were for diagnostic pur-
poses more useful taken together than 
either would have been alone. 

Special Evaluative Procedures for 
Infants and Children. Those specialists 
who must evaluate the hearing of in- 
fants and younger children agree that 
pure tones have little interest value and 
that the standard audiometric proce- 
dure is unsatisfactory with most chil- 
dren up to 5 years of age. Myklebust
(1954) urges that the mental and emo- 
tional status of the child be appraised 
as preliminary steps to auditory tests 
for the reason that “failure to respond 
to an auditory stimulus does not nec- 
essarily mean inability to hear” (Mykle- 
bust, p. 242). 

Several authors (Ewing and Ewing, 
1944; Myklebust, 1954) find that the 
use of whispered speech often brings 
good response when louder sounds do 
not. Other meaningful sounds, some of
which are learned as early as 2 months, 
serve in the assessment of receptive and 
interpretive capacity. These and other 
methods were evaluated in a survey 
study done on 91 presumably normal 
children and 170 children suspected of 
deafness (Ewing and Ewing, 1944). For 
children 3 to 6 years old, Bloomer 
(1942) utilized pictures of objects that 
children associate with sound to obtain 
responses in pure tone audiometry. The 
verbal instructions and required be- 
havior are simple. The child is asked 
to listen for a sound that will be a
“train whistle” or a “bird song.” He
is to touch the pictured object or ani-
mal as soon as he hears it and to re-
move his hand from the picture as
soon as it stops. Bloomer suggests that
only three or four frequencies be tested
during a given session and that for a 
complete audiogram a second session be 
scheduled for another day. 

Techniques for conditioning infants 
or children to give responses to sound 
stimuli by pairing them with a rein- 
forcing stimulus have been described 
and evaluated by several investigators. 
Hirsh (1952, Ch. 10) discussed the 
varieties of such methods and compared 
them with standard audiometric pro- 
cedures, which he regarded as not fun- 
damentally different in that they have 
in common the utilization of learned 
responses as indicators of auditory sen- 
sitivity. However, conditioning proce- 
dures that are based on the classical 
conditioning of involuntary (auto- 
nomic) responses have the possible ad- 
vantage of placing fewer demands on 
the voluntary cooperation of the sub- 
ject. For this reason, they have some- 
times been especially advocated for test- 
ing infants and younger children.

A study whose primary objective 
was the determination of the condi- 
tionability of very young infants 
(Wickens and Wickens, 1940) success-
fully demonstrated auditory sensitivity 
of babies aged 10 days or under. These 
investigators paired buzzer and shock 
12 times daily for three days, the shock 
being applied to the foot to produce 
a withdrawal movement. In a control 
group it was found that applying the 
shock alone was as effective as the pair- 
ing, for in the total of 24 subjects 20 
gave foot withdrawal movements to 
the sound alone, nine of these having 
received paired stimulation and 11 only 
the shock. The pseudo-conditioning 
was thus as capable of bringing out the
auditory responsivity of these very 
young infants as the classical condition- 
ing procedure. 

The conditioned galvanic skin re- 
sponse was found effective in screening 
hundreds of children for hearing de- 
fects at the Johns Hopkins Hospital 
(Bordley and Hardy, 1949; Bordley, 
1952). These authors report that in 
children of 1 and 2 years of age a tone 
and shock need be paired only a few 
times. Thereafter, response continues 
for many trials over a wide frequency 
range. Results were checked against a 
standard audiometry test in slightly 
older children, who gave sufficient co- 
operation to permit this procedure. Still 
another standard conditioning tech- 
nique has paired pure tones (the CS) 
with air-puff stimulation of the cornea 
(Galloway and Butler, 1956). For 20 
subjects, the difference between thresh- 
olds obtained by audiometric tests and 
those measured by this conditioned eye- 
lid response averaged approximately 5 
decibels.

Hirsh (1952) and others have re- 
ported considerable interest in the 
instrumental (operant) conditioning 
technique of Dix and Hallpike (1947). 
A pure tone sounded for short intervals 
of time is made the occasion for rein- 
forcement during a “peep-show” game. 
After a training session, the young 
child who can hear demonstrates this 
capacity by turning promptly after the 
onset of the tone to the peep-show 
window and pressing the button to il- 
luminate the interesting picture or ob- 
ject. During silent periods the button 
is ineffective. This technique differs 
from standard audiometry primarily in 
that it minimizes any need for the un-
derstanding of instructions and for the
child’s giving verbal reports of what he
hears.

Older children can tolerate the
repetition necessary for a thorough
psychophysical procedure in the audio- 
metric measurement of thresholds over 
a wide range of frequencies. Neverthe- 
less, there is something to be said for the 
time-saving advantages of an audiom- 
eter in which the subject adjusts the 
tonal intensities over his threshold range 
rather than having to make judgments 
in an arbitrary prearranged series or in a 
series that the examiner adjusts as he 
gradually rules out the more extreme 
and less useful settings. The Békésy 
(1947) audiometer produces a con- 
tinuous increase of intensity as long as 
the subject holds down a button, a con- 
tinuous decrease after release. The 
absolute threshold and difference limens 
can be recorded over a wide range of 
frequencies (500 to 15,000 cycles per 
second). Corso (1955) determined that 
an attenuation rate of 0.5 decibel per 
second was optimal; i.e., it produces the 
best test-retest reliability for threshold 
determinations at 1000 cycles per 
second. 
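
The tracking principle of such a self-adjusting audiometer can be illustrated with a small simulation. This is a sketch under stated assumptions, not a description of the Békésy instrument itself: the listener model (a fixed true threshold with a 2-decibel margin on either side), the starting level, and all function and parameter names are hypothetical. Only the rule that intensity rises while the button is held and falls after release, and the 0.5 decibel per second rate, come from the text.

```python
def bekesy_track(true_threshold_db, rate_db_per_s=0.5, hysteresis_db=2.0,
                 start_db=0.0, duration_s=120.0, dt=0.1):
    """Simulate one tracking run and return (peaks, troughs) in decibels.

    Intensity rises while the button is held down and falls after release.
    The simulated listener (an assumption) holds the button until the tone
    is clearly audible (threshold + hysteresis), then releases it until the
    tone is clearly inaudible (threshold - hysteresis).
    """
    level = start_db
    button_down = True  # the tone starts below threshold
    peaks, troughs = [], []
    for _ in range(int(duration_s / dt)):
        step = rate_db_per_s * dt
        level += step if button_down else -step
        if button_down and level >= true_threshold_db + hysteresis_db:
            peaks.append(level)      # reversal: listener releases the button
            button_down = False
        elif not button_down and level <= true_threshold_db - hysteresis_db:
            troughs.append(level)    # reversal: listener presses again
            button_down = True
    return peaks, troughs


def threshold_estimate(peaks, troughs):
    """Midpoint between the mean peak and the mean trough."""
    return (sum(peaks) / len(peaks) + sum(troughs) / len(troughs)) / 2


peaks, troughs = bekesy_track(true_threshold_db=20.0)
print(round(threshold_estimate(peaks, troughs), 1))
```

The midpoint of the excursions recovers the assumed threshold closely because the simulated listener is perfectly consistent; a real subject's record would show variable excursions.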

It is clear from the writings of the 
clinically oriented specialists that if all 
children at various ages and presenting 
abnormal as well as normal hearing 
capacities are to be evaluated, several
techniques must be available. A syste- 
matic study of a normal population 
of very young children could be done 
with a specific technique and would 
have the merits of reproducibility as 
well as contributing to further valida- 
tion of that procedure. 


Testing the Chemoreceptors


TASTE SENSITIVITY


Taste in the Newborn. For studying 
gustatory and olfactory sensitivity in 
the infant, there are more methods than 
there are developmental data described 
in the literature. There is need for 
further refinement in methodology, 
but much could be learned by the ap- 
plication of currently available pro- 
cedures. In addition to general problems 
of improving thresholds, the question of 
the possible differential development of 
the several receptor areas within the 
mouth needs investigation, and this 
would require a method of controlling 
the site of stimulation more adequately
than any currently in use. 

For studying taste sensitivity in new- 
born infants, the standard stimuli, 
sweet, sour, salt, and bitter, and oc-
casionally more complex stimuli, such 
as human versus cow’s milk, have been 
used. The solutions have been applied 
with cotton applicators or through a 
nipple. Pulse, breathing, body move- 
ment, and sucking responses have been 
observed or recorded automatically. 
Change from base-line scores or con- 
trast scores are used as a measure of 
differential sensitivity. 

Pratt, Nelson, and Sun (1930) used 
the applicators and measured changes in 
general activity, as recorded on the 
stabilimeter-polygraph. Citric acid and 
quinine were about equally effective in 
raising the level of activity. To sugar 
and salt the newborn infant was much 
more likely to show increased sucking 
movements than changes in general 
bodily activity. Sucking movements were
not part of the records taken on the
polygraph. It remained for Jensen (1932)
to develop
a technique that permitted analysis of 
detailed records of sucking responses. 
Momentary changes in pressure and 
volume consumed as a function of time 
were recorded on cumulative curves 
from his elaborate manometer system, 
timer, and polygraph. Temperature of 
the fluid stimulant was held constant 
(except when he studied it as the in- 
dependent variable). Nipples were 
calibrated carefully for rate of flow, 
matched pairs of nipples being used for 
the experimental and control feedings. 
The chief inadequacy of this method 
lies in the absence of control over the 
area that is stimulated within the mouth.
That it employs a most natural and a 
discriminative indicator response is at- 
tested to by results obtained, and it 
is to be regretted that the method has 
not been applied to infants other than 
Jensen’s group of 2- to 16-day-old
hospital babies. 

Jensen used five concentrations of salt 
solution for determining differential 
reactions compared with milk. Ap- 
parently a descending method of limits 
gave him the results in his Table 4, 
which showed that no infant dis- 
criminated the 0.200 per cent salt 
solution and that 13 of 14 discriminated 
the 0.900 per cent solution. All are
weaker solutions than those recom-
mended by Börnstein (1940-1941) for
the detection of taste impairment in 
man. 

Techniques with Older Subjects. 
Taste-testing the anthropoid apes 
(Fisher, Ford, and Huxley, 1939) has 
been accomplished by a procedure that 
also employs a behavioral sign of re- 
luctance or strong aversion. It would be 
applicable to human subjects over a
wide range of ages. About 30 per cent 
of a chimpanzee population was found 
to be clearly distinguished from the re- 
mainder of the group on the basis of 
refusal or willingness to drink a 2 per 
cent sucrose solution containing 50 
parts per million of phenyl-thio- 
carbamide. Threshold work could easily 
be done by a method of constant 
stimuli. Fisher and co-workers, for ex- 
ample, used two other concentrations 
and found that nontasters were unable 
to detect even 400 parts per million, 
whereas some tasters refused a solution 
containing only 6¼ parts per million.
The use of quinine for determining 
changes in threshold for the sensory 
quality of bitter is standard practice. 
Here, again, rejection is used as a cri- 
terion (Moncrieff, 1946) when verbal 
judgments are not possible. 
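
How a threshold might be read from data gathered by a method of constant stimuli can be sketched as follows. The proportions of subjects rejecting each solution are hypothetical values invented for illustration, not results from any study cited here; the interpolation on a logarithmic concentration scale is likewise one assumed convention among several.

```python
import math


def threshold_50(concentrations_ppm, proportion_rejecting):
    """Concentration at which rejection reaches 50 per cent.

    Uses linear interpolation on a log-concentration scale between the two
    fixed concentrations that bracket 50 per cent. Concentrations must be
    in ascending order. Returns None if 50 per cent is never bracketed.
    """
    points = list(zip(concentrations_ppm, proportion_rejecting))
    for (c_lo, p_lo), (c_hi, p_hi) in zip(points, points[1:]):
        if p_lo <= 0.5 <= p_hi and p_hi > p_lo:
            frac = (0.5 - p_lo) / (p_hi - p_lo)
            log_c = math.log(c_lo) + frac * (math.log(c_hi) - math.log(c_lo))
            return math.exp(log_c)
    return None


# Three fixed concentrations (parts per million) and HYPOTHETICAL
# proportions of subjects rejecting each one.
concentrations = [6.25, 50.0, 400.0]
rejecting = [0.2, 0.8, 1.0]
print(threshold_50(concentrations, rejecting))
```

With these invented proportions the 50 per cent point falls midway, on the logarithmic scale, between the two lower concentrations.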

Very young children and animals are 
found to accept solutions having a sweet 
taste in preference to distilled water. 
This was the basis on which sucrose 
taste thresholds were compared in rats, 
school children, adults up to 50 years 
of age, and elderly subjects, 52 to 85 
years old (Richter and Campbell, 1940). 
For the rats, the criterion was in terms 
of relative quantity of fluid consumed. 
The human subjects judged first when a 
difference between the sucrose solution 
and distilled water could be detected 
and then reported at what point they 
recognized a sweet taste. An ascending 
method of limits was used, the concen- 
trations being increased from a value 
of 0.01 per cent in a standard series of
steps. Rats and young adult humans 
exhibited nearly equivalent sucrose 
(sweet) taste thresholds of approxi- 
mately 0.5 and 0.4 per cent. The 58
children averaged 0.68 per cent, and the 
52 elderly individuals, 1.23 per cent. In 
making their judgments, the human sub- 
jects were permitted to taste the sucrose 
solution and the distilled water, con- 
tained in glasses held in either hand, as 
often as they wished before giving their 
statements.
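
The logic of an ascending method of limits can be sketched in a few lines. The starting value of 0.01 per cent is taken from the text; the step size, the upper end of the series, and the simulated subject are assumptions made only so that the example runs.

```python
def ascending_limits(series, reports_difference):
    """Return the first concentration in an ascending series that the
    subject reports as different from water, or None if none is reported."""
    for concentration in series:
        if reports_difference(concentration):
            return concentration
    return None


# Hypothetical series: starts at 0.01 per cent as in the text; the 0.01
# step and the 2.00 per cent ceiling are assumptions.
series = [round(0.01 * k, 2) for k in range(1, 201)]


def simulated_subject(concentration, true_threshold=0.40):
    """Hypothetical subject who detects any solution at or above threshold."""
    return concentration >= true_threshold


print(ascending_limits(series, simulated_subject))
```

The same function serves for the second judgment described above, the point at which a sweet taste is recognized, if a different response rule is supplied for the subject.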


OLFACTORY SENSITIVITY 


Even fewer developmental studies of 
odor thresholds than of taste are in the 
literature. Methods applicable specifi- 
cally to infants and children are sparse 
indeed, and depend exclusively upon 
signs of aversion, irritation, or pleasure, 
the last of these being the most difficult 
to evaluate. Lack of adequate methods 
has produced contradictory statements 
by different investigators, some of 
whom deny that newborn infants ex- 
hibit any sense of smell. 

Disher (1934) made no claim that her 
method differentiated smell from taste. 
With a syringe she pushed air, or
samples of gas taken from collecting tubes
and mixed with air, into the nostrils of in-
fants. Two observers and a motion- 
picture record gave her the data on 
face and mouth movements, movements 
of extremities, breathing changes, eye 
movements, head movements, and 
vocalizations. Movements of face and 
mouth are reported to have been 25 per 
cent more frequent after use of an 
odorous substance than after the use of 
air. Other parts of the body gave lesser 
degrees of increase in activity. There 
was no clear indication of differential 
effects from different odors. The extent 
and frequency of responses did show 
increase with higher concentrations of
the olfactory stimuli. 

Other investigations (Pratt, Nelson,
and Sun, 1930; Stirnimann, 1936) used
automatic recording apparatus for
measuring general bodily motility or
graphic recording of respiratory and
circulatory changes. When such strong
stimuli as ammonia or acetic acid have
been used, the interpretation of the
results is confused by the question
whether pain, gustatory, or olfactory
receptors are involved.


Pain Sensitivity. Cutaneous pain is 
clearly a distinct sensory process, hav- 
ing separate receptors and central path- 
ways. Relations between this mode and 
the responses to intense stimulation of 
receptors for temperature and pressure 
and the distance receptors have not 
been clearly determined. Central phys- 
iological correlates of pain need further 
study before the common factor of 
extreme discomfort from mechanical, 
thermal, light, vibratory, and chemical 
irritants can be understood (Morgan 
and Stellar, 1950, Ch. 11). 

For the study of the young organism, 
a frequently used method has been to 
observe motor and autonomic response 
to pinprick. Most observers express 
some degree of surprise at the mildness 
of reactions elicited in fetal and new- 
born mammals, including man. Car- 
michael expresses this for the guinea 
pig, as observed approximately five 
days prenatally or at 63 days post- 
copulation age. Needle stimulation 
produced flexion or extension, includ- 
ing in many cases rhythmic limb move- 
ments, but “no indication of violent 
response or of the behavioral signs of 
pain were noted” (Carmichael, 1934, 
p. 413). The newborn human infant
responds inconsistently or not at all to 
pinprick (McGraw, 1943, p. 102). A 
week or so later crying and general 
bodily movements occur, with some- 
times a local flexion of the member 
stimulated. McGraw interprets this as a 
reflex movement, for there is a sub- 
sequent phase in which no localization 
of the stimulation, little crying, and 
only a widening of the eyes and some 
facial muscle activity give evidence 
that neural activation occurs. The in- 
fant at 150 days first shows deliberate 
withdrawal and a visual search for the 
spot being irritated. Perhaps this is the 
time of onset of an adult type of pain- 
ful experience, prior reactions having 
been largely reflex or subcortical in 
their mediation. 

Painful cold and heat presumably in- 
volve receptors other than or in addi- 
tion to those mediating temperature 
sensitivity. Failure to react to painful 
burns and cuts is sometimes observed 
as a congenital anomaly and is not nec- 
essarily associated with other sensory 
incapacities (Boyd and Nie, 1949). Evi- 
dence obtained does not tell us whether 
peripheral sensory or central neural 
dysfunction is involved. However, this 
independence of function is not crucial 
to the question of a receptor for thermal 
pain. In Carmichael’s study of the 
prenatal guinea-pig (1934, p. 436) he 
was not certain whether true tempera- 
ture stimulation or pain stimulation or 
both were involved in the extremes of 
heat and cold used. 

The studies by Dockeray and Rice 
(1934), Sherman, Sherman, and Flory 
(1936), and McGraw (1941) all agree 
in their demonstration of marked 
changes in response to pinprick or elec-
tric-shock stimulation as a function of 
age and experience between birth and 
6 months. More work is needed on the 
anatomical and neurophysiological
maturation of pain receptors (Bishop, 
1944), conduction pathways, and neural 
centers that are involved in pain. The 
problem of a behavioral criterion 
plagues workers with adult subjects 
almost as seriously as those studying in- 
fants. Beecher (1957, p. 115), who has
nothing to say about this for infants, 
indicates in his voluminous review that 
the problem is difficult: “Unfortunately 
there is no bodily reaction in man which 
occurs only in response to pain.” 

Apparently, no specific methods for 
infant or child studies of other kinds of 
pain have been systematically devel- 
oped. Deep somatic pain and visceral 
pain must be regarded as distinct from 
cutaneous pain but are even more dif- 
ficult to study experimentally. 


Vestibular and Somesthetic 
Receptivity 


MECHANISMS OF EQUILIBRIUM 
OF THE INNER EAR 


The so-called “sense” of equilibrium 
is usually specifically identified with 
sensory activity from the semicircular 
canals and vestibular sacs of the inner 
ear. Ordinarily a person also uses cues
from vision and kinesthesis to maintain 
bodily equilibrium. The functions of the 
labyrinthine receptors are consequently 
more difficult to isolate and study than 
are either the auditory or visual func-
tions. In other words, as Geldard (1953,
p. 249) has pointed out, the subject is
unaware of any direct sensations from 
the labyrinths. A sense of motion, loss 
of equilibrium, or dizziness are pure 
indicators of activity of the labyrinths 
only when visual, kinesthetic, and other 
organic receptors are held in a steady 
state. This is a problem that must be 
faced when techniques for study are 
being developed. 

Reflex Response Thresholds. Except 
for simple reflex indicators of gross 
responsivity within the vestibular 
mechanisms, there are no studies on in- 
fants and children that attempt to solve 
methodological problems and obtain 
normative developmental data for this 
system. Animal experimentation has 
been more extensive, but the methods 
are in many instances inapplicable for 
use with human subjects. Changes in 
rate of bodily motion are the normal 
physical correlates of labyrinthine 
stimulation. Temperature and mechani- 
cal stimuli can be used to indicate the 
presence of sensory processes, but these 
we will ignore in the discussion of the 
developmental study of this system, 
useful as they are for physiological 
investigation. 

Asking the child for a judgment of 
linear or rotary motion or nonmotion is 
a potentially useful approach. It seems
to have been exploited systematically
only with adults
(Dodge, 1923). A more sensitive in- 
dicator and perhaps a much easier judg- 
ment for a young subject is the oculo- 
gyral illusion, as employed in studies 
of adults by Graybiel, Kerr, and 
Bartley (1948). This illusion depends 
upon sensitivity to rotary motion. The 
chief technical problem is that of con-
trol and accurate measurement of rotary
acceleration or deceleration. The sub-
ject sits in a specially mounted chair. 
A tachometer is typically used to meas- 
ure rotation rates, which are themselves 
produced under only approximate con- 
trol. Either a throttle or voltage regula- 
tor, depending upon the source of 
power, may be used to govern changes 
in speed of rotation. Many trials are
given to explore the responses in the 
region of the absolute threshold and to 
provide continuous response curves 
from an average value of no apparent 
movement to consistent apparent move- 
ment in one direction or the other (to 
right or to left).
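
Since the tachometer gives rotation rate and not acceleration, the acceleration applied on a trial has to be derived from successive readings. A minimal sketch of that step follows; the readings and the 5-second sampling interval are hypothetical, and the function name is illustrative.

```python
def angular_accelerations(rates_deg_per_s, interval_s):
    """Finite-difference angular acceleration (degrees per second squared)
    between successive tachometer readings taken `interval_s` apart."""
    return [(later - earlier) / interval_s
            for earlier, later in zip(rates_deg_per_s, rates_deg_per_s[1:])]


# HYPOTHETICAL tachometer readings (degrees per second), one every 5 seconds.
readings = [10.0, 10.5, 11.0, 11.0, 10.0]
print(angular_accelerations(readings, 5.0))
```

Positive values indicate acceleration, negative values deceleration, and zero a period of constant rotation; the precision of such estimates is limited by the approximate control of rotation rate noted above.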

Judgment of the oculo-gyral illusion 
is made in three categories, the chief 
requirement being that the subject 
understand the instructions to report 
whether the single visual target ex- 
posed between periods of darkness is 
seen to move to the right or the left or 
to remain still. Graybiel et al. used a 
six-pointed collimated star whose three- 
dimensional properties are said to have 
reduced the difficulty otherwise en- 
countered from the autokinetic illusion. 
They permitted judgments by exposing 
the star at intervals of twenty seconds. 
Less difficulty with nystagmoid after- 
effects of motion would be encountered 
if trials were spaced at intervals of 
sixty seconds, permitting the attainment 
of a steady state during the period of 
constant rate of rotation between trials. 
This method gives the lowest angular
acceleration thresholds obtained in adult
subjects: on the order of 0.12 degrees
per sec.²

A method for the direct measurement
of vestibular nystagmus by recording 
eye movements was devised by Dohl- 
mann (1935). Neither this method nor 
the more modern photographic methods 
for recording eye movements have been 
used for developmental studies of ves- 
tibular sensitivity. Since they would re- 
quire only the passive cooperation of 
the subject, these methods would be 
highly feasible for studies of younger 
children. Threshold values are typically
about 1.0 degree per sec.² when this
method is used in adults, which com-
pares favorably with values of nearly
2.0 degrees per sec.² when the judg-
ment of perceived motion is required.

Other Potential Methods. It is clear
from the verbal judgments that children 
can make with respect to rotational and 
postrotational effects that learned re- 
sponses could be used in this sense mode 
as in hearing and vision. Further con-
firmation of such possibilities comes
from the successful conditioning of 
animals with angular acceleration as the 
conditioned stimulus (Spiegel and 
Oppenheimer, 1939). 


Temperature, Contact, and 
Pressure Sensitivity 


Although there has been some work 
on the first appearance of temperature, 
touch, and pressure sensitivity in the 
developing fetus and some demonstra- 
tions of sensitivity in the newborn, 
studies of these receptor modalities are 
conspicuously absent from the literature 
on infant and child development. Pratt,
Nelson, and Sun (1930) elicited
vigorous bodily responses to cold by 
using a temperature cylinder. More 
satisfactory control of tactual condi- 
tions was obtained by Crudden (1937), 
who left capsules in contact with the 
skin. These had inflow and outflow 
tubes to permit the prompt alteration 
of temperature by fluid interchange. 
Motion-picture and protocol records 
were taken of respiration and move- 
ment. Indications from a few records 
taken with three older infants en- 
couraged Crudden to conclude that 
local responses increased with age. 
Respiratory changes were the promptest 
and most consistent responses noted. 

Carmichael and Lehner (1937) used 
a method of constant stimuli to study 
the development of sensitivity in fetal 
animals. Their criterion of response to 
temperature was based on a comparison 
with the effect of tactual stimulation 
from drops of fluid of 37.5°C., the 
physiological neutral point. 

Jensen (1932) reports threshold 
values in newborn infants for hot and 
for cold in the mouth. His technique 
has been described in connection with 
his work on taste thresholds (see p.
299).

For touch and/or pressure, our in- 
formation includes some rough com- 
parisons of different bodily surfaces. 
About the only firm conclusion is that 
the region about the mouth is one of 
the earliest to develop sensitivity and 
retains priority thereafter as one of the 
more sensitive areas (Fitzgerald and 
Windle, 1942; Hooker, 1952). Touch, 
pressure, and somesthesis are each 
greatly in need of methodological ad-
vancement and developmental study.

“Look at it through the eyes of the
child” is a favorite maxim of popular 
writers on child psychology. What, 
actually, does the child see? American 
child psychologists have seldom asked 
this question, though in Europe research 
on perception, even in children, has 
always flourished. Development of per-
ception was studied in Peters’ laboratory
in Germany, in the Bühlers’ in Vienna,
and continues in Piaget’s in Geneva.
This background contrasts with the 
normative approach of American child 
psychologists during the comparable 
era. In our mental tests are many tasks 
that could be classified as perceptual: 
formboards, matching pictures, copy-
ing figures, and so on. By inspecting
the test norms, it is possible to compare
performances at different ages on these
tests and thus gain some rough notion
of perceptual development. But the time
is ripe for a direct experimental ap- 
proach to the problem. With the advent 
of the “new look” in perception, re- 
search with children on the effects of 
need and value on perception began to 
appear. The present trend toward em- 
piricism (“effects of early experience”) 
has stimulated interest in the develop- 
mental course of perceptual functions 
as such.

The psychologist about to undertake 
such research would do well to look 
first at the general sources on methods 
of studying perception: Guilford’s
Psychometric Methods (1954), the 
chapters on psychophysical measure- 
ment in Woodworth and Schlosberg’s 
Experimental Psychology (1954), Gib- 
son’s chapter, “Studying Perceptual Phe- 
nomena,” in Andrews’ Methods of 
Psychology (1948), and Thurstone’s 
A Factorial Study of Perception (1944). 

Choice of a specific method will al- 
ways depend on the investigator’s prob- 
lem. The question asked in a sense 
dictates the method. We shall consider
first, therefore, the principal questions 
that a psychologist interested in percep- 
tion might ask and then discuss the 
methods available and appropriate for 
each when the subjects are children of 
varying ages. 


Theoretical Issues and General 
Methodology 


What do we want to know about 
perception? Three major questions can 
be asked. 


1. What are the basic correlations 
between physical events and the per- 
ceiving organism’s discrimination of 
them? For the child psychologist, the 
question becomes whether or not the 
basic correlations show some develop- 
mental course or evolution. 

2. The second question concerns the 
phenomenal qualities of experience; 
what are they for the child and how 
do they change with age? 

3. The third question asks how the 
basic correlations between physical 
events and discrimination or phenome- 
nal experience are affected when other 
factors (e.g., attitudinal ones) are in-
troduced. This question has been of
special interest to some child psychol- 
ogists because of the possibility that 
children’s perceptions may be more 
subject than adults’ to distortion from 
complicating factors. 


Not all psychologists are equally in- 
terested in all three questions; choice 
of a problem is bound to be a matter 
of individual preference and systematic 
bias as well as expediency. 


THE FIRST QUESTION 


What is the developmental course or 
evolution of discrimination? How pre- 
cise and accurate is the relationship 
between physical events and the per- 
son’s differential reaction to them? Does 
precision increase with growth and
exercise? (In other words, the two
dimensions, physical and “experi-
ential,” are thought of as correlated to
some degree.) Does this correlation
show an increase with age? In answer-
ing this question, it is assumed that the
stimulus-experience correlations char-
acteristic of the adult are known and
can tell us what dimensions to vary
and to measure in the child. The ques-
tion then becomes, to what extent have
adult patterns of discrimination de- 
veloped at each age level? 

What is an adequate criterion of 
discrimination? An index of discrimina- 
tion must be differential—that is, be- 
havior must show a change that can 
be definitely correlated with a change 
in stimulation. In addition, a measure 
of the degree of specificity of response 
to the stimuli is needed. How wide a 
range of stimuli along a given dimension 
will produce the same response? An 
index of precision is needed for any 
study to be useful in a comparative 
sense. We can actually pose our first 
question just this way. What is the 
class of stimuli that will produce a 
given judgment? What happens to this 
class of stimuli genetically? Does it 
increase or decrease, become more gen- 
eral or more specific? The psychophy- 
sical methods were developed especially 
to provide adequate measures of dis- 
crimination. 

The Psychophysical Methods. For 
the older child, the psychophysical 
methods can be employed much as they 
would be for the adult. The least 
modification necessary is the best for a 
comparative approach. Detailed descrip- 
tions of the methods and statistical 
procedures are best found in such a
book as Guilford’s (1954). Some useful
ones are mentioned briefly here for the 
sake of establishing a terminology. 

The method of reproduction or ad- 
justment is a psychophysical method 
often used with children. The tradi-
tional method of adjustment varies only
one dimension of the stimulus. The S
adjusts a variable stimulus to reproduce,
as exactly as possible, the degree of
some specified dimension of the stand-
ard. The average error over a series
of trials yields an indication of the
precision of S’s discrimination of small
differences in the dimension measured.
The Galton bar is a classic apparatus 
for demonstration of the method. The S 
sets the length of the variable side to 
match the length of the standard. 
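
The scoring of an adjustment series can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `adjustment_errors` and the sample settings are hypothetical, and "average error" is taken here as the mean unsigned deviation of S's settings from the standard, with the mean signed deviation reported separately as the constant error.

```python
from statistics import mean

def adjustment_errors(standard, settings):
    """Summarize a series of adjustment trials (hypothetical helper).

    standard: physical value of the standard (e.g., length in mm).
    settings: the values S set on the variable stimulus, one per trial.
    Returns (constant_error, average_error).
    """
    deviations = [s - standard for s in settings]
    # Constant error: signed bias, positive if S tends to overshoot.
    constant_error = mean(deviations)
    # Average error: mean unsigned deviation, an index of precision.
    average_error = mean([abs(d) for d in deviations])
    return constant_error, average_error

# Invented Galton-bar settings (mm) against a 100 mm standard.
ce, ae = adjustment_errors(100.0, [98.0, 103.0, 101.0, 96.0, 102.0])
print(f"constant error = {ce:.2f} mm, average error = {ae:.2f} mm")
```

A small constant error with a large average error would indicate an unbiased but imprecise S, which is why the two measures are worth keeping apart in developmental comparisons.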

Many experimenters, using a method 
of adjustment with children, have as- 
sumed that the manipulation would be 
difficult for the child and have therefore 
done it for him, asking him to signal 
when the right point is reached. This 
procedure introduces a possibility of 
bias, since E usually knows when the 
adjustment is right and may give S 
cues by his facial expression or his speed 
of varying the stimulus. Unconsciously 
given cues from E must be guarded 
against in these experiments just as 
carefully as they would be in animal 
experimentation. We do not actually 
know to what extent a child may be 
influenced by cues from E’s expression 
or behavior; research on this problem 
with children of different ages would 
be useful methodologically and also in- 
teresting per se for social perception. 

An example of reproduction by copy- 
ing and by adjustment is found in 
research on the discrimination of paral-
lelism of lines by Wursten (1947, p.
112ff.). In one series children were 
given a sheet of paper containing a 
straight line or an oblique one and 
were asked to draw ten lines like it. 
All ten lines were drawn on the same 
sheet. Departure from the vertical, or 
from the angle given, was measured 
afterward. The drawing was done in a 
group situation, and demonstration with 
a blackboard preceded it. In such a 
situation the earlier drawings may in- 
fluence the later ones. Also, S’s motor 
skill can play a large role. To minimize 
motor skill, Wursten ran, individually, 
another series in which a needle was 
presented, vertical, horizontal, or at an 
angle, and S adjusted 11 other needles 
to the same position as the model. The 
needles could be rotated to a desired 
position, somewhat like the hands of 
a clock, but all could be viewed simul- 
taneously, so that the first objection 
above was not met. The child was fre- 
quently reminded to take account of the 
position of the standard and to compare 
with it the position given each new 
needle. Average errors and constant 
errors were calculated for both tech- 
niques. Errors were least for the vertical 
and horizontal positions at all ages. 
Progressive shift of the needle’s posi- 
tion, as the series progressed was ob- 
served with the oblique lines—a greater 
and greater departure from the stand- 
ard. For all measures, the older the 
child the better the performance, 
though differences in the vertical lines 
were small. 

The method of limits is useful for 
determining absolute as well as dif-
ferential thresholds. In this case S is
usually not given a pair of stimuli to
compare. Instead, he is instructed to
detect a change in some indicated di-
mension of the variable stimulus. The
stimulus is gradually changed in this
aspect, and S reports when the change
is first detected. The stimulus is varied 
in both increasing and decreasing 
orders, so that an upper and lower limit 
is obtained. The “zone of uncertainty” 
is the area between these limits and 
provides a measure of precision. Some 
measure (e.g., the mean) between them 
may be taken as the actual threshold. 
The traditional example of the use of 
this method is the two-point limen on 
the skin. The S, with this method, must 
keep in mind some concept of the 
change to be detected. This concept 
may itself shift with practice and per- 
haps with age. Friedmann (1927) in- 
vestigated the development of the 
two-point limen in children, combining 
the method of limits with a constant 
method. 
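
The arithmetic of the method of limits, as described above, can be sketched as follows. The function name `limits_summary` and the two-point separations are hypothetical; the sketch assumes that the transition point of each ascending and each descending series has already been recorded, and it takes the mean of the two limits as the threshold, which is only one of the possible choices.

```python
from statistics import mean

def limits_summary(ascending_points, descending_points):
    """Summarize a method-of-limits session (hypothetical helper).

    ascending_points: stimulus values at which S first reported the
        change in each increasing series.
    descending_points: stimulus values at which S's report changed
        in each decreasing series.
    """
    asc = mean(ascending_points)
    desc = mean(descending_points)
    lower, upper = min(asc, desc), max(asc, desc)
    return {
        "lower": lower,
        "upper": upper,
        # Width of the zone of uncertainty: a measure of precision.
        "zone_of_uncertainty": upper - lower,
        # The mean of the two limits is taken as the threshold.
        "threshold": (lower + upper) / 2,
    }

# Invented two-point separations (mm) from three series in each order.
summary = limits_summary([14.0, 16.0, 15.0], [10.0, 12.0, 11.0])
print(summary)
```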

Upper and lower thresholds are usu- 
ally investigated by a method of limits. 
Zwaardemaker (reported in Peters, 
1927) measured the upper tonal thresh- 
old, using a Galton whistle, in three 
age groups, one below 10, one 10-12, 
and one 20-30. 

The constant method (right and 
wrong cases) is often used with adult 
S’s and is widely applicable. It yields 
the typical psychophysical curves or 
psychometric functions. Such a method 
(in getting differential limens) calls for 
a judgment of more, equal, or less when 
a standard and a comparison stimulus 
are presented (either successively or 
together). The “equal” category is 
often prohibited so as to force S to 
make a differential judgment. Variation
of the stimulus is accomplished by
randomly presenting comparison stimuli
whose values differ by scale steps along
the critical dimension. Each value of the
variable stimulus must be presented
many times with the standard, since S
may guess; the measure of S’s vari-
ability indicates his precision of judg-
ment. Order and other intraserial effects
(e.g., the time error and the space
error) must be compensated for in the
procedure. The method is necessarily
long and may fatigue and bore a child.
Motivation may therefore become con-
siderably important. The Seashore tests
provide familiar examples of the con- 
stant method used to measure dis- 
crimination of pitch and loudness dif- 
ferences (Saetvit et al., 1940). The 
tests have often been used with school 
children. 
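
A minimal sketch of how data from the constant method might be reduced is given below. It assumes two-category judgments ("greater" or "less", with "equal" prohibited), a proportion of "greater" judgments that rises with the comparison value, and simple linear interpolation for the 25, 50, and 75 per cent points; the function names and the lifted-weight data are hypothetical. Guilford (1954) describes more refined fitting procedures.

```python
def interpolate_point(levels, proportions, target):
    """Stimulus value at which the proportion of 'greater' judgments
    reaches `target`, by linear interpolation between adjacent levels."""
    pairs = list(zip(levels, proportions))
    for (x0, p0), (x1, p1) in zip(pairs, pairs[1:]):
        if p0 <= target <= p1 and p1 > p0:
            return x0 + (target - p0) * (x1 - x0) / (p1 - p0)
    raise ValueError("target proportion is not bracketed by the data")

def constant_method_summary(levels, n_greater, n_trials):
    """Point of subjective equality and differential limen from counts
    of 'greater' judgments at each comparison level."""
    proportions = [g / n_trials for g in n_greater]
    x25 = interpolate_point(levels, proportions, 0.25)
    x50 = interpolate_point(levels, proportions, 0.50)
    x75 = interpolate_point(levels, proportions, 0.75)
    # DL taken as half the interquartile range of the psychometric function.
    return {"pse": x50, "dl": (x75 - x25) / 2}

# Invented data: comparison weights (g) against a 100 g standard,
# each presented 20 times; counts of "heavier" judgments.
levels = [96, 98, 100, 102, 104]
n_greater = [1, 4, 10, 16, 19]
print(constant_method_summary(levels, n_greater, n_trials=20))
```

The need for many presentations per level is visible here: with only a few trials the proportions are too coarse for the interpolated points to mean much.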

The method of recognition is fre- 
quently used with children when a 
constant method might be used with 
adults, since the procedure seems easier 
for the child to follow. The task is 
simply to say whether a given variable 
stimulus is the same as or different from 
a standard stimulus, or whether it is 
“equal” or “not equal,” if these words 
are more easily comprehended. Gilbert 
(1894), investigating thresholds for 
lifted weights in children, presented the 
child with a series of comparison stimuli 
and asked him to say which one or 
ones were equal to the standard. Re- 
quiring the child to make a “heavier” 
or “lighter” judgment for each pair 
would, presumably, have been more 
exacting. 

The standard and the variable stim- 
ulus may be presented successively, 
or they may be presented together.

Fig. 8-1. Nonsense items differing in three
dimensions of variation.


There may be only one comparison 
object present for a given judgment, 
or there may be a series. Control of 
position is necessary when simultaneous 
comparison is used, since children are 
very susceptible to place habits. If the 
child is to choose from a group of 
samples the one that is the same as the 
standard, there is the question of how 
the group or series is to be arranged— 
in some kind of order or randomly? 
And how extended is the series to be? 
If the series is scaled, how large is the 
step in the scale from one member to 
the other? These problems are con- 
sidered with specific examples under 
constancy, since the method has been 
widely used with children in this case. 

An experiment by Gibson and Gibson 
(1955) is an example of the method 
of recognition used to study develop- 
ment of form perception in children 
and adults. The increase in specificity 
of identification or recognition was the 
essential comparison. The S was pre-
sented with a sample item, a scribble
drawn on a card, which served as the 
standard. His recognition of it was 
tested when it was interspersed in a 
series of 17 similar scribbles, each of 
which varied in one of three ways 
(number of coils, horizontal compres- 
sion, and orientation) from the stand- 
ard. Twelve other items, more easily 
distinguishable, were included. The 
items are shown in Figs. 8-1 and 8-2.
The S was shown the standard for five 
seconds and told that some of the items 
to be shown would be exactly like it. 
The series of items (including four 
identical copies of the standard) was 
then presented one at a time for three 
seconds each, and S was asked to report
which of them were the same as the 
standard. He was not told when his 
identification was correct or incorrect. 
At the end of the series S was again 
shown the standard, and another series, 
in a different random order, was run 
through. This procedure continued 
until S made only the four correct 
identifications in one trial.

Fig. 8-2. Nonsense items differing in many
dimensions of variation.

Three groups
of S’s, adults, older children (8½ to 11
years) and younger children (6 to 8
years) took part. Scores available were 
the mean number of trials required for 
a completely specific response, the 
mean number of undifferentiated items 
on the first trial, and the analysis of 
false recognitions by the dimensions of 
variation. Differences in the three age 
groups were apparent by all criteria. 
Comprehension of the task and control 
of cues from the E, as well as the pre- 
vention of flaws or marks on the cards 
that would permit identification of 
particular cards, are the main problems 
to be faced with this technique. Photo- 
graphic reproduction of the patterns 
and duplication of the series should take 
care of the latter difficulty. 
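
Two of the scores mentioned above, trials to a completely specific response and undifferentiated items on the first trial, could be computed along the following lines. This is a sketch only: the function name, the item labels, and the sample protocol are hypothetical and are not taken from Gibson and Gibson's report.

```python
def score_recognition(trials, standard_copies):
    """Score one S's protocol in the recognition experiment.

    trials: list of sets, one per run through the series, each holding
        the ids of the items S called "same as the standard".
    standard_copies: ids of the identical copies of the standard.
    Returns (trials_to_criterion, first_trial_false_recognitions);
    trials_to_criterion is None if the criterion was never reached.
    """
    copies = set(standard_copies)
    # Items wrongly identified with the standard on the first run.
    first_trial_false = len(trials[0] - copies)
    trials_to_criterion = None
    for number, chosen in enumerate(trials, start=1):
        # Criterion: only the copies of the standard are identified.
        if chosen == copies:
            trials_to_criterion = number
            break
    return trials_to_criterion, first_trial_false

# Invented protocol: three false recognitions at first, none by run 3.
copies = {"std1", "std2", "std3", "std4"}
protocol = [
    copies | {"coil3", "orient2", "compress1"},
    copies | {"coil3"},
    copies,
]
print(score_recognition(protocol, copies))
```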

Many modifications of the methods so 
far described have been devised to 
suit the requirements of the problem 
and the subjects to be tested. In Piaget’s 
laboratory a variation of the method of 
limits that can be used either with con- 
stant stimuli or adjustment has been de- 
vised especially for children. It is 
known as the “concentric clinical 
method” (méthode concentrique clini- 
que) and can be used with adults as 
well. It is a shortened procedure, “clini- 
cal” in the sense that it is modified to 
suit the individual S. It is “concentric” 
in that extreme values of the comparison 
stimulus are presented first to get the 
child’s range, roughly, and then smaller 
differences are introduced, working in 
toward the limen. Alternate presenta- 
tions of values above and below the 
standard are given to determine, as 
precisely as possible, upper and lower 
thresholds. Median values are used to 
get threshold width, or area of un-
certainty, and their average is used
as a point of equivalence. Many descrip-
tions of the method are available [see,
for instance, Lambercier (1946, p.
141ff.)]. Wursten (1947) used the
method, with constant stimuli, in
experiments on discrimination of the
parallelism of lines. He used cards on
which two lines were drawn. One, on
the left, was vertical. The other, 2 centi-
meters to the right of it and extending
obliquely downward from the base of
the vertical line, was at an angle vary-
ing from 151 to 119 degrees by step
degrees. On the standard card the
oblique line was always at an angle of
135 degrees. The standard card was
presented on the left and the variable
card on the right (see Fig. 8-3). The 
subjects were asked to judge if the 
oblique lines had the same inclination; 
if one was straighter or more leaning 
than the other. From the results were 
calculated thresholds of equality (the 
average distance in degrees between the 
extremes accepted by the S’s), average 
error, and constant error for different 
age groups from 5 years to adult. All 
show a very consistent development. 
The spread of the threshold varied from 
18.3 degrees in 5-year-olds to 2.3 de- 
grees in adults. Usually, with a constant 
method, the variable and the standard
would be shifted from right to left to 
compensate for a possible “space 
error.” More attention to the standard, 
or fixation of it, may be responsible for 
another systematic error called by 
Piaget the “error of the standard” 
(Piaget and Lambercier, 1943). Over- 
estimation of the standard, whatever 
its position, whatever the temporal
order of standard and variable, or what-
ever the stimulus dimension, is char-
acteristic of this systematic error. It is
a unique type and was apparently dis-
covered in Piaget’s laboratory.

Fig. 8-3. Cards used for testing discrimination of the parallelism of lines (from Wursten, 1947).

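
The "concentric" order of presentation described earlier in this discussion can be illustrated with a short sketch. It is a deliberate simplification: the clinical method adapts the sequence to the individual S, whereas the hypothetical function `concentric_order` below merely fixes an order that starts at the extremes, alternates above and below the standard, and works inward. The 4-degree step in the example is an assumption made for illustration and is not Wursten's value.

```python
def concentric_order(standard, comparisons):
    """Order comparison values from the extremes inward, alternating
    values above and below the standard (illustrative simplification)."""
    # Farthest above the standard first.
    above = sorted((c for c in comparisons if c > standard), reverse=True)
    # Farthest below the standard first.
    below = sorted(c for c in comparisons if c < standard)
    order = []
    for i in range(max(len(above), len(below))):
        if i < len(above):
            order.append(above[i])
        if i < len(below):
            order.append(below[i])
    return order

# Angles from 119 to 151 degrees around a 135-degree standard,
# with an assumed (hypothetical) step of 4 degrees.
angles = [a for a in range(119, 152, 4) if a != 135]
print(concentric_order(135, angles))
```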
Fractionation (ratio judgments) is 
one of the methods used with adults 
for psychological scaling. On the whole, 
such methods are too complicated for 
children. But Piaget (1949) has studied
the perception of the size of angles, in 
three age groups, with a method of 
ratio judgments. He wished to investi- 
gate the overestimation of acute angles 
and the underestimation of obtuse angles 
as a function of age. He used as measures 
the doubling of an acute angle and the 
halving, or taking a third of, an obtuse 
angle. The method was not applicable 
under 7 years, and “taking a third of” 
was impossible at 7 and 8 years. The S 
was given a standard angle (60 degrees
for the acute, 120 degrees for the 
obtuse) and then presented with a series 
of variables, one by one, varying by 5 
degrees, from which he was to choose 
the double, the half, or the third, as 
the case might be. A “concentric” 
method was used (alternate increase 
and decrease). Constant and average
errors were calculated, the average
errors showing a clear decrease with 
age. The role of “the error of the 
standard” in ratio judgments is discussed 
in some detail. 

The method of absolute judgment 
(single stimuli) involves the assigning 
of specified categories or names to 
stimuli presented singly to the S. As a 
rule, these categories would be like
successive values on a scale of yards,
differing quantitatively
along a defined continuum. But con- 
ceptual scales such as yards, inches, or 
pounds, are not available before the 
educational process is well underway. 
Less abstract categories might be tried 
—easy rating scales such as long- 
medium-short. If simple scales were 
used, with a small number of categories, 
the amount of information transmitted 
for different age groups could be com- 
pared for stimulus dimensions such as 
brightness, length, and numerosity. 
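
The amount of information transmitted can be estimated from a table of stimulus categories against response categories. The sketch below uses the usual definition of transmitted information (in bits) for such a table; the function name and the example counts are hypothetical, and no correction for small-sample bias is attempted.

```python
from math import log2

def information_transmitted(confusion):
    """Transmitted information in bits from a stimulus-response table.

    confusion[i][j] is the number of times stimulus category i was
    given response category j.
    """
    total = sum(sum(row) for row in confusion)
    row_totals = [sum(row) for row in confusion]
    col_totals = [sum(col) for col in zip(*confusion)]
    bits = 0.0
    for i, row in enumerate(confusion):
        for j, n in enumerate(row):
            if n:  # empty cells contribute nothing
                bits += (n / total) * log2(n * total / (row_totals[i] * col_totals[j]))
    return bits

# Invented long-medium-short judgments of three lengths, 10 trials each.
perfect = [[10, 0, 0], [0, 10, 0], [0, 0, 10]]
confused = [[7, 3, 0], [2, 6, 2], [0, 3, 7]]
print(information_transmitted(perfect))   # upper limit for 3 categories
print(information_transmitted(confused))  # something less
```

With three equally frequent categories the ceiling is log2(3) bits, so age groups are compared on how closely they approach it.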

Categories that are not successive or 
scalar can, of course, be assigned to 
stimuli presented. Such a method is not 
usually classified as a psychophysical 
method, but it would nevertheless yield 
information about discrimination. Such
a method can be called identification.
Naming objects or other stimuli pre- 
sented—for instance, color naming— 
would be an example. Cook (1931) in- 
vestigated color naming in children and 
found that it increased in accuracy (for 
a given sample of colors) from 25 per 
cent at 2 years to 62 per cent at 6 years. 
Vocabulary obviously plays a big role 
in performance in such a task.

It should be clear that different 
thresholds and consequently different 
developmental curves can be obtained, 
depending on the psychophysical 
method chosen. Concrete instances of
such divergence are presented in a later 
section. Other methods than these 
mentioned, although infrequently used 
with children, are described in the 
general sources referred to and may 
suggest to an experimenter a modifica- 
tion especially appropriate for an attack 
on his problem. If the requirements of 
good psychophysical measurement are 
understood, there is no reason why a 
particular textbook method must be 
rigidly adhered to. 

“Adapting” a technique to suit a 
given age level should be cautious, how- 
ever. A developmental study can be 
defeated by too free adaptation for 
different ages. Instructions should be 
kept essentially the same. Frequently, 
experimenters have varied their instruc- 
tions to conform to the child’s vocabu- 
lary at different age levels, thus biasing 
their results. Failure to understand the 
task by a younger child is equally dis- 
astrous. 

Substitutes for Psychophysical Meth- 
ods. For very young children, the 
psychophysical methods must be sup-
plemented by methods suited to the
abilities of the child. Verbal judgments
can be replaced by other indicator re-
sponses, as they are in animal experi-
mentation. Instructions can be replaced
by demonstration and example, training
procedures, or, ideally, by responses
having spontaneous motivation. In any
case, the stimuli should be varied di-
mensionally, as they would be in
customary psychophysical procedures.
Substitutes for those psychophysical
methods used with older children and
adults must be carefully validated if
they are to be compared from age to
age with other methods.

The crudest tests of discrimination in 
an infant are those that simply measure 
the gross amount or change of activity 
when the critical stimulus is presented. 
That is, differential responses are not 
singled out and measured, but any 
change in degree of responsiveness at 
the onset of stimulation (e.g., magnitude 
of response, latency, duration, rate of 
emission) is noted. The general activity 
technique used by Pratt, Nelson, and 
Sun (1930) is a good example of this. 
The infant is placed on a “stabilimeter” 
platform, which wobbles whenever he 
moves. The movement is transmitted by 
string and pulley mechanisms to writ-
ing pens of a polygraph, so that amount
and change of activity can be recorded
for a control interval and after the onset
of a prepared stimulus. The infant’s
repertory of activity is small, but such
responses as crying, breathing rate, and
startle can be observed directly and
measured in various aspects when criti-
cal stimuli are introduced. When there
is no specific indicator response, it must
be inferred that any change in what-
ever aspect of the response is being
measured is due to the stimulation E
is interested in and not to some other
cause. Even if E is satisfied that he has
adequately controlled other stimuli and
that the increase or decrease in response
is caused by the stimulation he has
presented, he has only this crude fact.
The child’s discriminative ability cannot
be measured precisely by such a
method.

*Munn (1955, Ch. 8) has an excellent
discussion of methods of studying sensory
processes in infants and young children. K. U.
Smith’s (1951) chapter on discriminative be-
havior in animals is also extremely relevant,
since the problems of measuring discriminative
behavior in animals are much the same as in
preverbal children.


Even with young infants, it is often 
possible to do a little better than ob- 
servation of mere amount or change of 
general activity, since a few differential 
responses exist from birth or appear 
soon afterward. (Presence or absence 
of the reaction to alternative stimuli 
or differential orientation to them can 
then be looked for.) Crying, sucking, 
smiling, following with the eyes, and 
rejecting substances from the mouth are 
relatively selective, i.e., correlated with
certain stimulus categories and not
others. They have been used extensively
in studying perception in the infant.
Jensen (1932) studied differential reac- 
tions to taste and temperature stimuli 
in newborn infants by measuring dif- 
ferential sucking to various experimental 
stimuli and comparing it with response 
to a control stimulus. The best-known 
example is perhaps the developmental 
study of vision by Gesell, Ilg, and Bullis 
(1949). A description of their tech- 
niques is given in their appendix A. 
For the infant, observation of fixation,
eye-hand coordination, and direction
of regard provides the data. Fixation
preference has been used systematically 
with very young infants for evidence of 
color discrimination (Staples, 1932) and 
pattern discrimination (Fantz, 1958). 
Staples’ experiments are well known; 
she measured the amount of time a 
colored disk was fixated when two disks 
were placed 6 inches apart on a piece 
of cardboard. They were of equal area, 
equidistant, and of equal brightness. The
technique is similar to that of paired 
comparisons (Woodworth and Schlos- 
berg, 1954, p. 252ff.) and requires the 
usual controls, such as randomized 
sequence, compensation for possible 
space errors, and equalization of other 
possible variables, such as movement. 
Berlyne (1959) points out that the E 
holding the cards should not know 
which ones are present, lest his manner 
of holding the cards or recording fixa- 
tion time be biased. 
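
The controls just listed (randomized sequence and compensation for space errors) can be built into the trial schedule before testing begins. The sketch below is one hedged way of doing so: the function name `paired_trials`, the stimulus labels, and the seed are hypothetical, and each pair is simply shown once in each left-right arrangement before the whole list is shuffled.

```python
import random

def paired_trials(pairs, seed=None):
    """Build a randomized schedule in which every pair appears once in
    each left-right arrangement, so that position is balanced."""
    rng = random.Random(seed)
    trials = []
    for a, b in pairs:
        trials.append((a, b))  # a on the left, b on the right
        trials.append((b, a))  # positions reversed
    rng.shuffle(trials)        # random order of presentation
    return trials

# Invented pairs: each colored disk against a gray of equal brightness.
schedule = paired_trials([("red", "gray"), ("blue", "gray")], seed=1)
for left, right in schedule:
    print(f"left: {left:5s} right: {right}")
```

Following Berlyne's point, such a schedule could be prepared by someone other than the E who holds the cards, with the cards identified only by code.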

A response that is sufficiently specific, 
such as sucking, may be used in the 
manner of the ethologists to discover 
what they would term the “innate 
releaser.” Their method gives evidence 
at a very early age of certain stimuli 
that are discriminated from a complex 
or from other presumably similar 
stimuli. Gunther (1955), looking for the 
“sign stimulus” for sucking, found that 
tactile stimuli within the mouth pro- 
vided by the nipple and the protractile 
tissues behind it, were the critical re- 
leasers for sucking in the neonate. But 
such innate mechanisms are few in the 
human infant and besides are quickly 
replaced or added to by learned stimuli. 

Although certain responses (sucking, 
smiling, fixation, etc.) are to some ex-
tent differential, specificity is by no
means complete, so that the validity of 
assessing discrimination by means of 
them is questionable. If an infant smiles 
equally at two different objects or looks 
equally long at two colors presented 
side by side, it does not insure that he 
cannot discriminate them. Only in case 
of positive evidence of discrimination, 
the occurrence of a response known to 
be specific, can we draw a firm con- 
clusion. 

Another problem of validation with 
unlearned differential responses is the 
necessity of assuming that maturation of 
motor functions antedates perceptual 
maturation. It is often assumed, for in- 
stance, that space perception in an in- 
fant is inaccurate because he makes 
errors in reaching for objects. It could 
be equally possible that the motor func- 
tions alone are unskilled or that co- 
ordination of perceptual and motor 
functioning is incomplete. 

In animals the comparative study of 
discrimination has most often been 
carried out by a learning method; the 
animal is reinforced or rewarded for 
response to one situation and not to 
another. Conditioning techniques have
yielded much information about preci- 
sion of discrimination and even per- 
ceptual organization (Smith, 1951) in 
animals. Actually, the formation of any 
conditioned response is evidence of 
some discrimination, for the conditioned 
stimulus must be detected in the total 
flow of stimulation in the environment 
and responded to; and suitable controls 
ought always to be made to show that 
the CS was neutral (not responded to 
with the CR) before conditioning. For 
evidence of more precise discrimination
in conditioning, what Pavlov called
the “method of contrasts” must be re-
sorted to. After the CR is established,
generalization will occur to similar
stimuli; the critical one is then presented
alternately with one or another stimulus
designed to be negative. The positive
stimulus is consistently reinforced, the
negative one is not. Differentiation is
best established by starting with a
negative stimulus not too close on a
similarity dimension with the positive
and progressing, after a successful dif-
ferentiation, to more similar stimuli
and a finer discrimination. Conditioning
techniques have been used widely with
children and infants in Russia. Razran
(1933) reports a number of these. Con- 
trol of the stimulus is very careful in 
the Russian experiments, and the S 
is usually isolated from E. The youngest 
S’s appear to have been infants 13 to 
73 days old. Levikova and Nevymakova 
studied differential auditory condition- 
ing to food (nursing and food-seeking 
responses). The positive CS was B* on 
an organ pipe; an electric bell and B? on 
an organ pipe were to be differentiated. 
The infants were blindfolded during the 
experiment. Differential CR’s were 
found at 110-122 days, simple CR’s, 
earlier. 

Conditioning techniques traditionally 
present one stimulus at a time only. A 
simultaneous choice with differential 
reward from the beginning is, of course, 
possible and may be quicker. Food is 
generally the reinforcing agent in such 
an experiment. It can be quite literally 
in the stimulus object; for instance, by 
sweetening an object of one shape but 
not one of another shape or coloring 
liquids in a bottle and sweetening some 
colors but not others. Ling (1941) 
studied form discrimination in infants 
as young as 6 months. Two or more 
blocks differing in shape were presented 
on a board. The “correct” form was 
sweetened with saccharine and could be 
lifted to the mouth. The “incorrect” 
forms were fixed to the board. Dif- 
ficulty was increased by increasing the 
number of blocks. 

Welch (1939) used a technique 
intermediate between conditioning and 
simultaneous choice (he refers to it as 
“conditioning”) to study size dis- 
crimination. Children learned to find 
candy under a small box by repeated 
presentation of it; no language was 
used. When discrimination testing be- 
gan, however, two boxes were presented 
simultaneously. The biggest contrast 
(largest and smallest) was presented 
first, until the child chose correctly. 
Thereafter, the difficulty of the choice 
was increased step by step until S 
reached his limit of discrimination. A 
good deal of pretraining is generally 
required before the critical stimuli are 
introduced. Munn and Stiening (1931), 
using a two-choice discrimination ap- 
paratus with doors, allowed S to ex- 
amine the apparatus and find chocolate 
behind both doors before patterns to 
be discriminated were put on the doors. 
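
The step-by-step procedure just described can be sketched in code. This is only an illustration under stated assumptions, not Welch's actual protocol: the size ratios, the criterion of three consecutive correct choices, the cap on trials per step, and all function names are hypothetical and were chosen for the example.

```python
from typing import Callable, List, Optional


def find_discrimination_limit(
    ratios: List[float],
    chooses_correctly: Callable[[float], bool],
    criterion: int = 3,
    max_trials_per_step: int = 10,
) -> Optional[float]:
    """Return the hardest ratio S mastered, or None if the first step failed.

    ratios: size ratios (larger box / smaller box), easiest contrast first.
    chooses_correctly: hypothetical stand-in for one trial with the child.
    criterion: consecutive correct choices needed to pass a step (assumed).
    """
    mastered = None
    for ratio in ratios:
        streak = 0
        for _ in range(max_trials_per_step):
            if chooses_correctly(ratio):
                streak += 1
                if streak >= criterion:
                    break
            else:
                streak = 0
        if streak < criterion:
            break  # S has reached his limit; difficulty is not increased further
        mastered = ratio
    return mastered


# Illustrative deterministic "child" who succeeds only when the larger
# box is at least 1.2 times the size of the smaller one.
def simulated_child(ratio: float) -> bool:
    return ratio >= 1.2


steps = [3.0, 2.0, 1.5, 1.2, 1.1, 1.05]
limit = find_discrimination_limit(steps, simulated_child)
print(limit)  # 1.2
```

With a real child the trial function would be replaced by the recorded choice on each presentation, and the criterion would have to be fixed in advance.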

With choice-learning techniques, 
controls must be made for irrelevant 
cues that are potentially differential— 
olfactory cues, for instance, when food 
is the incentive. Unconsciously given 
cues from E and systematic errors 
caused by position and order are im- 
portant. Children often develop a 
tendency, for instance, to choose the 
side that was “right” last time. Gellermann 
(1933a,b) has a good discussion of 
irrelevant cues in discrimination-learn- 
ing experiments. It is important to re- 
member, also, that the experimenter 
can make two kinds of error—one by 
concluding that there is evidence of 
discrimination when some other factor 
may really be responsible and another 
by denying it when it was actually 
present. 
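
One common safeguard against position and order errors is to fix the left-right position of the correct stimulus in advance, so that each side is correct equally often and long runs on one side are avoided. The sketch below illustrates that idea only; the constraints (equal numbers of left and right, no run longer than three) are assumptions made for the example and are not a reproduction of Gellermann's own criteria, which are not given in the text. The function names are hypothetical.

```python
import random
from typing import List


def longest_run(seq: List[str]) -> int:
    """Length of the longest run of identical consecutive positions."""
    if not seq:
        return 0
    best = run = 1
    for prev, cur in zip(seq, seq[1:]):
        run = run + 1 if cur == prev else 1
        best = max(best, run)
    return best


def balanced_position_sequence(n_trials: int = 10, max_run: int = 3,
                               seed: int = 0) -> List[str]:
    """Shuffle until L and R are correct equally often with no long runs."""
    if n_trials < 2 or n_trials % 2 != 0:
        raise ValueError("n_trials must be a positive even number")
    if max_run < 1:
        raise ValueError("max_run must be at least 1")
    rng = random.Random(seed)
    sides = ["L"] * (n_trials // 2) + ["R"] * (n_trials // 2)
    while True:
        rng.shuffle(sides)
        if longest_run(sides) <= max_run:
            return list(sides)


def stay_strategy_score(seq: List[str]) -> int:
    """Trials (after the first) won by always choosing the side correct last time."""
    return sum(1 for prev, cur in zip(seq, seq[1:]) if cur == prev)


sequence = balanced_position_sequence(n_trials=10, max_run=3, seed=1)
print(sequence, stay_strategy_score(sequence))
```

The second function scores the habit mentioned above, choosing the side that was "right" last time, against a planned sequence, so that E can see how many correct choices that habit alone would produce.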

With slightly older children, the 
potential irrelevant cues arising from use 
of food as a lure may be avoided by 
putting a tiny toy within the correct 
choice box or compartment. If the 
child is allowed to keep the toys or play 
with them, this may be sufficient in- 
centive. Motivation, as well as irrelevant 
cues, is a big problem to be faced in 
establishing the validity of discrimina- 
tion-learning techniques. 

Matching methods, if the child is old 
enough to be instructed by example or 
by a few simple words, have a certain 
advantage over discrimination-learning 
techniques. It is possible to test many 
more comparison stimuli against the 
standard or “correct” one in a short 
time. Weinstein (1941) used the match- 
ing of a sample to study form and 
object discrimination in rhesus monkeys 
and in young children. Stimulus objects 
were presented on a tray that had a 
sample-object-area on the right and a 
choice-object-area on the left. Training 
was carried on with two choice objects, 
one identical with the sample. Food 
wells holding a piece of candy or other 
food were under the forms. No lan- 
guage was used in the training. The 
children were later able to transfer the 
matching principle to a four-choice 
situation and to series of new stimulus 
objects. A control series, in which E 
was concealed during the experiment, was 
introduced to test for secondary cues. 

Identification learning is another tech- 
nique that, if successful, reveals dis- 
crimination in a group or series of 
stimulus objects. It entails learning dif- 
ferent responses to each of the stimulus 
objects. When verbal responses are not 
easily available, it is not a frequent 
choice. But it may be the only pos- 
sibility in getting at a secondary prob- 
lem, such as the effect of similar and 
dissimilar learned responses on develop- 
ment of discrimination. Dietze (1955) 
used such a method with 4- and 5-year- 
old children who learned nonsense 
syllable names to sets of forms (three in 
a set). The forms were discriminated 
faster when the names were dissimilar. 
The reward here was a tiny toy when 
each set was learned, and E later made 
S a bracelet of the toys. 

The identification-learning method, 
obviously, would compare unfavorably 
with simpler techniques, such as match- 
ing, if time to complete the discrimina- 
tion were taken as a criterion. Needless 
to say, different learning techniques 
may yield different thresholds, as do 
the psychophysical techniques. A de- 
velopmental study, therefore, has the 
necessity of choosing the most ap- 
propriate technique and sticking to it 
for all age levels. 


THE SECOND QUESTION 


What is the nature of the phenomenal 
world of the child and how does it 
change with age? This question has 
been asked primarily by Gestalt 
psychologists (Koffka, 1931), but the 
phenomenological approach has been 
taken by others as well, notably Piaget. 

At each stage of development 
Gestalt theorists assume that the 
behavior of the child is in large part a 
function of the phenomenal world in 
which he lives, that is, the world of his 
experience. Very little is known about 
the perceptual world of the child. 
Unfortunately, we cannot look at the 
world through the eyes of the child, and 
we do not really know what the child 
sees and in what respect it differs from 
what we see. It is necessary to discover 
the nature of the perceptual phenomena 
that the child experiences. By 
comparing the different stages of 
development, we can determine to what extent 
and in what respect the phenomenal 
world changes as the child matures. 
However, “phenomenology in psychology 
can never be a substitute for 
psychophysics and psychophysiology. 
Its function is rather to set the initial 
problem, to define the psychological 
datum which must then be envisaged in 
the setting of its physical and 
physiological correlates” (MacLeod, 1958, p. 34). 

Method of Research: Description of 
Experience. The phenomenological 
method requires of the subject a naïve 
description of his experience. Rather 
than being required to make a perceptual 
discrimination, he is asked simply to tell 
the experimenter what he sees. To do 
this in a systematic way, we can set up 
controlled experiments touching on 
various aspects of perceptual experience, 
such as size, shape, movement, and 
depth. These experiments must be as 
carefully performed as the 
psychophysical experiments already discussed. 
The primary difference is in the 
instructions given the subject. Since the 
aim is the qualitative nature of the 
child’s experience, the experimenter 
presents the child with a given stimulus 
situation and asks, “What does it look 
like?” “What seems to be going on?” 
“What do you see?” The child is not 
directed to report on any particular 
aspect of his percept but just to tell how 
it appears to him. 

This method, frequently used with 
adults and older children, can be ef- 
fective with younger children as well. 
Olum (1956) used it with 7-year-olds 
in an experiment on developmental dif- 
ferences in the perception of causality 
by means of the Michotte disk tech- 
nique. When the disk (in Fig. 8-4) is 
rotated counterclockwise behind a 
screen containing a narrow horizontal 
slit, the S is presented with two small 
rectangles, one black and one red, 
moving from left to right. Under certain 
conditions of movement, e.g., rate or 
beginning and ending of the two 
movements, an adult S perceives a black 
object, which hits or bumps a red 
object and causes it to move. This is 
the “causal effect.” Under somewhat 
different stimulus conditions this effect 
is destroyed. In the Olum study 
stimulus conditions were used to 
represent both of these situations, i.e., the 
expected response in one situation was 
a definite causal experience, whereas the 
expected response in another situation 
was a definite noncausal experience. Two 
groups were used, adults and 7-year-old 
children, each S taken individually. The 
S was told to watch the little blocks and 
then the experimenter said: “What does 
it look like to you? . . . What do they 
seem to be doing? . . . Tell me what is 
happening.” For each presentation, the 
S took as long as he wished, describing 
what he saw while he was still observing 
the phenomenon. 

Fig. 8-4. Michotte apparatus. The figure on the left shows the disk. The figure on the right 
shows the subject’s view, with the disk now behind the screen. 

The results of this experiment 
demonstrated that stimulus conditions that 
produce in adults only the causal or the 
standard noncausal responses evoke in 
a large percentage of young children 
responses never reported by adults. 
These were of two types: mutual ap- 
proach, in which both objects hit or 
push each other, and passing, in which 
the black rectangle passes the red one, 
the red passes the black, or both pass 
each other. The passing response is of 
special interest, since children clearly 
experience an overlapping of the mov- 
ing objects when physically such an 
overlapping never occurs. One child 
verbalized the passing experience: “One 
runs past the other. The black runs past 
the red. It passes somehow, over or 
under. The red really runs away. It 
looks like the black keeps on going but 
it doesn’t . . . it stops and the red keeps 
on going.” 

The descriptions were classified into 
four groups: (1) causal responses, (2) 
standard noncausal responses, (3) mu- 
tual approach, (4) passing. The dif- 
ferences between children and adults 
were clearly significant. 
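
The text does not say which test was applied to the classified descriptions. One conventional choice for a table of groups by response categories is a chi-square test of independence, and the sketch below assumes that choice. The counts are invented solely to show the arithmetic and are not Olum's data; the function name is hypothetical. With very small expected counts, as here, the chi-square approximation is rough and an exact test might be preferred.

```python
from typing import List


def chi_square_statistic(table: List[List[float]]) -> float:
    """Pearson chi-square for a rows-by-columns table of counts.

    Assumes every row total and column total is greater than zero.
    """
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / grand_total
            stat += (observed - expected) ** 2 / expected
    return stat


# Hypothetical counts, for illustration only.
# Columns: causal, standard noncausal, mutual approach, passing.
counts = [
    [18, 12, 0, 0],  # adults
    [8, 6, 7, 9],    # 7-year-old children
]
stat = chi_square_statistic(counts)
degrees_of_freedom = (len(counts) - 1) * (len(counts[0]) - 1)
CRITICAL_05_DF3 = 7.815  # tabled chi-square value for 3 df at the .05 level
print(round(stat, 3), degrees_of_freedom, stat > CRITICAL_05_DF3)
```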

This method of description of per- 
ceptual experience is particularly valu- 
able in areas of research such as the 
perception of causality, which have not 
previously been studied in children, 
since we cannot assume that children 
will perceive things in the same way as 
adults. It should be pointed out, how- 
ever, that when using this method the 
younger the subject the more concerned 
the experimenter must be with problems 
of validity. The experimenter must be 
certain that rapport is good and 
motivation high, that the child fully 
understands what is required of him, and that 
he has the verbal ability to describe 
his experiences. 

Substitutes for the Direct Method. 
Below the age of verbal communication 
the child is, of course, not able to 
describe his perceptual experience. The 
best that the psychologist can do is to 
examine the child’s behavior as 
carefully as he can in as many situations as 
possible, both in the laboratory (see 
Substitutes for Psychophysical Methods, 
p. 318) and in everyday life, and from 
this behavior to deduce what the child’s 
world must be like. Of course, such 
an approach is essentially a “best guess,” 
the primary criterion of validity being 
that of consistency. Koffka states his 
position: 


How shall we proceed to reconstruct 
the phenomenal world of a newborn 
baby? . . . Our reconstruction of the in- 
fant’s consciousness must “fit” the observed 
events and conduct which constitute its 
“objective” behaviour. That is to say, 
the infant’s consciousness and behaviour 
must fit together in the same way in which 
the phenomenal world of the adult fits his 
objective behaviour (1931, pp. 142-143). 

Another way of deducing the char- 
acteristics of the phenomenal world 
of the preverbal child is to study 
perceptual experiences at as many stages 
of development as possible in an attempt 
to discover whether there are definite 
developmental trends. When this is the 
case, the assumption can be made that 
the variable under consideration has 
evolved in the same direction from in- 
fancy. A picture of the world of the 
infant is arrived at by extrapolation 
from the available data, as the necessary 
prior state of a known developmental 
sequence. 


Piaget’s work is the most striking 
application of these two deductive 
approaches. From repeated observations 
of the behavior of children, both in 
spontaneous situations and in those 
initiated by the experimenter, Piaget 
reconstructs the phenomenal world of 
the child at each developmental stage; 
and looking at the developmental 
stages in sequence, he attempts to 
deduce the nature of the phenomenal 
world of the infant. 

An example of this is Piaget’s study 
of babies’ reactions to the disappearance 
of an object. In the early stages of in- 
fancy the baby acts as if an object had 
no existence when not directly 
perceived. When the object vanishes, the 
baby makes no attempt to look for it. 
From a study of the changes in be- 
havior from this stage to the final one, 
in which the child searches for the 
vanished object, Piaget reconstructed at 
each point what the child’s experience 
of objects must be and deduced from 
the total series that the object as we 
experience it, the “object concept,” is 
achieved only little by little. Initially, 
he said, “The infantile universe is 
formed of pictures that can be 
recognized but that have no substantial 
permanence or spatial organization” 
(1954, p. 4). 


THE THIRD QUESTION 


Our first question concerned the basic 
correspondences between the physical 
world and dimensions of perceptual ex- 
perience, with special reference to de- 
velopment. Now, we are ready to ask 
how such correspondence is affected 
when other factors are introduced. 
“Other” factors include complicating 
stimuli, practice, attention, and affective 
and social factors. Are the basic cor- 
relations made more precise, less precise, 
or are they distorted? How is an ex- 
periment designed to study the role of 
such factors? We can, of course, also 
ask how phenomenal experience (our 
second question) is affected in the same 
experiments. 

The kind of experiments discussed 
under the first question employed the 
classical methods of psychophysics, 
modified as necessary for the age of the 
child. Only one dimension of the stimuli 
to be compared was varied and that 
was the dimension to be judged (length, 
for instance, with the Galton bar). The 
variable was isolated, and constant 
errors inherent in the procedure were 
controlled by balancing them in the 
experimental design. Measures of preci- 
sion (variability or threshold) rather 
than constant errors were sought, and 
the conditions for judgment were 
typically optimal. Such cases are 
referred to now as Type 1 experiments. 

Experiments deriving from question 3 
depart from this classic tradition by 
permitting more than one dimension of 
variation. We shall call these Type 2 
experiments. Many illusion experiments 
are in this group. In measuring the 
Müller-Lyer illusion, the length of the 
horizontal line is variable, as it is with 
the Galton bar. But, together with 
length, there is added a variable of 
form—the direction of the arrowheads 
or wings at the ends of the line. In 
the size-weight illusion the stimuli vary 
not only in weight, which is to be 
judged, but also in size (although no 
size judgment is requested). In these 
experiments it is the constant error that 
is of interest to the experimenter. He 
has compounded two stimulus variables, 
and he is looking for a distortion in the 
dimension judged as compared to the 
simple case in which one dimension is 
isolated. The psychophysical methods 
are still used to obtain judgments, but 
it is departure from the basic corre- 
spondence that has become of interest. 

Constancy experiments fall into this 
classification. Size with distance variant 
(size constancy) implies variation of 
both objective size (to be judged) and 
distance. Whether S tends to underesti- 
mate as the object is placed farther 
away is of interest; if he does, he is 
making a constant error. In the size 
constancy experiment the second 
variable, distance, can be altered 
dimensionally. But sometimes the second 
variable can be altered only categori- 
cally, as is often the case with illusions. 

Some of the so-called “value” ex- 
periments likewise vary the stimuli in 
more than one aspect so as to contain or 
not contain the “value.” In the coin 
experiments, for instance (Bruner and 
Goodman, 1947; Carter and Schooler, 
1949), coins are matched with a neutral 
disk for size. But it is not only size 
that varies, since the coin contains a 
dollar sign or some other designation of 
value. In an experiment by Dukes and 
Bevan (1952) children were asked to 
compare the weights of jars visibly filled 
with candy and jars visibly filled with 
sand. The standard and variable stimulus 
objects could vary, therefore, not only 
in weight but in various stimulus aspects 
associated with their contents. In this 
experiment the constant error was of 
principal interest; the children 
overestimated the weight of the jars of 
candy, compared with jars of sand. But 
the constant method used in this experi- 
ment also permitted measurements of 
precision. The precision of judgment 
was significantly greater when both 
jars were filled with candy than when 
both were filled with sand or one with 
sand and one with candy. Tajfel (1957) 
has pointed out the theoretical im- 
portance of this increase in precision, 
and future experiments of this type 
should be set up to secure data on 
precision as well as constant errors. 
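
The distinction between constant error and precision can be made concrete with a small computation on data of the kind the constant method yields. The sketch below is an assumption-laden illustration, not the analysis used by Dukes and Bevan: it takes the point of subjective equality (PSE) as the 50 per cent point found by linear interpolation, the constant error as PSE minus the standard, and a precision measure (DL) as half the distance between the 25 and 75 per cent points. The weights and proportions are invented for the example, and the function names are hypothetical.

```python
from typing import Dict, Sequence


def crossing(values: Sequence[float], props: Sequence[float],
             target: float) -> float:
    """Stimulus value at which the proportion reaches target (linear interpolation)."""
    points = list(zip(values, props))
    for (x0, p0), (x1, p1) in zip(points, points[1:]):
        if p0 <= target <= p1 and p1 > p0:
            return x0 + (target - p0) * (x1 - x0) / (p1 - p0)
    raise ValueError("target proportion is not bracketed by the data")


def summarize(values: Sequence[float], props: Sequence[float],
              standard: float) -> Dict[str, float]:
    """Constant error (PSE minus standard) and precision (DL) from one series."""
    pse = crossing(values, props, 0.50)
    dl = (crossing(values, props, 0.75) - crossing(values, props, 0.25)) / 2
    return {"pse": pse, "constant_error": pse - standard, "dl": dl}


# Invented data: comparison weights in grams and the proportion of trials
# on which each comparison was judged heavier than a 100-gram standard.
weights = [80, 90, 100, 110, 120]
prop_heavier = [0.10, 0.30, 0.60, 0.85, 0.95]
result = summarize(weights, prop_heavier, standard=100)
print(result)
```

A smaller DL indicates more precise judgment, independently of the size or direction of the constant error; the two measures answer different questions and are both available from the same series of judgments.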

Sometimes these experiments (which 
vary in more than one aspect of the 
stimulus at the same time) are inter- 
preted as showing that a personality 
variable has affected perception, but 
such a conclusion is not justified when 
the “value” is represented in the 
stimulus. It is possible, however, to con- 
trol the stimulus variables so that only 
one is changed while varying some 
factor within the organism. The stand- 
ard and variable differ along one stimu- 
lus dimension, that to be judged, and a 
second variable, such as need (e.g., 
degree of hunger), degree of practice, 
or amount of reinforcement, is in- 
troduced as well. Ideally, both variables 
would be altered dimensionally, but 
there are obvious difficulties in scaling 
variables within the organism. 

An experiment by Beams (1954) 
varied like and dislike by using pictures 
of food that S’s had already rated for 
preference. The same pictures were 
used for all, but preferences for any 
given food pictured varied among the 
S’s. In a test series S looked at the actual 
food object in a stimulus box through a 
peephole. Then he looked at a projected 
color photograph of the object in 
another box and adjusted it (by moving 
the picture back and forth) until he 
judged it to be the same size as the 
object. A majority of S’s made the 
pictures of preferred food objects 
larger than those of nonpreferred ones. 
Here the standard and variable stimulus 
were similar, except for the dimension 
that changed when S moved the 
projected image. 

Lambert, Solomon, and Watson 
(1949) created “value” for poker chips 
by letting children drop them into a slot 
machine in return for a candy reward. 
Before and after this reinforcement, S 
matched the size of the poker chips 
with a luminous disk by adjusting a 
diaphragm. In this case the poker chips 
and the light spot differed in more than 
one dimension (color, etc.), but the 
other stimulus variables were controlled 
by securing the same matches before 
and after reinforcement as well as after 
an extinction series. Since S’s matches 
of the same standard and variable 
showed an increase and then a decrease 
in apparent size with reinforcement and 
withdrawal of it, it can properly be 
inferred that the difference is due to 
the reinforcing conditions. 

The effect of practice on discrimina- 
tion of form and area was investigated 
by Welch (1939) with children 12 
months to 5 years of age. The child was 
trained to choose a standard rectangle 
of given proportions and area by re- 
warding him with candy when it was 
chosen. Then it was paired with other 
rectangles of differing form and area 
that were never rewarded. There were 
32 different rectangles of varying 
dimensions. A rough threshold was 
obtained by using a technique resembling 
Piaget’s clinical concentric procedure. 
Further practice and new tests were 
given an experimental group from 
month to month for varying intervals. 
Practice improved the discrimination, 
for children in the experimental group 
discriminated as well at 27 months as 
those in a control group at 4½ and 5 
years being tested for the first time. 

A third type of experiment might be 
singled out in which standard and 
variable differ in stimulus characteristics 
in many ways that cannot be specified. 
Judging different faces for prettiness 
or intelligence would fall under such a 
classification. Neither variable nor con- 
stant errors are meaningful here, since 
the experimenter does not know exactly 
what he is varying or what an “error” 
would be. He may pick out some 
measurable aspect, such as height of 
forehead, and see if it correlates with 
the dimension judged. But it is always 
possible that some other feature is 
being systematically varied and con- 
sistently responded to, even though 
E does not know what it is. Since 
stimulus dimensions cannot be exactly 
specified, the standard psychophysical 
techniques are not applicable, and a 
rating procedure or categorizing must 
be employed. Elaborate rating methods 
are not appropriate for children, but a 
simple scale with only a few categories 
can generally be used. The number of 
categories or points on a scale that could 
be employed would probably be a func- 
tion of age. 

Some experiments aimed at investigat- 
ing the effects of internal variables on 
perception belong to Type 3 because 
the stimulus objects presented for 
judgment vary in a number of 
indefinable ways and only categorizing or 
rating methods can be employed. A 
relevant example is Murray’s experiment 
(1933) on the “maliciousness” of faces. 
A group of children at a summer camp 
rated pictures of faces for maliciousness. 
Later they played a game of “murder” 
and again performed the ratings. The 
ratings were raised (more “malicious” 
faces were perceived), presumably be- 
cause of the fear aroused. 

Impoverishment Experiments. Cut- 
ting across these three types of experi- 
ments, a number of special techniques 
have been devised for investigating 
the effects of internal variables on per- 
ception. They have been referred to as 
“impoverishment” experiments (Gib- 
son, 1948). They have in common some 
method of reducing the “power” of 
the stimulus and thus enhancing the 
role of the internal variable. 

Short interval presentation is a 
time-honored method of reducing the 
stimulus. Early tachistoscopes,² such as 
Dodge’s, were not especially suitable 
for use with children, but present-day 
instruments, which can be used in 
combination with a projector and a screen, 
make tachistoscopic presentation much 
less cumbersome for E and less 
awkward for S. Tachistoscopic experiments 
with children have been mostly 
devoted to work on reading and have so 
far yielded little information about 
other kinds of perception (Tinker, 
1958). Much more might be done with 
projection tachistoscopes. The 
frequency and word recognition studies 
(Solomon and Postman, 1952) might, 
for instance, be approached 
developmentally with profit. Something 
analogous to a threshold may be 
obtained by gradually increasing exposure 
time by using a sort of method of limits 
and arriving at a criterion time for 
correct recognition. The effect on this 
threshold of internal variables can be 
investigated. 

¹ “Given less than optimal stimulus-conditions, 
certain factors of past experience may 
play a determinative part in perceptual 
organization” (Bruner, Postman, and 
Rodrigues, 1951, p. 227). 

² See Woodworth and Schlosberg, 1954, p. 
o2ff., for a good discussion of tachistoscopes. 
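
The procedure of gradually increasing exposure time until recognition is correct can be sketched as a set of ascending series. The details below are assumptions made for illustration: the exposure steps in milliseconds, the use of the first correct recognition as the criterion, the averaging over series, and the function names are all hypothetical.

```python
from statistics import mean
from typing import Callable, List, Optional, Sequence


def ascending_series(exposures_ms: Sequence[float],
                     recognizes: Callable[[float], bool]) -> Optional[float]:
    """Shortest exposure in one ascending series at which S recognizes the item."""
    for exposure in exposures_ms:
        if recognizes(exposure):
            return exposure
    return None  # never recognized within the range tried


def recognition_threshold(
    exposures_ms: Sequence[float],
    series: List[Callable[[float], bool]],
) -> Optional[float]:
    """Mean criterion time over the series in which recognition was reached."""
    reached = [t for t in (ascending_series(exposures_ms, s) for s in series)
               if t is not None]
    return mean(reached) if reached else None


# Simulated S whose true limit differs slightly from series to series.
exposures = list(range(10, 101, 10))
simulated = [lambda t, limit=limit: t >= limit for limit in (40, 50, 40)]
threshold = recognition_threshold(exposures, simulated)
print(threshold)
```

To study an internal variable, the same routine would be run under each condition (for example, before and after some manipulation) and the resulting criterion times compared.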

A variety of other ways of im- 
poverishing the stimulus has been em- 
ployed with adults, but not all are 
suitable for use with children (ones 
requiring steady fixation, for instance). 
Visual stimulation can be reduced by 
cutting down the contrast between 
figure and ground, by altering focus so 
as to produce blur, by reducing size 
of a figure, or by displaying the material 
at a distance. All of these are familiar 
methods for measuring acuity; a kind 
of threshold measure can be expressed 
in terms of the amount of reduction 
that can be tolerated by the S, yet give 
rise to an accurate recognition. The 
methods could equally well be used 
for studying the effects of internal vari- 
ables. 

A “noisy” background against which 
the relevant stimulus is presented is a 
method suitable for creating auditory 
impoverishment. Recognition of speech 
against a background of noise is dif- 
ficult; thus guessing, which permits the 
influence of factors within the S, is 
encouraged. Skinner’s “verbal summator” 
is such a technique. The S hears 
“[...]al” sounds, such as mumbled, 
[...] vowels, and tries to make sense 
of them. A vague pattern of speech 
[...] at low intensity or against a 
[...] background is recorded on tape 
and repeated until S makes some 
response. Skinner reports that in children 
the responses evoked are “heavily 
determined by the conditions of the 
experiment” (p. 263). Effects of practice 
[...]ally created needs could well 
be studied with this method. 

The “verbal summator” is closely 
related to the projective techniques. These 
techniques do not reduce the 
stimulation, as does the tachistoscope, but 
instead present the S with a stimulus 
configuration that is ambiguous. Figures 
that are not literal representations of 
anything, such as ink blots, or incom- 
plete pictures, such as the Street figures 
(see Fig. 8-5), would be examples. These 
methods yield nothing akin to a thresh- 
old, but more or less objective measure- 
ments are still available: time to give a 
response can be taken; approximation 
to the most probable identification 
(with Street figures, for instance); and 
classification of responses as “whole,” 
“part,” etc. 

Fig. 8-5. An incomplete picture taken from Street’s Gestalt Completion Test. 

Other kinds of ambiguous stimulation 
with more fidelity of representation 
are hidden figures and reversible figures. 
The Gottschaldt figures are examples of 
hidden or camouflaged figures (see 
Fig. 8-6). 

Fig. 8-6. An example of hidden figures used by Gottschaldt (from W. D. Ellis, A Source 
Book of Gestalt Psychology, Harcourt, Brace, 1938). 

Reversible figures are those which 
place two representations in conflict. 
The staircase figure and the Necker 
cube are well-known examples. The 
effects on report of suggestion, past 
experience, reinforcement, etc., can be 
studied with them. Solley and Sommer 
(1957) performed an experiment of this 
type with an ambiguous figure, which 
could be seen as either of two faces 
whose profiles joined. Children were 
shown the two faces separately, with a 
¥% second exposure, five times. Half the 
children were given nickels whenever 
one was presented, half when the other 
was. Afterward the faces were exposed 
together in the ambiguous figure twenty 
times, and S was asked to name which 
he saw. The children named the 
rewarded profile more often. Other 
children looked at the test figure until they 
reported both figures, and then were 
asked which looked biggest, closest, etc. 
The rewarded profile was said to be 
[...]siest, brightest, and closest (not 
[...]). When naming is used as an 
indicator of S’s percept, as it is here, 
it is open to the obvious objection that 
reward may have affected merely the 
naming habit. The reports of 
brighter, nearer, etc., under 
continuous exposure were intended to 
provide a second criterion to meet this 
objection. 

Ames’ rotating trapezoidal window, 
a three-dimensional conflicting stimu- 
lus, has been used by Allport and 
Pettigrew (1957) to compare children 
of different cultures. A phenomenologi- 
cal report of what the children actually 
“saw” was sought, and the children 
used their hands, as well as speech, to 
indicate whether they saw a swaying 
rectangular window or a rotating trape- 
zoid. The rotating trapezoid was pre- 
sented at different distances and under 
both monocular and binocular condi- 
tions to urban children (white Eu- 
ropean and Zulu) and to rural children 
(Zulu). Under monocular conditions 
and at the greatest distance (20 feet) 
there was no difference between the 
groups; all tended to see the “illusion,” 
even if they did not recognize the frame 
as a window. Under binocular 
conditions and close up (10 feet) the urban 
children reported the illusion (the 
window) more often than the rural 
children. The authors attribute this 
difference to more frequent experience 
with windows and straight lines. 

The methodological problems in 
studying the effect of internal variables 
on perception are many. First, there is 
the difficulty of controlling and then 
of scaling the internal variable. Hunger, 
thirst, and, to a lesser extent, anger and 
fear are possible to manipulate. 
Interests, likes, and dislikes can be rated in 
various ways, depending on the experi- 
menter’s ingenuity. But some control 
of the internal variable is essential if it 
is to be isolated and its effect studied. 
To scale one of these variables and alter 
it dimensionally in an experiment is still 
more difficult; most experimenters have 
been satisfied with categories, such as 
like and dislike. It is often assumed that 
a relevant experience has or has not 
taken place (e.g., absence of straight 
lines in the environment of rural Zulus); 
but it is obviously desirable to “build” 
the experience as part of the experi- 
ment. 

Control of other variables than the 
one presumably being studied should be 
mentioned again. If the relevant vari- 
able is to be a “personality” variable, 
then it should not be accompanied by
an alteration in the stimulus in such a 
way as to confound the two. This is 
a simple axiom of experimental design, 
but it is often violated. 

The design of an experiment, in 
which a stimulus dimension or cate- 
gory is varied for measurement, as well 
as an internal variable, might well be 
set up so that interactions of the vari- 
ables, and the contribution of each, can 
be measured. An experiment by Gil-
christ and Nesberg (1952) varied need,
degree of need, and relevance of stimu- 
lus and was designed in such a way 
that the triple interaction could be 
tested. Since interactions were signifi- 
cant, the example seems valuable. 
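
The logic of such a design can be sketched in a few lines. The
following is a minimal illustration, assuming a simplified 2 x 2
layout (one internal variable, one stimulus variable) rather than
the three-way design of Gilchrist and Nesberg. The cell means and
all names are hypothetical and are not data from any study cited
here.

```python
# Hypothetical cell means for a 2 x 2 layout; the numbers are invented
# for illustration and are not taken from any study cited in the text.
# First key: level of the internal variable (need).
# Second key: level of the stimulus variable (relevance).
CELL_MEANS = {
    ("low_need", "irrelevant"): 10.0,
    ("low_need", "relevant"): 11.0,
    ("high_need", "irrelevant"): 10.5,
    ("high_need", "relevant"): 14.5,
}


def need_effect(m):
    """Mean of the high-need cells minus mean of the low-need cells."""
    high = (m[("high_need", "irrelevant")] + m[("high_need", "relevant")]) / 2
    low = (m[("low_need", "irrelevant")] + m[("low_need", "relevant")]) / 2
    return high - low


def relevance_effect(m):
    """Mean of the relevant cells minus mean of the irrelevant cells."""
    relevant = (m[("low_need", "relevant")] + m[("high_need", "relevant")]) / 2
    irrelevant = (m[("low_need", "irrelevant")] + m[("high_need", "irrelevant")]) / 2
    return relevant - irrelevant


def interaction(m):
    """Difference of differences: the relevance effect under high need
    minus the relevance effect under low need."""
    under_high = m[("high_need", "relevant")] - m[("high_need", "irrelevant")]
    under_low = m[("low_need", "relevant")] - m[("low_need", "irrelevant")]
    return under_high - under_low


if __name__ == "__main__":
    print(need_effect(CELL_MEANS), relevance_effect(CELL_MEANS), interaction(CELL_MEANS))
```

A nonzero difference of differences is what a test of the
interaction evaluates. A design that varies only one of the two
variables cannot yield it.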

Finally, validity—the relevance of 
the behavioral criterion chosen—should 
be mentioned. If S overestimates the 
size of poker chips when they are ex- 
changeable for candy, how do we 
interpret the overestimation? A very in- 
teresting case among the experiments 
considered is the Beams experiment 
already described. Pictures of both 
liked and disliked food-objects were 
presented by means of a slide projector, 
and S was asked to adjust the size of 
the picture by moving the slide nearer 
or farther away to match the size of 
the real object. The pictures of liked 
foods were overestimated as compared 
with disliked ones. Was the judgment 
really one of size? When a picture is 
enlarged or reduced, S normally sees
not a change in size but a change in the 
distance of the object. Perhaps these 
S’s were unwittingly trying to move 
the liked foods closer and push the un- 
liked foods away. Interpretation of the 
results of these experiments demands a 
careful analysis of what S is really per- 
ceiving as he makes his adjustment. The 
experiment is not worth the trouble if 
the criterion measure is not a valid in- 
dicator of the variable that E hopes he 
is isolating. An experiment must be in- 
terpreted as well as performed. 

The effect of complicating variables 
on the perception of infants is obvi- 
ously especially difficult to study, since 
measurement of discrimination under
optimal conditions of stimulus presentation
is difficult enough with them. It
would seem that such questions might
best be postponed until many experimental
studies of the development of
discrimination have been made available.
The effect of deprivation and enrichment
of environment might be an
exception here and is discussed later.


Problems of Perception 


DISCRIMINATION 
OF SENSORY QUALITIES 

To what extent the child can respond 
differentially to physical dimensions of
stimulation is a problem in the realm of
psychophysics. The methods have been
described, in general terms, under our
first question. But a few cases deserve
further mention, either because there 
has been special interest in the problem 
or because available research is of 
methodological value. It is important 
to keep in mind that the method used 
and the difficulty of the task required 
will affect the results, as Peters (1927) 
concluded in his review of develop- 
mental studies of this type. 

Color. Color discrimination in infants 
and children has received much study, 
and the research is instructive because 
of the variety of methods that have 
been tried. Evidence for discrimination 
of the four primary hues was found by 
Staples (1932), using a simple fixation- 
preference method, in very young in- 
fants (see p. 319). Chase (1937), using 
as a criterion pursuit movements of the 
eyes in following a spot differing from 
its background only in hue, also found 
evidence of early discrimination of hues 
at well-separated points of the spectrum.
These studies, although indicating
presence of some color discrimination,
yielded little information about
exact sensitivity. A differential conditioning
technique (Figurin and Denisova
ran, 1933) also gave evidence of
early discrimination of primary colors,
but it is likewise not very useful for
threshold determinations.

When a child is old enough to use
a matching procedure, the number of
variable stimuli can be increased easily,
but the task is more difficult, the
younger the child. Cook (1931) found
that children of 2 years could match 
hues (red, green, blue, and yellow) that 
varied in saturation and brightness with 
45 per cent accuracy, whereas successes 
increased to 97 per cent at 6 years. The 
improvement may or may not be at- 
tributable to an increase in differential 
sensitivity. Matching for brightness and 
saturation was less successful than 
matching for hue. Gilbert (1894) in- 
vestigated discrimination of saturation 
by a matching technique in school chil- 
dren, 6 to 17. Cloth of different satura- 
tions with red as the hue was used. The 
least saturated item, the standard, was 
compared with ten variables. The child 
was told to pick all the shades of red 
that were exactly like the standard. The 
number of shades chosen as the same 
decreased steadily as age increased. At 
6 years 57 per cent of the children did 
not discriminate at all. At that time the 
median number of shades picked as be- 
ing exactly alike was 9.6, but the num- 
ber decreased to 3.9 at 17 years. A 
somewhat similar procedure was used 
by the Heiders (1940) with deaf and 
hearing children of different ages. They 
also found that the younger children
selected matches over a wider range 
than the older ones. 

Discrimination of colors by naming 
develops even later than discrimination 
by matching and is only 62 per cent 
accurate (for red, green, blue, and yel- 
low) at 6 years (Cook, 1931). Koffka 
(1931, p. 280f.) has discussed methods 
of studying color discrimination and 
has related differences in the results to 
phenomenological descriptions of the 
tasks required by the different methods. 

Prepared color-blindness tests, such 
as the Ishihara and Dvorine, which de- 
mand tracing of a pattern differenti- 
ated only by hue, have also been used 
with children (Synolds and Pronko, 
1949). Since they are aimed at detect- 
ing abnormalities, rather than studying 
development of discrimination, they are 
less relevant here. 

A more accurate technique, such as 
a constant method or a method of limits, 
can be used with older children. One 
such study by Jones is reported in 
Peters (1927). Children 4 to 14 and 
adults were the subjects. A sector 
in a gray disk of a color wheel was 
increased gradually until the color was 
named correctly. Sensitivity increased 
markedly; adults, with a method of 
limits, were 118 per cent more sensitive 
than 6-year-olds. 
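
An ascending series of the kind just described can be sketched as
follows. This is a minimal illustration only: the observer is a
hypothetical stand-in who names the color correctly once the
colored sector reaches 12 degrees, and the step size, starting
values, and function names are assumptions, not details of the
Jones study.

```python
def ascending_series(names_correctly, start, step, maximum):
    """Increase the stimulus from `start` in equal steps until the
    response is correct. Returns the stopping value, or None if the
    maximum is passed without a correct response."""
    value = start
    while value <= maximum:
        if names_correctly(value):
            return value
        value += step
    return None


def estimate_threshold(names_correctly, starts, step, maximum):
    """Average the stopping values of several ascending series that
    begin at different starting values."""
    stops = [ascending_series(names_correctly, s, step, maximum) for s in starts]
    stops = [v for v in stops if v is not None]
    return sum(stops) / len(stops) if stops else None


def hypothetical_observer(sector_degrees):
    """Invented observer: correct whenever the sector is 12 degrees or more."""
    return sector_degrees >= 12


if __name__ == "__main__":
    print(estimate_threshold(hypothetical_observer, starts=[0, 2, 4], step=5, maximum=360))
```

Because each series stops at the first step that reaches or passes
the point of correct naming, the estimate depends on the step size
and on where each series begins.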

Pitch. Pitch discrimination, like color, 
has always been of interest, particularly 
because of its importance in musical 
appreciation and production. When the 
task is difficult, e.g., a constant method 
with very small differences between the 
tones to be compared, discrimination 
typically improves with age (Peters, 
1927; Werner, 1948, p. 101f.). Gilbert
(1893), using a method of limits combined
with a same-different judgment,
likewise found that sensitivity increased
rapidly from 6 to 10 years. Whether
the improvement is attributable to practice,
to superior comprehension and attention
to the task, or to some aspect
of physical maturation or to all of these 
is still a problem. Wyatt (1945) has 
shown that improvement can be the 
result of training. 

Weight. Gilbert (1894) investigated 
differential sensitivity to lifted weights 
in school children by using the method 
he employed with saturation. The S 
was given a standard weight and asked 
to choose from a group of comparison 
weights those that were exactly the 
same. The size of the group judged to 
be the same decreased with age, and 
the measures of differential sensitivity 
showed an increase (sensitivity was 2½
times greater at 17 years than at 6). 
The absolute amount of improvement 
in such studies cannot be taken as mean- 
ingful, for it can be shown to be a 
function of such factors as the scale- 
steps of the group of variable stimuli. 
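
The dependence on scale-steps can be made concrete with a small
sketch. It assumes a deliberately simple model in which a
comparison stimulus is judged "the same" whenever it differs from
the standard by no more than the observer's difference limen. The
limen values and step sizes are hypothetical, chosen only to show
the effect.

```python
def count_judged_same(limen, step, n_comparisons=10):
    """Number of comparison stimuli judged equal to the standard.

    The comparisons lie at step, 2*step, ..., n_comparisons*step above
    the standard. A comparison is judged "the same" when its distance
    from the standard does not exceed the limen (a simplifying assumption).
    """
    return sum(1 for k in range(1, n_comparisons + 1) if k * step <= limen)


if __name__ == "__main__":
    young_limen, older_limen = 9, 3  # hypothetical values, arbitrary units
    for step in (1, 2, 4):
        young = count_judged_same(young_limen, step)
        older = count_judged_same(older_limen, step)
        print(step, young, older, young - older)
```

With the same two observers, the apparent age difference in the
number of stimuli chosen shrinks as the steps grow coarser, which
is why the absolute amount of improvement is not meaningful by
itself.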

A curious contrast to the typical 
finding of increased “veridicality” of 
perception with development is the 
judgment of weight in the size-weight 
illusion. When information is coming 
from more than one channel (visual and 
tactual-kinesthetic here), the predic- 
tion of performance in relation to age 
may be difficult. Sensitivity to a more 
complex, intersensory variable (specific 
gravity or density in this case) appar- 
ently increases; but if only the weight 
judgment is measured, it appears that
veridicality decreases. Jenkin and West
(1958) have shown that mental defectives
are less susceptible to the size-weight
illusion than a normal control
group. But, as they point out, it would
be wrong to conclude that the defective
group is perceiving more accurately.
They are failing to discriminate a more
complex stimulus variable (density),
which is a synthesis of information
from several senses. The importance of
analyzing the information in the stimuli
presented is brought out rather
strikingly in this example.

Skin Senses. Friedmann (1927) found 
that the two point limen, using a con- 
stant method, decreased with age; but 
the curve of increased sensitivity was 
different with small separations than 
with larger ones, again showing that 
results with a constant method may be 
dependent on the step-size between the 
comparison stimuli. Developmental dif- 
ferences were enhanced with smaller 
differences between standard and vari- 
able. 




SPACE PERCEPTION 


Independent measurements of dis- 
crimination of single dimensions of 
space, such as distance and length, are 
very much complicated by the fact 
that spatial dimensions are naturally 
interdependent. Distance is a “cue” to 
the real size of an object; but size is 
also a “cue” to distance, or so it has 
been asserted. This seeming paradox has 
been responsible for much confusion 
and many experiments on “constancy.” 
The separate dimensions are often ab- 
stracted from their background and 
studied separately. 

Visual Size. Sensitivity to length 
(linear size) has been studied in children
of 6 years on; in one study it was
found to increase, but in another it did
not appear to change significantly
(Lobsien, and Giering, reported in
Peters, 1927). Welch (1939a) studied
discrimination of visually perceived
area (two-dimensional) in younger
children (12 to 60 months) and found
improvement with age. A conditioning
method was used (see p. 321). Welch
employed parallelograms varying in
form (from a square
to a thin rectangle) as well as in size.
An attempt to separate the form from
the size aspect of the stimuli seemed to
indicate that fine discriminations of
form came later than size. Some form
difference was discriminated very early,
and the two dimensions are hardly
commensurable.

In another experiment Welch (1939b) 
used boxes of different areas (three- 
dimensional stimuli), under one of 
which a piece of candy was hidden. 
There were nine sizes. Ten children of 
22 to 32 months, without special prac- 
tice, differentiated the area significantly 
better than a younger group (16 months 
or under). The discrimination of size
of solid objects may be easier than of 
plane figures. 

Depth and Distance. Updegraff 
(1930) studied visual perception of 
distance in 4-year-olds and adults. The 
main experiment took place in a dark 
room 10 meters long. Relative judg-
ments (nearer, farther) were obtained
by using a constant method. The mark- 
ers were illuminated circles, which 
moved on an invisible track. Five com- 
parison distances (two farther, two 
nearer, one the same) were presented 
with the standard distance, the range 
predetermined for each S. The standard
and variable distances were presented 
simultaneously. Nine series were run 
(monocular, binocular, size of marker 
varied, displacement of marker to the 
side or downward, etc.). The discrimi- 
native ability was much alike for chil- 
dren and adults. There was no differ- 
ence in accuracy between the binocu- 
lar and monocular series. Another ex- 
periment took place outdoors over a 
long range. The ground was a pasture 
with a slight rise. The stimuli were 
large beaverboard disks fastened to 
poles 2.5 meters high. A pair of stimuli 
was presented, and S was asked to indi- 
cate the nearer. The disks were 3.31 
meters apart. The standard distance was 
0.195 meter. The children were asked 
“to put up the hand in front of the one 
that is nearer to you.” The children 
could discriminate correctly 6 meters 
on either side of the standard. Visual 
angle was said to be the important cue 
here. Judgment of the height of the 
tops of the disks undoubtedly played 
a big role. Experiments of this type 
might better be done with the stimuli 
on the ground, rather than raised, and 
separated horizontally by a wider 
angle. 

Stereoscopic vision in preschool chil- 
dren (2 to 6 years) was studied by 
Johnson and Beck (1941). Two polar- 
ized images were projected on a screen. 
The child wore polarized spectacles. 
The stereoscopic slide was a color pho- 
tograph of a doll. Disparity was such 
that an adult would localize the doll 
midway between himself and the 
screen. The child was invited to touch 
the doll. His localization was noted by 
measuring his reach against a scale 
ruled on the floor. Then one image was
blocked out, and the child was again 
allowed to “touch.” In this case the 
children touched the screen. With 
stereoscopic projection, they reached 
10 to 12 inches in front of the screen. 
Furthermore, many reported that the 
doll seemed to move back and grow 
larger when one image was withdrawn.
The authors concluded that 2-year-old 
children have well-developed stereo- 
scopic vision. 

Visual depth discrimination in 36 infants
6½ to 14 months old has recently
been studied by Walk and Gibson
(1959) by a technique called the “visual
cliff.” The infant was placed in crawling
position on a board in the center
of a large table. The surface of the table
was heavy plate glass, and a low wall
surrounded it. Directly under the glass
on one side was a patterned linoleum
surface. Four feet below the glass on
the other side was the same linoleum
surface. The child’s mother stood alternately
on one side and then the other
for two 2-minute intervals, twirling a
toy and urging the infant to come to
her (Fig. 8-7). When the mother beckoned
at the side with a floor directly
under the glass, 75 per cent of the
babies left the board and crawled to
her. The others stayed on the
board. When she beckoned at the side
with the floor 4 feet below the glass,
only 8 per cent of the infants went to
her, 62 per cent stayed on the board,
and 30 per cent actually crawled off
in the opposite direction. It was concluded
that by the time locomotion is
possible human infants can discriminate
“drop-off” visually.

Fig. 8-7. A “visual cliff” used to test perception of depth in infants (photo by Cornell
University).

Discrimination of Tilt. Reproduction of
visually perceived vertical and
horizontal lines has been studied by
Wursten with children (see p. 316).


More elaborate research on apparent
verticality as affected by body tilt and
starting position has been performed by
Witkin (1954) and by Wapner and
Werner (1957), using a method of adjustment.
Wapner and Werner's subjects
(6 to 19 years) sat in a tilt chair
in a dark room. Their task was to adjust
a luminous rod so that it appeared
vertical. The rod was adjusted by E,
at S’s instruction, at first in steps of 2 degrees
and finally in smaller ones. There
were 12 different conditions: three conditions
of body tilt (erect, 30 degrees
left, 30 degrees right), in combination
with four starting positions of the rod
(10 degrees left, 30 degrees left, 10 degrees
right, and 30 degrees right). There
was a reversal of constant errors in re- 
lation to age. For younger S’s, the posi- 
tion of the apparent vertical was tilted 
toward the same side as body tilt; for 
older S’s it was tilted opposite the body 
tilt. 
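
The crossing of body tilt with starting position, and the scoring
of the direction of the constant error, can be sketched as follows.
The sign convention (negative for left, positive for right) and
all names are assumptions made for this illustration; they are not
taken from Wapner and Werner's report.

```python
from itertools import product

# Assumed sign convention: negative = left, positive = right, in degrees.
BODY_TILTS = (0, -30, 30)          # erect, 30 left, 30 right
ROD_STARTS = (-10, -30, 10, 30)    # starting positions of the rod

# Every body tilt is paired with every starting position.
CONDITIONS = list(product(BODY_TILTS, ROD_STARTS))


def error_side(body_tilt, apparent_vertical):
    """Classify the constant error relative to the direction of body tilt.

    `apparent_vertical` is the signed deviation of the adjusted rod from
    true vertical. Returns "same side", "opposite side", or "none" when
    the body is erect or the setting shows no deviation.
    """
    if body_tilt == 0 or apparent_vertical == 0:
        return "none"
    same = (body_tilt > 0) == (apparent_vertical > 0)
    return "same side" if same else "opposite side"


if __name__ == "__main__":
    print(len(CONDITIONS))
    print(error_side(-30, -2.0), error_side(30, -1.5))
```

Scoring each setting in this way makes the reported age reversal a
matter of counting: "same side" errors for the younger subjects,
"opposite side" errors for the older ones.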

Tactual Space. Investigations of 
space perception, other than visual, are 
rare in children. Renshaw (1930) has 
studied localization on the skin in chil- 
dren and adults. A 30-millimeter square 
area on the back of the hand or the 
forearm was marked with 30 spots. 
Stimulation of one of these was fol- 
lowed by S’s attempt at localizing it 
with a pointed wooden stick. The S
was blindfolded and given no knowl- 
edge of results. The children (8 to 11 
years) were superior to the adults both 
before and after practice. Practice ef-
fects were large. These results are
unusual, especially since the 2-point 
limen apparently shows a growth curve 
of developing sensitivity. Renshaw, 
Wherry, and Newlin (1930) tested a
congenitally blind group of children 
and adults and found improvement in 
localizing with age. They concluded 
that the seeing individual shifts with 
age from tactual to visual localization, 
thus explaining the children’s superi- 
ority in the first experiment. 

Tactual perception of form was 
studied by Benton and Schultz (1949) 
in 3-, 4-, and 5-year-old children by an 
identification method. Familiar objects, 
such as a ball, fork, or pencil, were first 
shown to the child, and he was asked 
to name them. Then the objects were 
put in the child’s hand inside a screened 
box, and the child tried to identify the 
object by name. The ability to identify 
objects in this manner increased with 
age but was nearly perfect in most 
5-year-olds. 


FORM PERCEPTION 


Form perception is generally treated 
as a separate topic, but it must be re- 
membered that the dimensions of form 
are spatial. Although form has most fre- 
quently been studied in two-dimensional 
drawings, the forms of natural objects 
are three-dimensional and are perceived 
in a three-dimensional space. It cannot 
be assumed that findings in the first case 
can simply be carried over to the other. 
Methods of measuring form discrimina-
tion have included drawing, form-
boards, discrimination learning, match-
ing a sample, and recognition. Drawings
are easy to obtain but have some ob- 
vious defects as a method. They are 
affected by motor skill and are difficult 
to score objectively. Form boards also 
require some manipulatory skill. Skeels 
(1933), using a conditioning technique, 
found that differently shaped blocks 
could be discriminated by children 15 
to 46 months old before the children 
could perform adequately with a form- 
board. She concealed a reward under 
one of a number of differently shaped 
blocks (cubes, spheres, crosses, etc.), 
and the child learned to choose the 
block that was consistently paired with 
the reward. Ling (1941) found that in- 
fants between 6 and 15 months could 
learn to choose one of two blocks dif- 
fering in shape when the “correct” one 
was sweetened and the other fastened 
to the board on which it was presented. 
Change in spatial orientation and size 
of the blocks had little effect on the 
discrimination, suggesting that form 
per se was discriminated. Difficulty was 
increased by increasing the number of 
blocks presented. It should be noted 
that these studies made use of three- 
dimensional shapes—i.e., objects. Dis- 
crimination of two-dimensional shapes 
or drawings is a different and harder 
task. Stevenson and McBee (1958) gave 
children aged 3.4 to 6.8 years a pair 
of stereometric objects, a pair of plano- 
metric objects, or a pair of painted pat- 
terns to discriminate by a rewarded- 
choice method. Discrimination was 
faster for stereometric objects. 

Transposability. Whether discrimination
of one form from another is discrimination
of form per se or is specific
to the particular background and the
identical stimulus conditions has been
the subject of much discussion and is
related to the “transposability” problem.
Munn and Stiening (1931) investigated
the influence of background and
the discrimination of form per se in a
15-month-old child. A two-choice situation
with a reward concealed behind
one of two doors was used. The patterns
(cross, square, diamond, etc.)
were placed on the doors. After learning
to discriminate two patterns, the
child still responded to a positive stimulus
with 45 or 90 degrees’ rotation, with
changes in background, and with different
negative stimuli.

Gellermann (1933a,b) has a discus- 
sion of criteria for the discrimination 
of form. He maintains that a learned 
discrimination of a triangle, say, must 
persist when the form is rotated, be 
independent of size, generalize to other 
types of triangle and to outline figures, 
and be independent of background. He 
found generalization of these types 
with 2-year-old children and with 
chimpanzees by using a choice situa- 
tion with food reward. He found, as 
well, keen sensitivity to small changes 
in shape when he varied the positive 
and negative stimulus by small steps so 
as to approach one another in form. 
The inference was that the subjects 
were discriminating form as such and 
that the response to form was trans- 
posable. The children were superior to 
the apes, perhaps because they could 
verbalize to some extent. In the course 
of mastering a difficult discrimination, 
it was noted that all the S’s at one time 
or another traced the outline of the
forms with a finger. 

Transposition along a dimension, 
such as brightness or size, is a related 
problem, studied first by Köhler (1918). 
He demonstrated transposition of 
brightness in a 3-year-old child. An 
experiment by Hunter (1952) may 
serve as an example. The subjects were 
children (23 months and older). The
stimulus objects were square hollow 
wooden blocks varying in area. The 
child was first taught to lift a block to 
secure a currant hidden beneath it. He 
was then trained to choose the larger 
(or smaller) of a pair and later tested 
with other pairs of different absolute 
sizes. Transposition is generally the re- 
sult in such experiments, but its inter- 
pretation is a matter of argument (see 
Chapter 9 on learning). The relation- 
ship of language to transposition along 
a dimension, or to a central position 
along a dimension (the “middle-size” 
problem), has been the subject of con- 
siderable experiment (Kuenne, 1946; 
Spiker and Terrell, 1955). 
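
What a transposition test distinguishes can be sketched in a few
lines. The sketch contrasts two predictions for a test pair: a
relational choice (the larger or smaller member) and an absolute
choice (the member closest in size to the block rewarded in
training). The block sizes and function name are hypothetical, and
the "closest in size" rule is only one simple way of stating the
absolute alternative.

```python
def predicted_choices(trained_positive, test_pair, relation="larger"):
    """Return (relational_choice, absolute_choice) for a test pair.

    trained_positive: size of the block rewarded during training.
    test_pair: sizes of the two blocks offered on the test trial.
    relation: "larger" or "smaller", the relation rewarded in training.
    """
    pick = max if relation == "larger" else min
    relational = pick(test_pair)
    # Absolute prediction: the test block nearest in size to the trained one.
    absolute = min(test_pair, key=lambda size: abs(size - trained_positive))
    return relational, absolute


if __name__ == "__main__":
    # Hypothetical sizes: trained on (4, 6) with 6 rewarded, tested on (6, 9).
    print(predicted_choices(6, (6, 9), relation="larger"))
```

The test is informative only when the two predictions differ, as
they do when the formerly rewarded block reappears as the smaller
member of the new pair.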

Recognition of Inversion. Discrimi- 
nation of changing orientations of a 
plane form is of great importance when 
the child begins to read. That plane 
figures on paper can be identified by 
children 3 to 5 years old despite chang- 
ing orientation has been shown by vari- 
ous methods, such as drawing and rec- 
ognition by matching (Rice, 1930; 
Newhall, 1937). This is, indeed, a case 
of “transposition” similar to that just 
discussed. It is not so clear that the 
child cannot discriminate a change in 
orientation. In these two studies (Rice 
and Newhall) the ability appeared to 
be a function of age, as it did in a study 
by Davidson (1935). He studied dis-
crimination of the letter-shapes b, d,
p, and q in kindergarten and first-grade 
children by a matching method. He 
found four stages in development of 
the discrimination, beginning with the 
confusion of all four letters. The chil- 
dren identified d and b as the same 
letter in different orientations before 
they identified them as different letters. 

Two other studies point in a some- 
what different direction. Hanfmann 
(1933) had children 4 to 7 years old 
copy triangles in varying orientations 
and found that predominance of verti- 
cal and horizontal orientations was 
already strong at 4 years. Shifts and 
reversals occurred in copying because 
the child changed the figure to a “natu- 
ral” position. The implication was that 
transposability was present and also that 
change of position on the child’s part 
was motivated. 

Hunton (1955) showed children 22 
months to 14 years old pictures of ob- 
jects, such as people and trees, in dif- 
ferent orientations and recorded their 
verbal comments. She reported that as 
early as 2 years children recognized 
pictures as disoriented. Head move- 
ments often occurred with recognition 
of inversion. It is possible that the iden- 
tification of originally meaningless ab- 
stract forms, as in the reading process, 
may develop differently from identifi- 
cation of the forms of natural ob- 
jects. Experimental materials might be 
planned with this distinction in mind. 

Acuity. A special problem related to 
form perception is the measurement of 
acuity. The usual Snellen chart has been 
modified in various ways for children. 
One, the Bailey visual perception
test, is described in Peckham (1933) and
involves matching forms on a chart 
placed at a standard distance. The child 
is given a tray of cutout figures and 
asked to match one indicated on the 
chart. Sizes of figures asked for are 
progressively reduced. The group of 
cutouts from which the child is to 
choose can be reduced to make the 
task less difficult for younger children. 

A method similar to the E chart was 
used by Cohn (reported in Peters, 
1927) with 50,000 school children. He 
used an E turned to face North, South, 
East, or West at different distances 
from the child, beginning at 20 meters. 
The child had a similar E in his hand, 
which he turned to match. Cohn re- 
ported a slow rise in acuity up to 13 
years. Measurement of acuity with 
Landolt rings presents no difficulties by 
the time a child reaches kindergarten 
and perhaps before. 

Gestalt Factors in Form Perception. 
The Gestalt psychologists have empha- 
sized the importance of the figure- 
ground relationship in form perception. 
An early study by Schroff is reported 
in Meili (1931). Schroff presented chil- 
dren with line drawings of rows of cups 
or teapots. But instead of perceiving 
cups or teapots, the children reported 
perception of forms that corresponded 
to the intervals between the figures. In 
other words, they saw as a figure that 
which was intended as ground. 

More recently Meister (1949) com- 
pared the figure-ground discrimination 
of preschool children with that of 
adults, using eight series of cards (each 
series composed of seven cards) differ- 
ing in respect either to figure or to 
ground. In each of the first four series 
the size of the figures used varied,
whereas the ground (diagonal, vertical, 
or horizontal lines) was kept constant. 


In each of the last four series the figure 
was constant, and the background lines 
varied in density. 

After a preliminary practice series
(on other cards), a test card was presented
for 0.5 second and then removed.
The S was asked to pick the card he
had just seen from the series before
him. In the preliminary series the test
card was actually present in the choice
display, but in the experimental series 
it was absent. The adults were told to 
choose a card “most like” the test card 
(which was to them obviously absent), 
but this instruction was omitted for 
children on the grounds that it was too 
complex. The author felt that this de- 
ception did not affect the results. (The 
desirability of keeping instructions 
comparable when two groups are to be 
compared has been pointed out before.) 
Adults chose a figure either slightly 
larger or slightly smaller as most like 
the test card, whereas children chose a 
larger figure than that on the test card. 
Adults chose very dense grounds, and 
children tended in this direction but 
not so strongly. Meister concluded that 
children and adults differ in their per- 
ception of figure and ground. 

The problem of whether children 
perceive globally, that is, the whole at 
the expense of the part, or analytically, 
the parts or details at the expense of 
the whole, is a controversy of long 
standing. In some situations children 
seem unable to differentiate the parts 
of a perceptual whole, and in oth- 
ers they apparently perceive details 
when an adult would have a unified 
percept. Meili in 1931 summarized the
research up to that time. 

In an experiment of Heiss and Sander 
(reported in Meili), children 3 to 19 
years old had to complete a figure that 
lacked a piece by finding the missing 
piece either in a random collection of 
pieces or embedded in another figure. 
When the piece was embedded, the 
task took longer for subjects of all ages, 
but the time was much greater for 
young children than for adults. A
study, using Gottschaldt figures (see p.
330), was done by Witkin (1950) with 
adults and 10-year-olds. The task was 
to locate a simple figure embedded in 
a complex figure. In each trial first the 
complex, then the simple, and then the 
complex figure was presented, with in- 
structions to locate the simple figure 
in the complex one. Children had much 
greater difficulty than adults and often 
had to trace the simple figure in order 
to find it in the complex one. It took 
the children much longer. 

Crudden (1941), working with children
5½ to 6½ years old, first trained
the child to select one of a pair of simple
figures in a standard learning situation.
The two figures were drawn on
cards and set in a large screen in such 
a way that when the child pushed the 
correct one a bell, chime, or siren 
sounded (“an auditory reward”). Dis- 
crimination by size alone was prevented 
by altering the relative size during the 
learning period. When the S had made 
five correct successive responses, he was 
told that the E was going to hide the 
pictures by drawing lines around them 
and that he had to find the one that 
rang the bell, i.e., the figure to which 
he had been trained. Then the child was 
presented with complex pairs of figures
in which the training figures were em- 
bedded. Crudden compared the ease of 
finding embedded symmetrical or 
asymmetrical figures and found that an 
asymmetrical figure was much more 
difficult for a young child to find than 
a corresponding symmetrical figure, 
although individual differences were 
very great. 

The most recent experiment of this 
type is Ghent’s (1956) with children 
between the ages of 4 and 13. She used 
overlapping line drawings of realistic 
and geometric figures, as well as em- 
bedded figures, and asked the child to 
tell her what he saw in the picture. 
When the child pointed to an object, 
he was asked to trace it. In the case of 
the overlapping geometric figures, the 
S was asked to pick the form he saw 
from a display of various forms. Ghent 
found that children had difficulty per- 
ceiving figures that were not clearly 
set apart from each other, especially 
when they shared a contour, as in the 
case of embedded figures; the younger 
the child, the greater the difficulty. 

A somewhat different approach to 
the problem of embeddedness was 
taken by Leuba (1940). In his main 
experiment nursery-school children 1½
to 5½ years old were seated about 10
feet away from an arrangement of 
small round pillboxes. Each child was 
given a small piece of chocolate to eat 
and shown one of the pillboxes, which 
he practiced opening and shutting. The 
child then watched while the E arranged
the boxes (in a circle, square,
or triangle) and placed a piece of 
chocolate in one of them. The child 
was told that he could have the chocolate
if, after a minute, he could pick
up the box it was in without touching 
any of the other boxes. Then for 1 
minute a screen was placed between 
the child and the pattern, and the child’s 
attention was diverted. The children 
were unable, in their first choice, to 
find the correct box when it was a 
part of a configuration. When the box
was placed just outside, but touching, 
the configuration, the choices were 
almost always correct. Leuba asserted, 
however, that the children did not com- 
pletely react to these configurations as 
unanalyzed wholes, since the results 
appeared to depend in large measure on 
where in the configuration the chocolate
was placed. Success was much more
frequent when it was at the ends of the 
line, at the corners of the square, or at 
the points of the triangle; and even the 
positions adjacent to these led fre- 
quently to success. 

The child’s perception of the part 
rather than the whole has been reported 
by many psychologists. Binet (in 
Meili) pointed out that the child enu- 
merates details at an age younger than 
that at which he describes a total pic- 
ture. Cramaussel (in Meili), studying 
kindergarten children, used two iden- 
tical mosaics and two other mosaics, 
identical to each other but differing 
from the first two in some small detail. 
The child, given one of the first mo- 
saics, was asked to choose the one of 
the three remaining that was the same. 
That the young child was able to do 
this was interpreted as evidence that 
children have no syncretic perception.
Segers (1926), working with children
4 to 10 years old, used a technique like
the game of lotto. The child was given
a large cardboard on which were drawn
16 pictures of houses. Each house was 
identical to the others except in some 
small detail. The child also had 16 small 
corresponding pictures, which he was 
to match to the pictures on the large 
cardboard. In this study the youngest 
group had the greatest difficulty per- 
ceiving the differences. 

Children’s descriptions of pictures 
have also been considered as evidence 
for whole-versus-part perception. Cra- 
maussel (1924) showed two children in 
their second and third year, respec- 
tively, pictures in art magazines and 
recorded their comments. He found 
that the children did not use the whole 
picture as a basis for their comments. 
Attention went from one to another 
object in a picture but not to the re- 
lations between them. An attempt at 
giving a response to the whole was 
based on a detail and was rather like 
caricature; a distinctive detail was 
seized on and brought the rest with it, 
in spite of incongruities. The material 
used was highly complex; simple pic- 
tures of familiar objects might have 
produced something different. 

The results in these studies appear to 
depend to a large extent on the mate- 
rial used. Meaningless, relatively un- 
structured forms (such as mosaics) may 
well give different results from simple 
familiar forms, such as pictures, or a 
complex painting in an art magazine. 

Two studies of developmental dif- 
ferences in response to the Rorschach 
ink blots are relevant to this whole- 
versus-part controversy. Hemmend- 
inger (1953), working with adults and 
3-to-11-year-old boys, found that the 
youngest children responded globally
by giving whole responses and few details.
By ages 6, 7, and 8, however, there
was an increase in detailed responses,
but these details were not integrated
into larger wholes until ages 9 and 10.
In a longitudinal study of children 2
to 10, Ames et al. (1953) have reported
these same general trends, although
they differ somewhat from
Hemmendinger in respect to the ages
at which these shifts occur.

Studies of closure (completion of incomplete
forms) have generally shown
that children are less able than adults
to recognize a form when it is incomplete.
The technique used is often simply
that of naming the objects portrayed
in incomplete pictures. Sometimes
a series of pictures of increasing
completeness has been used to deter- 
mine age differences in degrees of com- 
pleteness necessary for recognition. Re- 
sults vary somewhat, depending on 
whether the whole picture is gradually 
built up in the series by adding details 
or whether the outer contour of the 
object is filled in by adding details. For 
early studies using these techniques, see 
Meili (1931). 

Siegel and Ozkaptan (1953) studied 
closure by means of 16 geometrical 
forms, cut in two. The subjects (chil- 
dren 4 to 7 and adults), given the two 
halves, were asked to make whatever 
they liked out of them. Although 
“minimal perimetry” occurred in 87.6 
per cent of the adults, it occurred in 
only 58.4 per cent of the children. The 
authors concluded that closure is a 
function of past conditioning and fa- 
miliarity. 

Mooney (1957) studied the relation 
of age and ability in closure. He tested
school children (7 through 13) and
adults. Forty incomplete black-and- 
white representations of the heads and 
faces of people were used, together 
with additional similar but nonsensical 
items. These pictures were made up 
into packs of cards, each pack contain- 
ing 40 real and 20 false (nonsensical) 
items. The task for the child was to 
sort his pack of cards into seven cate- 
gories—six types of people (e.g., boy, 
old woman) and a leftover category. 
The results showed a positive and sig- 
nificant association between ability to 
achieve closure and age. The nature of 
the task (discrimination and categoriz- 
ing as well as closure are called for in 
sorting) may account in part for the 
age differences found. Street (1931), 
using his well-known Gestalt comple- 
tion test (see p. 329), found no differ- 
ences between scores of third-grade, 
sixth-grade, and high-school students. 
Piaget and von Albertini (1954) did 
an elaborate study of the perception 
of intersecting and incomplete forms. 
They found that, with increasing ac- 
curacy from 4 years up, children could 
recognize intersecting forms; but the 
recognition of incomplete forms was 
much more difficult. These forms were
made by cutting or tearing out a piece 
of the original figure. The E showed 
S the seven pieces of cardboard, each 
representing a good form with some- 
thing cut out, and said, “Here are some 
forms out of which something is cut.” 
Then the E took a rectangular paper
and cut out a corner, saying, “Here too, 
something is cut out. Show me how it 
looked before being cut.” The child 
was to show the complete form by 
tracing with his finger or by drawing.
The easiest gaps for the child to fill
were those that required a straight line 
without corners, as in prolonging an 
actual line; arcs of circles were still 
more difficult, and oblique lines and 
corners were the hardest. A 75 per cent 
level of success was not achieved under
7 years. Gaps were perceived, but they 
were filled with scribbles, straight lines 
joining the actual lines, or bumps drawn 
from one of the actual lines to the other.
Very young children tried to make em- 
pirical forms (boats, men) out of the 
cut forms, apparently not perceiving 
them as incomplete wholes. Under 6 
years the child also had trouble rec- 
ognizing dotted-line figures (circle, 
square, triangle, rectangle); and when 
several dotted-line figures overlapped, 
even the 6-year-olds could find only 
the simplest of them. Only at age 9 
could all of them be recognized. Piaget 
and von Albertini interpreted these re- 
sults in terms of primary good Gestalten 
due to field effects and secondary good 
Gestalten that were the result of per- 
ceptual activity during the course of 
development. 


PERCEPTUAL CONSTANCY 


It has already been pointed out that 
the dimensions of space and the forms 
of objects are interdependent. The so- 
called constancy problem is a reflection 
of this fact. The psychologist asks how 
it is that an object is usually perceived 
in its real size or shape when it lies at 
various distances or at various angles 
with respect to the observer. Two op- 
posed explanations have been suggested 
to account for the apparent constancy
of the dimensions of an object. Since
developmental trends in constancy ex- 
periments are thought to be relevant to 
these explanations, the theories are out- 
lined before dealing with the research 
itself. 

The Gestalt psychologists have taken 
the stand that perception is organized 
within a framework and that an in- 
variant relation between certain aspects 
of stimulation, along with the frame- 
work, accounts for constancy. They 
demonstrated constancy in animals
(Köhler, 1915) and considered it a
primary feature of perceptual organization,
even at a simple level. Koffka’s
statement is representative:


Since, however, constancy of any kind 
presupposes dynamic intercourse between 
the segregated object and the whole field, 
constancy itself should under favorable 
conditions appear from the start, since 
progress consists not in creating or increasing
but in decreasing the mutual interdependence
of the parts (1935, p. 240).


N. M. Locke (1938) took a radical 
position of this sort, arguing that con- 
stancy was especially characteristic of 
more primitive organisms, animals and 
children. He believed that “as we as- 
cend the phylogenetic scale, the organ- 
ism increases in intelligence and at the 
same time decreases in degree of per- 
ceptual constancy” (p. 336). 

On the other hand, some psycholo- 
gists, following the Helmholtzian view, 
have argued that constancy is a result 
of learning; the retinal image (pro- 
jected size of an object, for instance) 
is “corrected” by motor and other 
kinds of experience with the objects 
providing stimulation. The young
child, for instance, learns to “allow”
for the distance of an object in esti- 
mating its size because he has reached 
for it or walked to it. The views of 
Piaget and of Brunswik are both of 
this kind. They have led to a number
of developmental studies, for it is predicted,
from this view, that constancy
should increase with age and that lack
of constancy should be characteristic
of the young child for any given judgment.

The results of the developmental 
studies are not easy to interpret in this 
light because they are highly dependent 
on variations in the experimental meth- 
ods employed. 

Methodological Variables. Lamber- 
cier (1946) has written a very detailed 
history of work on size constancy in 
children in which he emphasizes the 
methodological procedures that have 
varied in the studies and that may account
for the wide differences in results.
Some of these conditions are (1)
single comparison versus comparison 
with a series or group of stimuli; (2) 
arrangement and scale level of the 
series (a “central tendency” or leveling 
effect may occur); (3) placing the 
standard nearer than the comparison 
object or placing it farther; (4) differ- 
ent types of stimulus objects (mean- 
ingful-meaningless, flat-solid, filled- 
empty, etc.); (5) distance of the objects 
from one another and from S; (6) the 
background (size, nature of surfaces 
visible, whether objects lie on a ground 
surface or are seen against a vertical 
background, presence or absence of 
other objects); (7) the lateral and vertical
displacement of the objects; (8)
the S’s height of regard; (9) the method
of comparison (distance can be con- 
stant and size varied or size can be 
constant and distance varied); (10) the 
S's attitude (is he trying to match 
“real,” “apparent,” or “projective” 
size); (11) instructions; (12) repetition 
and practice; (13) treatment of results; 
and (14) the psychophysical method 
chosen. 

Size Constancy. Lambercier under- 
took a series of studies of size constancy 
with different age groups that is espe- 
cially instructive for its methodologi- 
cal implications. One of these (1946) 
compared a method of successive single
presentation with two methods of simultaneous
“serial” presentation in
order to determine the effect of the 
constitution of the series on size con- 
stancy. The objects used were thin wire 
rods varying in height. The standard 
(at 1 meter from S) was 10 centimeters 
high. The variable (at 4 meters from 
S) was from 3.5 to 21 centimeters high 
in steps of 0.5 centimeter. In the
method of single comparison the vari- 
able rods were presented successively, 
either in a regular order or by the con- 
centric clinical method. One method of 
serial presentation (“fixed serial”) pre- 
sented an ordered series of variable rods 
simultaneously (3, 7, 15, or 31 at once). 
The median height of the series varied 
from trial to trial. The series was im- 
mobile for any one judgment, and S 
chose the rod “equal” to the standard. 
The other serial method (mobile serial
presentation) involved moving the
series progressively so as to expose a
fraction of the rods in a window before
S’s eyes until the whole scale had
been shown. The fraction exposed at
any moment included 3, 7, or 15 rods.
S was to say when the middle rod of
this exposed fraction was equal to the
standard.

The results for single comparison
and mobile serial comparison were
similar and showed a decrease in constant
error with age (combined means
were 11.6 centimeters for the youngest
age group and 10.3 centimeters for
adults). The precision of judgments increased
markedly from the youngest
children to the adults. The results for
fixed serial comparisons were quite different.
The precision was greatly increased,
compared to the other techniques,
for all ages. The constant error
for any series shifted with the level of
the series. If the middle of the series
was 10 centimeters, the same as the
standard, perfect constancy resulted,
but if the middle of the series was 8
centimeters, the mean judgment was
lowered; if it was 12 centimeters, the
mean judgment was raised accordingly.
In other words, what looked like over-
or under-constancy occurred if the
middle of the series of variables presented
was under or over the standard.
Lambercier felt that the absence of any
evidence for the growth of constancy
with age, found by certain experimenters
(Burzlaff, 1931), might be explained
by their use of a fixed series of comparison
stimuli having a middle stimulus
that was equal to the standard. The
children were actually more susceptible
to the series effect (“central tendency”
or leveling) than the adults. Figure 8-8
from Lambercier shows the wide variation
in precision, depending on the
method. Note the apparent precision,
with a fixed series of variables for comparison,
especially series III 15, where
the middle of the comparison series coincided
in size with the standard. Note
also the difference in variability (width
of threshold) with age.

Fig. 8-8. Graphs taken from Lambercier (1946, p. 195) showing the upper and lower limits
of the threshold (top and bottom line in each graph) and the median (dotted line) for
three age groups (5 to 6 years, 6 to 7 years, and 7 to 8 years) and for adults (the
cross-hatched section in the center of the graph). The different methods of comparison are
indicated below the horizontal axis in order of presentation. A key to the notation of
methods follows: I a. Single comparison, concentric clinical method. II 1a. Single comparison,
in regular order; 1st, 2nd, 3rd, and 4th indicate four successive repetitions. II 3, II 7, and II 15.
Mobile serial comparison; the arabic numerals indicate the number of elements presented
simultaneously in a series. II 1. A repetition of II 1a. III. Fixed serial comparison: III 15
represents the result for a series of 15 elements, with its middle at 10 cm. (equal to the
standard); III com. represents the over-all (combined) result of all the fixed series; III 31
represents the result for a series of 31 elements, with its middle at 12 cm. II 1c. A repetition
of II 1a. I b. A repetition of I a.

The best measure of constancy,
Lambercier thought (the freest from
effects of the scale), was the “mobile
serial” comparison. One can, he showed,
choose a series of variables of such
composition, and presented by such a
method, as to obtain any desired degree
of “constancy” for any age group.
Even with single comparison there is
not really an isolated variable; the scale
of the variables presented will still be
influential.

Another methodological variable responsible
for apparent over- or under-
constancy is the position of the stand- 
ard. Piaget and Lambercier (1943) in- 
vestigated its effects for size constancy, 
using wire rods, but with a method of 
adjustment rather than a constant 
method. In one experiment the stand- 
ard rod, i.e., the one to be matched, was 
placed 1 meter from S and the variable 
(raised and lowered by E until S
judged it to be equal) at 4 meters. In
another experiment the positions of 
standard and variable were reversed. 
The measure of apparent constancy ob- 
tained under these two conditions was 
different because of what Piaget terms 
the “error of the standard.” The stand- 
ard tends to be overestimated; thus, if 
it is placed close at hand, the variable, 
farther away, will be made greater than 
the standard’s 10 centimeters and ap- 
parent under-constancy results. If the 
standard is farther away, the overesti- 
mation of it automatically produces a 
trend toward “over-constancy.” Such 
results were obtained. Even the young- 
est age group had a mean of 104 (slight 
over-constancy) when the standard was 
farther away than the variable. But 
their mean judgment showed “under- 
constancy” when the standard was 
nearer. A comparable shift occurred for 
all age groups. From a developmental 
point of view, the results appear para- 
doxical, for there was a decrease with 
age in the constant error when the 
standard was near and variable far but 
an increase in error with age when the 
standard was far and the variable near 
(the trend is toward greater over- 
constancy). 
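
The direction of the constant error in the two arrangements just described can be made concrete with a short sketch. It rests on assumptions that are not part of the original report: the function names are hypothetical, and the projected (retinal) size is approximated by simple proportionality to distance, which holds only for small visual angles.

```python
def projective_match_mm(standard_mm, standard_dist_m, variable_dist_m):
    """Size the variable would need in order to subtend the same
    visual angle as the standard (small-angle approximation,
    assumed here for illustration)."""
    return standard_mm * variable_dist_m / standard_dist_m


def classify_objective_match(match_mm, standard_mm, standard_is_near):
    """Label an objective (real-size) match in the terms used in the text.

    Standard near, variable far: a variable set larger than the
    standard is called under-constancy.
    Standard far, variable near: a variable set larger than the
    standard is called over-constancy.
    """
    if match_mm == standard_mm:
        return "perfect constancy"
    variable_larger = match_mm > standard_mm
    if standard_is_near:
        return "under-constancy" if variable_larger else "over-constancy"
    return "over-constancy" if variable_larger else "under-constancy"


# A 100 mm standard at 1 meter, with the variable at 4 meters.
same_angle_size = projective_match_mm(100, 1, 4)

# A mean of 104 against a standard of 100, standard farther away.
label_far = classify_objective_match(104, 100, standard_is_near=False)
```

Under these assumptions a purely projective match for the near standard would be 400 mm, and a mean of 104 with the standard far is labeled over-constancy, as in the text.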

One of the biggest sources of confusion
and criticism in constancy experiments
has been the instructions
given S. Sometimes he is clearly asked
to match “real” size. Sometimes a “projective”
size match seems to be suggested
(though seldom clearly). Often
the instructions speak of “apparent”
size, not making clear what is meant
by that term. Piaget and Lambercier
(1951, 1956) conducted two experiments
in an investigation of projective
size matches and compared them developmentally
with real size matches.
They had the hypothesis that projective 
size matches would be more readily 
made by younger children because, 
they thought, objective perception 
(constancy) resulted from a learned 
correction of projected size on the 
retina. They first got real size matches 
from all S’s (five age groups, young- 
est 6 to 8 years) with an adjustable 
variable rod. The concentric clinical 
method was used. This was an easy
task for all the S’s. Projective size
matches followed and required very 
elaborate instruction and practice. (E 
began with teaching S how to make a 
perspective drawing of two lines at 
different distances. Then he was ques- 
tioned about his understanding of the 
drawing. If S did not understand, he 
was shown a window behind which 
cartoon figures could be placed at different
distances. The S was shown two
figures of the same height at different 
distances and two of such height that 
their images were projected on the 
window at the same level when one was 
farther away. If the S still did not un- 
derstand, he was eliminated.) The S’s 
retained made three sets of projective 
estimations, with tests following the 
first two to check their understanding.
The check involved putting a window
with a horizontal grid on the experimental
table in front of S and having
him verify his results by indicating on
the window the projections of the
standard and the variable he had chosen.

Some of the results from these two
experiments are presented in Table 8-1.


TABLE 8-1
Mean objective and projective size matches in mm. made by different
age groups with standard near and standard far. The projective matches
given were made after practice. (From Piaget and Lambercier, 1951 and
1956)

Age 7-8 8-10 10-12 12-14 Adults
Standard Near* Objective 102 97 94 91 90
Standard Near* Projective 297 217 206 219 280
Standard Far† Objective 104 118 116 124
Standard Far† Projective 161 150 150 143

* When standard is near, a perfect projective match would be 400 mm.;
for an objective match, a number less than 100 indicates over-constancy.

† When standard is far, a perfect projective match would be 100 mm.; for
an objective match, a number greater than 100 indicates over-constancy.
Note that objective matches are made more accurately by the children than
by the adults.


The most striking fact is the extreme
inaccuracy of the projective size
matches for all age groups, even after
practice. (The figures given for projective
size matches represent the third set
of judgments. The two prior sets were
less accurate.) Age differences in projective
estimation were found, but they
were not consistent in the two experiments.
When the standard was near, the
youngest age group actually made better
projective matches than the other
groups; but in the second experiment
it did not. The authors thought that
practice had covered up such a trend,
but since practice was essential for performing
the task at all the conclusion
is dubious. Lack of precision, especially
for the youngest age group, was very
evident. In one of the experiments more
than half of the youngest age group
was eliminated before the projective
judgments were taken. Such elimination
could have resulted in a group selected
for intelligence compared to the
other age groups. When complicated
instructions are involved in an experiment
used for comparative purposes,
some method of equating intelligence
of the groups is desirable. Possible interactions
of age, practice, and intelligence
make these experiments difficult
to interpret.

An older experiment by Weber and 
Bicknell (1935) might be compared 
with the foregoing. Weber and Bick- 
nell photographed disks of different 
sizes, at three different distances, and
presented them in a stereoscope with
3 degrees of disparity as well as to the 
S’s unaided eye. The S was asked to 
take a “phenomenal” attitude rather 
than an “object-directed” attitude; that 
is, to judge the photographic size of the 
disk, i.e., the projected size. To do this 
he selected a disk from a set of 30 
comparison-circles varying in area, 
after gauging the phenomenal area of 
the photographed disk. All of the S’s 
(children 9 to 12 and adults) showed 
a tendency to see the disks closer to 
their real sizes than the photographic 
size would warrant. Age differences 
were small, but the children showed by 
one measure a greater tendency to see 
the photographed object as larger than 
its stimulus-size. The tendency became 
larger with practice. These results ap- 
pear contradictory to those of Piaget 
and Lambercier, but the point to be 
made is rather that differences in ex- 
perimental conditions can alter results 
profoundly. The use of photographs,
the comparison with a series of objects, 
and the stereoscopic situation might all 
make a difference, despite the fact that 
the attitude invoked was similar. 
Studies of size constancy in infants 
have been made by using the response 
of reaching instead of verbal judgments. 
Cruikshank’s (1941) study employed 
infants 10 to 50 weeks old. She had two 
rattles of similar appearance, one three 
times as large as the other. A rattle was 
presented in front of the infant at a 
distance of either 25 or 75 centimeters, 
and the infant’s facial and reaching 
movements were noted. There were 
three situations: (1) the small rattle at 
25 centimeters; (2) the small rattle at 
75 centimeters; (3) the large rattle at
75 centimeters, making its retinal image
the same as that for the smaller at a
distance of 25 centimeters. Reaching in
situation (1) would be an adaptive response,
but it would not be so in (2)
or (3), since the rattle was too far away
to be grasped. Reaching only in (1)
would presumably indicate the presence 
of size constancy and distance percep- 
tion. The same response in situation 
(1) and (3) would indicate that pro- 
jective size was perceived, without dis- 
crimination of distance. At about 14 
weeks, positive responses occurred 62 
per cent of the time in (1), 31 per cent 
in (2), and 35 per cent in (3). By 24 
weeks, response was 99 per cent in (1), 
and only 13 per cent in (2) and in (3). 
Cruikshank concluded that infants of 6 
months distinguished between objects 
of different size, even when the retinal 
extents were equal; in other words, size 
was perceived in relation to distance. 
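
The geometry behind the three situations can be checked with a brief sketch. The rattle sizes used here (5 and 15 centimeters) are hypothetical; the report says only that one rattle was three times as large as the other. The function name is likewise an assumption for illustration.

```python
import math


def visual_angle_deg(size_cm, distance_cm):
    """Visual angle subtended by an object of the given size
    viewed head-on at the given distance."""
    return math.degrees(2 * math.atan(size_cm / (2 * distance_cm)))


SMALL_CM = 5.0             # hypothetical size of the small rattle
LARGE_CM = 3 * SMALL_CM    # three times as large, as in the text

situation_1 = visual_angle_deg(SMALL_CM, 25)   # small rattle, near
situation_2 = visual_angle_deg(SMALL_CM, 75)   # small rattle, far
situation_3 = visual_angle_deg(LARGE_CM, 75)   # large rattle, far

# Situations (1) and (3) give the same retinal extent, so an infant
# responding to projected size alone should treat them alike.
same_retinal_extent = math.isclose(situation_1, situation_3)
```

On this reasoning, reaching in (1) but not in (3) cannot be explained by retinal extent alone, which is the inference Cruikshank drew.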

A methodologically very different 
study was made by Dukes (1951) on a 
6-year-old boy. He was randomly in- 
terrupted during his ordinary daily ac- 
tivities and asked what he was looking 
at; then he was asked to make a size 
judgment of the object by designating 
size equivalents in terms of other ob- 
jects in his environment, at different 
distances, which he chose himself. A 
high degree of constancy was shown. 
This experiment was inspired by 
Brunswik (1956) and is an example of 
“representative design.” 

Constancy of Interspaces. A differ- 
ent type of size judgment was studied 
in an experiment by Costa (1949). S 
was asked to match the size of empty 
intervals between pairs of horizontally 
grouped objects seen at different distances
against a neutral background.
The objects delimiting the intervals 
were suspended in air so that S could
not relate them to a ground surface. 
The standard interval between two objects
(20 centimeters) was 1 meter
distant from S. The variable interval 
between two similar objects was always
at 4 meters. In one series S judged an
interval marked by sticks, in a second,
one marked by small cubes, and in a
third the interval was filled by a solid
stick. A concentric clinical technique
was used. Adults made good objective 
matches even for the empty intervals. 
Children tended toward under-con- 
stancy for the empty intervals, but 
there was no steady progression from 
the youngest group to the oldest. For 
the solid intervals the tendency of the 
younger groups toward under-con- 
stancy was less apparent (the median 
judgment in this case for the youngest 
group was 22.5 centimeters, compared
with a standard of 20 centimeters).
Color Constancy. Constancy of color 
and shape have also been studied devel- 
opmentally but less frequently than 
size. Brunswik (1956) studied brightness
constancy in children by using a
graded series of papers (black to white) 
with a standard presented at different 
distances from a light source. A method 
of single comparison was used (one 
variable at a time compared with the 
standard). He found a developmental 
curve, with constancy increasing up to 
age 10 and showing a decline later at 
about 16. Another study by Burzlaff 
(1931) employed a group of compari- 
son shades of gray presented simultane- 
ously and found no developmental 
curve whatsoever; children of four
showed complete constancy. From
these two studies it might be suspected 
that all the variables that affect the re- 
sults in size constancy experiments 
would do so for color—that is, such 
factors as instructions and attitude, 
method of presentation (single or si- 
multaneous grouping), or psychophysi- 
cal method. As an example of the last, 
Burzlaff found different results when 
a method of adjustment (with a color 
wheel) was used than when a group 
of variables was offered for simultaneous
comparison.

Shape Constancy. An experiment by 
Klimpfinger (reported in Brunswik, 
1956) on shape constancy obtained a 
developmental curve similar to Bruns- 
wik’s for color. Ellipses tilted at vary- 
ing degrees were compared with a 
standard, again with single presentation. 
No comparable developmental study 
with groups of comparison stimuli pre- 
sented simultaneously appears to be 
available. That results for shape con- 
stancy should vary with the experi- 
mental conditions, as in size constancy, 
is to be expected. 

In conclusion, it should be re-empha- 
sized that developmental experiments 
on the constancies face extremely diffi- 
cult problems of control and interpre- 
tation. The number of variables that 
may influence results is large, and every 
one may alone produce effects varying 
with age and development. Interpreta- 
tion of any particular developmental 
curve of constancy must therefore be 
cautious, since it is almost certainly 
only one of many possible ones. 

It does not seem possible at the pres- 
ent time to claim exclusive support for 
either of the two theories described
from the developmental data available.
What does seem fairly clear, however, 
is a trend toward greater precision of 
judgments with age, when the relevant 
data are furnished. One more experi- 
ment, which emphasizes this trend, 
might be cited. Cohen, Hershkowitz, 
and Chodack (1958) studied size con- 
stancy with different distances of the 
standard in subjects 5 to 17 years old. 
The comparison stimulus was always at 
a distance of 8 meters from S, whereas 
the standard was at 2 or 6 meters. Only 
one of the thirteen comparison stimuli 
was presented at a time. Ascending and 
descending series of judgments were 
obtained so that a measure of precision 
(interval of uncertainty) could be ob- 
tained. Three sizes of standard were 
employed. There was no difference in 
constant error for the age groups when 
the standard was at 6 meters. When the 
standard was at 2 meters, the 17-year- 
old group was significantly more ac- 
curate, but there was no difference in 
accuracy between the other age groups. 
Thus constancy, as it is usually defined, 
did not show a clear trend with age. 
Precision showed more consistent dif- 
ferences with age, the interval of un- 
certainty tending to decrease. The au- 
thors suggest that the performance of 
the younger children can be interpreted 
as a “reflection of their greater diffuse- 
ness in perception.” 
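
The distinction drawn here between constant error (accuracy) and interval of uncertainty (precision) can be illustrated with a conventional method-of-limits computation. The report of Cohen, Hershkowitz, and Chodack is not quoted as giving its formulas, so the scoring below is an assumed textbook formulation, and the function name and the sample values are hypothetical.

```python
from statistics import mean


def method_of_limits_summary(upper_transitions, lower_transitions, standard):
    """Summarize ascending and descending series.

    upper_transitions: comparison sizes at which judgments changed
        between "equal" and "larger".
    lower_transitions: comparison sizes at which judgments changed
        between "smaller" and "equal".
    The scoring convention is an assumption made for illustration.
    """
    upper = mean(upper_transitions)
    lower = mean(lower_transitions)
    pse = (upper + lower) / 2  # point of subjective equality
    return {
        "interval_of_uncertainty": upper - lower,  # precision
        "pse": pse,
        "constant_error": pse - standard,          # accuracy
    }


# Hypothetical transition points for one subject, standard of 10 units.
summary = method_of_limits_summary([11, 12, 11, 12], [9, 10, 9, 10], standard=10)
```

Two subjects can share the same constant error while differing widely in the interval of uncertainty, which is the pattern the age comparison above suggests.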

A trend toward greater over-con- 
stancy with age has appeared in a 
number of experiments. This finding is 
sometimes interpreted as evidence for 
the learning theory of constancy. Such 
an interpretation is not valid, however, 
for the only prediction that could rea- 
sonably be made from the learning 
theory is that judgments should become 
more accurate with increasing amounts 
of experience with objects. A clearer 
formulation of the concept of constancy 
and a stricter logic in deducing 
the experimental outcome to be predicted 
will have to precede use of developmental 
trends as crucial evidence 
for either theory. 


PERCEPTION OF MOVEMENT 


Lower Threshold. The lower threshold 
for visually perceived movement 
was studied by Carpenter and Carpenter 
(1958) with a moving belt. The S’s 
were two children (81 and 101 months 
old), two chimpanzees, and adults. A 
training situation with reward was used 
to instruct the children and chimpanzees. 
The threshold was determined by 
a method of limits. The data consist- 
ently supported the conclusion that the 
threshold is lowered with maturity. 

Perception of Velocity. Fraisse and 
Vautrey (1952a) studied the perception 
of velocity (speed of movement) in a 
group of school children aged 4 years 
6 months to 5 years 6 months. Two 
toy cyclists (one blue, one red) were 
moved on parallel tracks by E’s hand 
and perpendicular to the child’s line of 
regard. Fourteen different experimental 
arrangements were used, in which spa- 
tial, temporal, and movement factors 
were systematically varied. Sometimes 
the cyclists disappeared into tunnels at 
the end of the course, and sometimes 
the child could witness their arrivals 
at the finish. The child compared the 
space covered, the speed, and the time 
taken (duration). Here are two sample 
courses: 


[Diagram of the two sample courses, showing the paths of cyclists m1 and m2; the track drawings are not reproduced here.] 

The space covered is equal, arrival is 
simultaneous, but m2 leaves later than 
m1, goes twice as fast, and overtakes him. 

This is the tortoise and the hare. m2 
leaves after m1. m1 is less rapid, but 
still keeps ahead of m2. They arrive 
simultaneously. 

The child was told before the test to 
note the speed of the cyclists, and after- 
ward he was asked, “Did they go at 
the same speed?” If he answered no, 
he was asked which went faster. Similar 
questions were asked about space and 
time but not at the same test. In all the 
situations, including the tunnel ones, at 
least half the children compared the 
velocities correctly. Of those errors 
that occurred, passing or preceding 
during the run was sometimes the 
cause. When the slower preceded 
throughout, there were errors, but 
nearly 75 per cent of the children 
judged correctly. The authors con- 
cluded that there was no single sys- 
tematic error. Perception of velocity is 
relative to the spatial situation in which 
the movement is presented, and pass- 
ing, preceding, and overtaking are 
“perceptual data” by which velocity is 
manifested. But the child could com- 
pare velocities even when the greater 
speed did not correspond with passing. 
They concluded that there was “direct 
perception of velocity.” 
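
The physical relations behind such courses are simple arithmetic, and a short sketch may help in laying out comparable arrangements. It covers only the first sample course (equal space, simultaneous arrival, the later starter going twice as fast). The class name, field names, and numerical values are hypothetical and are not taken from Fraisse and Vautrey.

```python
from dataclasses import dataclass


@dataclass
class Run:
    """One cyclist's run on its track (hypothetical units)."""
    start_time: float    # seconds after the start signal
    arrival_time: float  # seconds after the start signal
    distance: float      # length of track covered

    @property
    def duration(self) -> float:
        return self.arrival_time - self.start_time

    @property
    def speed(self) -> float:
        return self.distance / self.duration


def faster(a: Run, b: Run) -> str:
    """Return which run was physically faster: 'a', 'b', or 'same'."""
    if a.speed == b.speed:
        return "same"
    return "a" if a.speed > b.speed else "b"


# Equal space, simultaneous arrival, but the second cyclist leaves later.
early = Run(start_time=0.0, arrival_time=4.0, distance=100.0)
late = Run(start_time=2.0, arrival_time=4.0, distance=100.0)
```

Here `late.speed / early.speed` is 2, so a child who answers that the later starter went faster is judging velocity correctly even though both arrive together and cover the same space.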

Perception of Apparent Movement. 
Meili and Tobler (1931) studied strob- 
oscopic (apparent) movement in chil- 
dren 5 to 12 years old and adults. They 
used a technique that projected on a 
screen two luminous vertical dashes. 
The time between the appearance of 
the first dash, A, and the second dash, 
B, was variable. As the time interval 
was decreased the percept changed 
from one of succession (A then B) to 
optimal movement (a single object 
moved from the position of A to that 
of B) and finally to simultaneity (A 
and B). The range of time intervals 
was such that only children reported 
simultaneity. 

Initially, two little cartoon figures 
were projected in succession, and the 
child was asked to describe them. Then 
the figures were presented at an inter- 
val that produced optimal movement, 
i.e., the child said that one figure ran 
or jumped from one place to the other. 
When the child understood the stages 
of succession and movement, the figures 
were replaced by dashes, and the child 
was asked if he saw two dashes or a 
single dash going from one place to the 
other. In addition, a real movement was 
projected below the stroboscopic move- 
ment and in the opposite direction. In 
order for a movement response to be 
acceptable, the S had to say that the 
apparent and real movements looked 
the same, except for direction. For each 
S, the threshold was determined at 
which succession gave way to move- 
ment with decreasing time intervals and 
movement gave way to succession with 
increasing time intervals. The authors 
found that in the descending series 
(time interval decreasing) succession 
perseverated to values for which in the 
ascending series there was movement. 
Despite great individual differences, the 
results showed that stroboscopic move- 
ment appeared in children under the 
same conditions as in adults but at time 
intervals greater than those needed for 
adults, i.e., children saw movement 
when adults saw succession. 
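
The separate thresholds for descending and ascending series, and the perseveration between them, can be sketched as follows. This is an illustration only: Meili and Tobler's scoring rule is not given in the source, so the midpoint rule, the function name, and the time intervals below are assumptions.

```python
def transition_point(series):
    """Return the interval at which S's report first changes.

    series: list of (interval_ms, report) pairs in order of presentation,
        where report is "succession" or "movement".
    The transition is taken as the midpoint between the last interval
    before the change and the first interval after it (an assumed rule).
    """
    for (prev_interval, prev_report), (interval, report) in zip(series, series[1:]):
        if report != prev_report:
            return (prev_interval + interval) / 2
    raise ValueError("report never changed in this series")


# Hypothetical series for one S (intervals in milliseconds).
descending = [(300, "succession"), (250, "succession"),
              (200, "succession"), (150, "movement")]
ascending = [(100, "movement"), (150, "movement"),
             (200, "movement"), (250, "succession")]

down = transition_point(descending)   # succession gives way to movement
up = transition_point(ascending)      # movement gives way to succession
perseveration = up - down             # width of the overlap region
threshold = (up + down) / 2           # one conventional summary value
```

In this invented record the S still reports succession at 200 ms in the descending series but reports movement at the same interval in the ascending series, which is the kind of perseveration the authors describe.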

Gantenbein (1952), working with 
the same age range, used a similar tech- 
nique but varied the lengths of time 
of presentation of A and B, the light 
intensity, and the point of fixation, as 
well as the time interval between the 
presentation of A and B. Her apparatus 
allowed for a variation in the condi- 
tions so that developmental differences 
in the perception of simultaneity could 
be determined. She used the concentric 
clinical method. Gantenbein found, at 
all ages, that greater exposure time of 
A and B, low intensity, fixation on A, 
a smaller time interval between A and 
B, and increase of distance between 
stimuli all favored the perception of 
movement and simultaneity. The 
younger the S, the more easily was the 
perception of movement and simul- 
taneity produced, with children experi- 
encing movement over a larger range 
between succession and simultaneity 
than adults. 

Perception of Complex Movement 
Relationships: The Causal Effect. By 
determining the conditions of motion 
that produce the causal effect, Michotte 
established the perception of causality 
as a special case of perception of move- 
ment. Olum, using the Michotte tech- 
nique, studied the perception of cau- 
sality in children. This experiment em- 
ployed the phenomenological method 
as discussed (see p. 322). The same ex- 
periment was also performed with two 
other groups (one adult and one child) 
and used a different experimental 
method, namely, under conditions of 
specific instructions related to the ex- 
pected causal and noncausal responses 
(Olum, 1958). The notions of shove 
and tag were chosen, after considerable 
pretest of possible instructions, as the 
best translation, into terms understand- 
able to a child, of the expected causal 
and noncausal experiences. Before proceeding 
with the experiment, the E explained 
to each child the differences 
between the motions involved in tag 
and shove, in an attempt to make cer- 
tain that the child knew what to look 
for in the perceptual situation. 

The number of shove (causal) re- 
sponses and tag (noncausal) responses 
for adults and children were compared, 
and it was found that there were sig- 
nificant differences between children 
and adults. Stimulus conditions to which 
the adult response was noncausal evoked 
in children a significantly greater num- 
ber of causal responses. When these two 
groups were compared to the two 
groups that were not given specific in- 
structions, differences were also found. 
Especially striking were the differences 
between the two child groups. Under 
conditions of specific instruction, no 
passing or mutual approach responses 
were given. It is not clear whether the 
instructions actually changed the percept 
for the children or merely inhibited 
their verbal report. This situation 
illustrates again the influence of 
the method chosen on the results obtained. 

* The phrasing of instructions in meaningful 
terms is especially important when working 
with young children, since they will often 
try to perform for the experimenter, whether 
or not they completely grasp the instructions. 
Even after terms are chosen that are meaningful 
to the age group, it is necessary to make 
sure that each child understands what is required. 

Other work in this area has recently 
been reported from Piaget's laboratory 
in Geneva. Piaget and Lambercier 
(1958) studied a large number of causal 
effects with children (4 to 8 years old) 
and adults, also using the Michotte disk 
technique but with different stimulus 
conditions than those used by Olum. 
One problem studied intensively was 
the effect on the perception of causality 
of changes in the spatio-temporal in- 
terval between the two objects at the 
place of their expected causal interac- 
tion. Results from this study indicated 
that (1) children perceive a contact be- 
tween the two objects in motion when 
there is, in fact, no contact and when 
adults perceive none; and (2) when 
children fail to perceive such a contact 
they also do not experience the causal 
effect, whereas when adults fail to per- 
ceive such contact they continue to 
perceive causality but across a per- 
ceived gap, that is, causality at a dis- 
tance. 

A modification of the phenomeno- 
logical method discussed (p. 322) was 
used by Piaget and Lambercier. Since 
they were interested in certain specific 
aspects of S’s experience (e.g., the perception 
of contact between A and B), 
if S failed to report on these spontane- 
ously, E helped by asking relevant 
questions and by introducing stimulus 
variations that would call S’s attention 
to these aspects. Such a modification of 
the phenomenological method may of- 
ten be unavoidable when working with 
very young children, but it has obvious 
dangers. The E is actually instructing 
S to concentrate on aspects of the situa- 
tion that may or may not have been 
previously present for him. Since, as 
already noted, instructions may often 
change the percept itself, or at least S’s 
response, it is important to keep the 
instructions explicit and identical for 
all S’s. 

A device, worth noting for work 
with the very young, was used in these 
experiments. Cubes of wood, painted to 
represent the moving objects in the 
apparatus, were placed on the table at 
which the child sat during the experi- 
ment. The child, by moving these cubes, 
could demonstrate how the objects in 
the apparatus appeared to him to be 
moving. This is an excellent way to 
check on the young child’s verbal de- 
scriptions. 

A second study, by Piaget and Ma- 
roun (1958), was concerned with de- 
velopmental differences in the percep- 
tion of tactile-kinesthetic impressions of 
causality and the influence on these im- 
pressions of visual perception. 

The S, holding a special type of rake, 
felt-padded to eliminate any sound 
when it came into contact with an- 
other object, stood at the end of a long 
table on which there was a track containing 
a number of felt-padded boxes, 
placed one ahead of the other. The 
rake, pushed along the track, struck the 
first box. As the S continued to push 
the rake, the first box struck the second, 
and so on. The purpose of the experi- 
ment was to determine (1) where the 
feeling of impact was experienced, that 
is, was it localized in some part of the 
rake or in some part of the body or was 
it delegated beyond the pushing-end of 
the rake; (2) were there developmental 
differences; and (3) were these locali- 
zations or delegations influenced by vi- 
sion. 

Adults and children (6 to 7 years old) 
were studied under two conditions: (1) 
the S’s started with their eyes open and 
then repeated the task with their eyes 
closed; (2) the S’s started with their 
eyes closed and then proceeded with 
eyes open. 

The results indicated (1) that when 
the rake comes in contact with the first 
box the impact is located at the end of 
the rake; when the rake pushes a box, 
which then pushes another ahead of it, 
there is a delegation of the impact from 
box to box. (2) Such impressions of 
location and delegation increased in fre- 
quency with age. (3) These impressions 
were stronger if S worked with his eyes 
open; and there was a strong influence 
of the visual on the tactual-kinesthetic, 
the groups working first with eyes open 
showing stronger impressions of locali- 
zation and delegation in the eyes-closed 
situation than the groups working initially 
with eyes closed. 


PERCEPTION 
OF COMPLEX QUALITIES 


Qualities of Objects. The perception 
of objects in terms of their expressive 
or physiognomic qualities is sometimes 
said to be more characteristic of the 
child’s perception than of the percep- 
tion of the adult (Werner, 1948, p. 
67f.). Wapner and Werner (1957) 
studied the effect of the physiognomic 
quality of motion inherent in a static 
figure on perceived velocity when this 
figure was actually moved along a 
trajectory. Four sets of pictures, each 
set consisting of a standard, showing a 
static figure (e.g., a grazing horse), and 
a variable, showing a dynamic figure 
(e.g., a running horse), were used with 
children 6 to 19 years old. An additional 
set consisted of a triangle with apex 
pointing away from the direction of 
movement as standard and the identical 
triangle pointing in the direction of 
movement as variable. Standard and 
variable were drawn on belts of white 
paper and moved independently, each 
behind its own stationary window. The 
S, seated in front of the two windows, 
had the task of adjusting the speed of 
the variable to that of the standard. 
There were four trials for each set of 
figures, two trials in which the speed 
of the variable was initially set at less 
than that of the standard and two in 
which the speed was initially greater 
than that of the standard (see Fig. 8-9). 

The results indicated that for the 
youngest age group (6-7) the dynamic 
figure was perceived as moving faster 
(i.e., adjusted to a slower speed) than 
the static figure. This effect decreased 
with age and held for all five sets of 
figures. 

Qualities of People. The perception 
of expressive qualities of human faces, 
sometimes called social perception, is 
of increasing interest to psychologists 
today. For an overview of this area, see 
Bruner and Tagiuri (1954). Murphy 
and Murphy (1931, Ch. 6) have an ex- 
cellent discussion of the early experi- 
mental work with children on the 
development of responses to social 
stimuli. Among these early studies, 
many were concerned with the smiling 
response. The more recent study by 
Spitz and Wolf (1946) is the most 
thorough on this subject. 

Fig. 8-9. Static and dynamic figures (from Wapner and Werner, 1957, p. 32). 

This study was designed to de- 
termine the stimulus conditions nec- 
essary to evoke the smiling response, 
its developmental pattern, and its universality. 
The S’s were 251 children 
(White, Negro, Indian) from five different 
environments (private homes, 
nursery, foundling home, delivery 
clinic, Indian village) and ranged in age 
from birth to 1 year. A number of 
experiments were performed on each 
child to exclude the possibility of ac- 
cidental results, and each experiment 
was done independently by both a male 
and a female experimenter, who neither 
talked to nor touched the child. The 
mother was never present in the child’s 
visual field. The technique was to 
present the child with a nodding or 
smiling face, in full-face. When the 
child smiled, the face was slowly turned 
into profile, continuing to nod or smile. 
If the child stopped smiling, the face 
was turned back to full-face. In the 
standard smiling situation, the E smiled 
or nodded pleasantly at the child. In 
addition three other conditions were 
used: (1) the E assumed an expression 
of a savage beast baring its fangs, thus 
conserving the configurational aspects 
of the stimulus but removing the pleas- 
ant emotional quality; (2) the E wore 
a Halloween mask and rhythmically 
stuck out his tongue, thus retaining 
configurational aspects but removing 
all human attributes; and (3) a life-size 
puppet with a nodding head was used, 
with the E out of sight. 

Results showed that although the 
motor pattern for smiling was present in 
the baby from the first few days of life, 
without an environmental stimulus, the 
social responses of smiling did not be- 
gin until about three weeks. In the 
3-to-6-month group it was almost un- 
failingly present in all of the conditions 
discussed when presented in full-face. 
When the face was turned to profile, 
the child stopped smiling. In the second 
half of the first year the child no longer 
smiled indiscriminately but only at cer- 
tain people. 

A control series, consisting of many 
objects of varying intensity, surface 
structure, shape, size, familiarity and 
emotional significance (e.g., bottle, 
doll, or block with funny face on it) but 
lacking the configurational aspects, was 
also used. At no age did the children 
tested smile in response to these ob- 
jects. 
These findings held for all groups 
tested, despite differences in racial ori- 
gin and background. Spitz maintained 
that this pattern is established by the 
child’s experience with the mother, 
whose face is seen full-face in emotion- 
ally positive situations of feeding and 
child-care. 

Gates (1923) studied the growth of 
the ability to identify specific emotions 
by facial expression in photographs in 
children 3 to 14 years old. Six photo- 
graphs, representing joy, pain, anger, 
fear, scorn, and surprise, were used as 
stimuli. For each picture, she asked, 
“What is this lady doing?” “How does 
she feel?” There was a gradual increase 
with age in the ability to interpret each 
picture correctly. Laughter was the 
only emotion recognized by over half 
of the children at age three. It was not 
until age six that half of the children 
recognized pain; and anger, fear, and 
surprise came later than that, with 
scorn the most difficult of all. Only 
43 per cent of the 11-year-olds judged 
scorn correctly, although adults had no 
difficulty with it. 

Dashiell (1927) modified this method 
somewhat by first telling the child a 
story and then asking him to pick from 
a series of pictures the appropriate one 
for each incident in the story. This 
method has the advantage of depending 
less upon the available vocabulary of 
the child. Consequently, it proved to 
be a more discriminating tool for use 
with very young children, especially in 
studying recognition of the subtler emo- 
tions. 

Gates (1927) studied the child’s 
ability to identify expressive qualities in 
the human voice and related this to his 
ability to identify emotions in facial 
expression, using children in grades 3 
through 8. A recording was made of an 
actress repeating the letters of the 
alphabet in tones of happiness, un- 
happiness, anger, fear, surprise, scorn, 
defiance, pity, and suspicion. Gates’ 
results showed a developmental increase 
of correct judgments, although there 
was a greater number of correct re- 
sponses to the visual than to the auditory 
situation. 

Harlow’s (1958) recent study of the 
stimulus properties of a love object is 
a new approach to social perception. 
Although Harlow worked with baby 
monkeys, his research has relevance for 
child development. The question was, 
what perceptual qualities must an ob- 
ject have to be accepted by the baby 
monkey as an adequate mother-object 
(i.e., the mother-object must evoke in 
the baby the characteristic baby-mother 
affectional responses). Two surrogate 
mothers were constructed, one of wire 
mesh and the other of wood covered 
with sponge rubber and sheathed in tan 
cotton terry cloth, both radiating heat. 
Eight newborn monkeys had both sur- 
rogates equally available, but for four 
of the monkeys the cloth mother 
lactated and the wire mother did not, 
whereas for four others the condition 
was reversed. In order to determine the 
relative importance of nursing comfort 
and contact comfort, the time each 
monkey spent on each surrogate mother 
was recorded for 165 consecutive days. 
The physiognomic qualities (tactual 
and visual) of the cloth mother won 
out, since regardless of whether the 
monkeys nursed from the cloth or wire 
mother they clearly preferred the cloth 
mother, spending much more time on 
her and running to her as a haven of 
refuge when they were frightened. This 
was nicely underlined when the mon- 
keys were put in the “love machine,” 
a dimly lighted box with a little door 
which the monkey could open in order 
to look out. The strength of affectional 
responsiveness to an object was meas- 
ured by the rate with which the mon- 
key opened the door to look at the 
object outside. The cloth mother, wire 
mother, another baby monkey, and an 
empty box were presented for a thirty-minute 
period on successive days. The 
monkeys responded equally to the cloth 
mother and another baby monkey but 
showed no greater responsiveness to the 
wire mother than to an empty box. 

Attempts to identify the critical 
stimuli for physiognomic qualities will 
doubtless increase in the near future, 
but methods in this area are still in 
their infancy. 


PERCEPTION OF QUANTITY 


Although the growth of number con- 
cepts has received much study (see 
Chapter 10), there have been few ex- 
periments with children on the percep- 
tion of quantity or numerosity. Long 
and Welch (1941) studied the develop- 
ment of ability to discriminate and 
match for quantity. Three kinds of test 
were used. 


1. Discrimination of ten marbles from 
a varying quantity of them, beginning 
with one and progressing to nine by a 
modified method of limits. The child 
was told to look for a toy under the box 
with most marbles. 

2. A number matching test. Two 
marbles were placed in front of the 
child. He was then given ten to fifteen 
marbles and told to make a pile just 
as big. If he succeeded, the standard pile 
was increased. 

3. A group matching test. Four 
groups of marbles were placed within 
a radius of 6 inches. Two groups con- 
tained two marbles, two contained 
three. The child was asked to point to 
the groups containing most marbles 
and then to the groups containing least. 
The numbers were increased following 
a success. 


The performance on all three tests in- 
creased with age (30 months to 7 years), 
but the tests were not of the same 
degree of difficulty. The authors concluded 
that they were measuring different 
things, not a single trait of 
“quantitative perceptual ability.” A 
concept of “larger” was needed for 
test 1 (the easiest), of “sameness” for 
test 2, and both for test 3 (the hardest). 

Several studies have been performed 
to discover the factors that aid the child 
to apprehend numbers as a group rather 
than to count individual items. Carper 
(1942) varied context (e.g., embedding 
of objects in a group), form (geometric 
as opposed to familiar objects), and 
pattern arrangement (regular versus ir- 
regular) in pictures drawn on cards. 
The subjects (kindergarten children) 
drew as many objects as they per- 
ceived. Apprehension by groups was 
apparently hindered by complex em- 
bedding, “poor” form units, and ir- 
regular patterns. Dawson (1953) also 
showed children cards on which objects 
were differently arranged and asked 
them to say how many there were as 
rapidly as possible. Apprehension by 
grouping was hindered by complexity 
(lack of symmetry, heterogeneity of 
elements, poor figure-ground relations, 
etc.). First graders could not apprehend 
more than four or five elements as a 
group. With more, they tended to 
count. 


TIME PERCEPTION 


Fraisse and his co-workers have 
studied perception of duration in chil- 
dren with both visual and auditory 
presentation of intervals. In the same 
situation used to investigate velocity, 
Fraisse and Vautrey (1952b) secured 
comparative judgments of duration for 
two toy cyclists running courses of 
variable lengths and at variable speeds. 
The children (5 years old) were asked 
which took longer to run its course. The 
judgments averaged 50 per cent correct. 
Correctness depended on whether the 
factors on which the children based 
their judgments corresponded to the 
actual duration. There was no “intuition 
of the duration as such.” Time, the 
authors thought, is perceived by adults 
in terms of succession of events and is 
thus simply a constituent of the struc- 
ture given by these events. Children of 
5 estimated duration by the quantity 
of work apparently done, i.e., whether 
the cyclist went farther or faster, or by 
the quantity of “effort,” i.e., taking 
longer to do the same work (the chil- 
dren referred to the cyclist in this case 
as “tired”). The situation permitted 
identification of the child with the 
cyclist, and he responded by estimating 
time in terms of action. 


Fraisse and Orsini (1958) used an 
adjustment technique with children of 
three age groups (6, 8, and 10 years). 
The task was to match the duration 
(thirty seconds) of a noise made by a 
synchronous motor. When the noise 
stopped, the motor was started again, 
and S was to stop it when it had lasted 
equally long. Three lights of different 
color gave S knowledge of results 
(right, too long, or too short). Accuracy 
increased with age. All the children 
improved, the youngest most. The 
6-year-olds tended to overestimate, 
whereas the 10-year-olds had a tendency 
to underestimate. 
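
The scoring implied by such an adjustment task with knowledge of results can be sketched briefly. The source states only that three lights signaled the outcome of each trial; the tolerance that counted as a correct match is not reported, so the 1.5-second band, the function names, and the sample values below are hypothetical.

```python
STANDARD_S = 30.0  # duration of the standard noise, in seconds (from the text)


def feedback(reproduced_s, standard_s=STANDARD_S, tolerance_s=1.5):
    """Classify one reproduction as the feedback lights might have.

    tolerance_s is an assumed value; the original criterion is not reported.
    """
    error = reproduced_s - standard_s
    if abs(error) <= tolerance_s:
        return "right"
    return "too long" if error > 0 else "too short"


def mean_signed_error(reproductions, standard_s=STANDARD_S):
    """Average signed error; positive means reproductions ran long."""
    return sum(r - standard_s for r in reproductions) / len(reproductions)


def mean_absolute_error(reproductions, standard_s=STANDARD_S):
    """Average unsigned error, a simple index of accuracy."""
    return sum(abs(r - standard_s) for r in reproductions) / len(reproductions)
```

Keeping the signed and the absolute error separate matters here, since a group can improve in accuracy while still showing a consistent directional tendency.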

Smythe and Goldstone (1957) used 
a different technique with auditory 
duration. The S’s were children, 6 to 
14, and adults. The S was asked to 
report whether a tone heard was more 
or less than a second. Runs of ascending 
and descending duration, starting at 1 
second, were given. Then S was given 
five runs of 1 second duration and told 
to use them as a standard; then he was 
tested again. All the S’s overestimated 
a second. Variability decreased with in- 
creasing age. The 6- and 7-year-olds 
were unable to profit from the informa- 
tion given, perhaps because of their 
very inadequate concepts of a second, 
but the others became more accurate. 


PERCEPTION 
OF SYMBOLIC FORMS 
AND PICTURES 


Symbols. Perception of speech sounds 
and written words are specialized topics, 
better considered under speech and 
reading. It should be pointed out that 
the perceptual problems in learning to 
read are closely related to the discrimination 
of meaningless or abstract 
forms drawn on pieces of paper. Dis- 
crimination of such forms by matching 
techniques has been used as part of 
reading readiness programs, on the as- 
sumption that practice in the task will 
facilitate progress in reading. How 
discrimination of such forms can be im- 
proved is a problem of interest. Will 
the learning of identifying names for 
certain forms, for instance, make dis- 
crimination faster and more accurate? 
Spiker (1956) has reviewed some rele- 
vant experiments performed on chil- 
dren and given some rules for good 
methodology to be followed in experi- 
ments on “acquired distinctiveness.” 

Pictures. Although pictures of ob- 
jects have been used in intelligence tests 
and as material for the study of per- 
sonality, there has been relatively little 
research on the development of this 
kind of perception as compared with 
“real” objects. Research with a group 
of native African children (Nissen, 
Machover, and Kinder, 1935) indicated 
that these children whose culture in- 
cluded few if any pictures performed 
relatively poorly on tests that had pic- 
torially representative content. Experi- 
mental research might well follow up 
this lead. 

Some of the material discussed under 
form perception is relevant here, since 
two-dimensional drawings are in many 
cases pictures, though of varying de- 
grees of fidelity. Some attempt to di- 
mensionalize fidelity might be of use in 
organizing the rather hit-or-miss collec- 
tion of facts in this area. There is no 
systematic research on children’s per- 
ception of photographs, as compared 
with line drawings, abstract geometrical 
forms, and (at the other end of the 
scale) the real objects. 

One attempt to grade pictorial ma- 
terial can be cited. Vernon (1940) 
showed a graded series of pictures 
presented for 1.8 seconds in a tachisto- 
scope to children of various ages. The 
series progressed from a clearly “mean- 
ingful” picture with sharp contours to 
one similar in design but drawn with 
blurred, crosshatched outlines. The chil- 
dren’s descriptions of the “meaningful” 
pictures were accurate, but the blurred 
ones produced a great deal of invention. 

Recognition of incongruity in a pic- 
ture of a building (photographed so 
that it appeared to have only one side) 
was studied by Behar and Bevan 
(1956). Age was an important factor in 
sensitivity to this incongruity. The 
authors related this fact to the child’s 
ability to respond to the representation 
of a three-dimensional world in two- 
dimensional space, pointing out the lack 
of perspective in children’s drawings. 
But children’s judgments of space in 
photographs do not appear to have 
been studied where there is good fidel- 
ity. Some other factor may have been 
responsible for failure to notice the in- 
congruity. Methods other than draw- 
ing and free report are indicated. 


ATTITUDE AND PERCEPTION 


Techniques appropriate for studying 
the influence of attitudinal factors 
(need, set, emotion, etc.) have been 
discussed under question 3 (see p. 325). 
Two rather specialized problems within 
this area might be given further mention: 
the effect of attention on perception 
and the relationship between 
perception and personality. 

Attention. That perception is selec- 
tive—that one can attend or not attend 
—is a generally accepted proposition. 
It is also commonly understood that 
children’s capacity for attending increases 
with age (as shown, for instance, by 
digit span on intelligence tests). 
Here might lie a factor of great 
importance in the understanding of per- 
ceptual development, as indicated by 
increase in precision and ability to dif- 
ferentiate. Yet there seems to be little 
research on attention in children, at 
least in regard to perceiving. Experiments 
by Dallenbach (1914) with 
second graders, aged 7 to 10 years, gave 
drill in “apprehending” nine figures 
presented for five seconds on a card 
(numbers, letters, geometrical figures, 
etc., in a 3 by 3 matrix). Practice over 
a period of weeks resulted in apparently 
lasting improvement on similar tests, as 
compared with unpracticed school- 
mates. 

That training with tachistoscopic de- 
vices (“flashmeters,” for instance) re- 
sults in improved performance in the 
same situation has often been demon- 
strated (Gates, 1925; Freeburne, 1949). 
Whether this improvement transfers to 
more complex perceptual skills, such as 
reading, is a matter that has caused much 
controversy. Goins (1958) recently 
summarized the controversy and pre- 
sented fresh experimental evidence that 
tachistoscopic training does not transfer 
to reading skill. Research on the prob- 
lem obviously requires control of many 
factors, such as teacher differences and 
motivation of control groups. Validity 
of results of such experiments must
be sought in transfer situations; is it
merely specific eye-movement habits 
or fixations that are being trained or is 
it something more general? 

What an infant will attend to 
spontaneously was the subject of an 
experiment by Berlyne (1959). He used 
a fixation preference technique, like 
Staples, with paired comparisons of 
stimuli varying in albedo and in “complexity”
of figures. Differences in
albedo did not affect fixation, but
“complexity” may have, since the
preferred patterns were described as
ones with more contour. Fantz (1958)
used a similar technique. Infants as 
young as 2 months showed some con- 
sistent preferences. They tended to 
fixate a checker pattern in preference 
to a square and a bull’s-eye in prefer- 
ence to stripes. 
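
The fixation preference measure used in such paired comparisons can be sketched as a simple proportion of looking time. The function name and the looking times below are hypothetical, chosen only for illustration; neither Berlyne nor Fantz is reported here as scoring his data in exactly this way.

```python
def fixation_preference(time_a, time_b):
    """Share of total fixation time spent on stimulus A (0.0 to 1.0).

    Returns None when the infant fixated neither stimulus, since no
    preference can be computed for that trial.
    """
    total = time_a + time_b
    if total == 0:
        return None
    return time_a / total


# Hypothetical trial: 30 s on a checker pattern, 10 s on a plain square.
preference = fixation_preference(30.0, 10.0)
print(preference)  # 0.75
```

A value near 0.5 would indicate no preference between the paired stimuli; consistent departures from 0.5 across trials and infants are what the technique looks for.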

Perception and Personality. It has 
become a popular tenet in the last 
decade that personality affects percep- 
tion and also that personality can be 
studied by way of perception. It is 
therefore of considerable importance 
to clinicians to discover whether dif- 
ferences in personality are correlated 
with differences in perception. The 
method of such experiments is to give 
a battery of personality tests and then 
to seek some measures of perception, 
later correlating individual differences 
in the two. Witkin et al. (1954) made 
such a study. Personality was assessed 
by the Rorschach, TAT, figure-draw- 
ing tests, and by miniature-toy play. 
Perception was measured by a rod and
frame test, tilting room and tilting chair, 
and embedded figures. What to correlate
here is a very complex matter.
The authors were interested in “independence
of the field” in perception
and thought that differences could be 
correlated with such aspects of person- 
ality as self-concept, organization in 
play, control of impulses, “coping” 
ability, and so on. The methodology of 
correlational experiments is unavoid- 
ably much concerned with statistics, 
not the concern of this chapter. But the 
validity and meaning of the individual 
test scores correlated is obviously im- 
portant and must be independently de- 
termined. Furthermore, results are 
bound to depend to some extent on the 
choice of tests. Post hoc selection of 
differentiating scores as a basis for 
analysis is a dubious procedure some- 
times employed. Postman (1955) has 
pointed out some of the logical and
technical weaknesses of this work. 


ILLUSIONS 


In recent years there has been con- 
siderable interest in the development of 
perception of illusions (Inhelder, 1957). 
Piaget and his collaborators system- 
atically studied a great number of them. 
He divides illusions into three types: 
those that decrease with age, such as the 
Müller-Lyer illusion, those that increase
with age, such as the size-weight illu- 
sion, and a third type that increases up 
to about age 10 and then decreases, as 
does one form of the vertical-hori- 
zontal illusion. 

1 See Piaget’s Recherches sur le développement
des perceptions. Archives de Psychologie,
Genève, from 1942 on.

The various techniques used to study
illusions can be illustrated by the work
done on the Müller-Lyer illusion with
children. Binet (1895) made a systematic
study of illusions in children.
Children 7 through 14 were seated in- 
dividually at a table on which was a 
booklet with the standard, Figure A, 
on the right-hand margin. The com- 
parison figures, B, in varying sizes made 
up the pages of the booklet. Instructions 
to the S were to disregard the oblique
lines and to compare the lengths of the 
straight lines, choosing the variable 
figure that looked equal in length to the 
standard. 


[Figure: the two Müller-Lyer figures, B (left) and A (right)]


In one booklet A was 10 centimeters, 
the B’s varied from 9 to 15 centimeters, 
and the oblique lines were 4 centimeters. 
In another booklet A was 2 centimeters, 
the B’s were 1.8 to 4 centimeters, and 
the oblique lines were 0.8 centimeter. 
The distance between A and B, kept 
constant at 20 centimeters, required the 
S to move his eyes for purposes of 
comparison. The variable figures were 
presented in increasing and decreasing 
order. All the S’s chose a larger B as 
equal to A, but the illusion was stronger 
in the younger children. It was also 
found to be greater for the smaller 
figure and in the decreasing rather than 
the increasing order. 

In another experiment Binet had S’s 
compare Figure A to a series of straight 
lines and Figure B to a series of straight 
lines in order to separate the double 
effect of the different types of oblique 
lines. He found that the illusion of A 
is stronger than that of B. 

Van Biervliet (1896), working with 
children 12 to 16 years old and adults,
used a method similar to Binet’s, but he
varied the angles of the oblique lines 
as well as the length of the vertical 
lines. Taking Binet’s large Figure A as 
the standard (with oblique lines at 30 
degrees) he had five series of type A 
comparison figures. In all of the series 
the figures varied in length, but the
angles of the oblique lines differed from 
series to series (60, 90, up to 180 de- 
grees). He found the illusion stronger in 
children than in adults, but in all cases 
the more obtuse the angle, the less the 
illusion. 

Piaget and von Albertini (1950), 
working with children 5 to 9 years old 
and adults, systematically varied both 
the angle of the oblique lines and their 
length, while keeping the standard 
figure constant at 15 millimeters. Both 
types of Müller-Lyer figures were used
independently. All figures were placed 
horizontally, with comparison figures 
presented successively at the right of 
the standard and in concentric order 
(first a figure much larger than the 
standard, then one much smaller, etc.). 
The authors found that (1) as the 
height of the oblique lines increased, the 
illusion decreased; (2) in type A figures, 
as the length of the oblique lines in- 
creased, the illusion first increased, then 
decreased, and for children even be- 
came negative; (3) in figures of type 
B, as the length of the oblique lines increased,
the illusion increased; (4) in
all cases the illusion decreased with age. 

A somewhat different technique was 
used by Wapner and Werner (1957), 
working with children between 6 and 
19 years. They combined into one 
horizontal pattern both types of Müller-Lyer
figures (~+—). The part on the
left was the standard and was 75 millimeters
long. The S, who sat 3 feet from
the pattern, could adjust the part on
the right (by means of a knob) until
it looked equal to the standard. Four
trials were used (two with the initial
setting of the variable at 25 millimeters
and two with the initial setting of
the variable at 125 millimeters). The
order of the settings was small, large, 
large, small. 

In this study, as in all the others, the 
results show a considerable decrease of 
the illusion with age. 
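
The adjustment procedure lends itself to a simple summary score: the mean of the S’s settings, expressed as a percentage deviation from the standard. The sketch below assumes this scoring rule and uses invented settings; it is not taken from Wapner and Werner’s report, and the sign of the score depends on which half of the pattern serves as the standard.

```python
from statistics import mean


def illusion_percent(settings_mm, standard_mm):
    """Mean deviation of the adjusted settings from the standard,
    expressed as a percentage of the standard length."""
    return 100.0 * (mean(settings_mm) - standard_mm) / standard_mm


# Hypothetical settings for four trials against a 75 mm standard,
# with starting positions in the order small, large, large, small.
settings = [90.0, 96.0, 94.0, 92.0]
print(illusion_percent(settings, 75.0))  # 24.0
```

Averaging over trials that start below and above the standard is one way to balance out the effect of the starting position on the final setting.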


DEVELOPMENT AND 
ENVIRONMENTAL OPPORTUNITY 


Development under controlled en- 
vironmental conditions has often been 
the subject of study, especially in ani- 
mals. Generally, the behavior observed 
has been locomotor, emotional, or social.
The development of perception under 
such conditions is no less interesting and 
instructive. When human infants are 
to be the subjects, it is not feasible, as 
a rule, to create special types of en- 
vironmental deprivation, but nature 
provides such cases occasionally and it 
is important that good methods should 
be followed in studying them. 

Deprivation of Stimulation. The 
most spectacular cases of deprivation 
are those of the blind-born restored to 
sight by surgery. Extensive reports of 
these cases are found in Bourdon (1902, 
Ch. 13) and Senden (1932). Since very 
different conclusions have been drawn 
from studying these reports, caution is 
necessary in interpreting them. Many 
of them are very old, and the testing 
procedures were informal and not reported
in detail. Dennis (1934) discussed
the inadequacies of the research
procedures. One source of disagreement
may lie in the use of verbal identification
(naming) as an indicator of discrimination.
It is not reasonable to suppose
that a word learned to identify
stimulation coming from other senses 
would automatically transfer to visual 
stimulation present for the first time 
(i.e., the fact that a child can identify 
a marble by touch does not mean that 
he could name it when it is shown him 
for the first time). The most suitable 
methods of testing discrimination would 
probably be those that have been used 
successfully with young children, such 
as matching methods. The task in- 
volved should be as easy as possible 
because of the emotional upsets, inter- 
ference from old habits, and nystagmus 
that are characteristic of these cases. 

Experimental production of depriva- 
tion, such as dark-rearing, is possible 
with animals. Riesen’s (1958) experi- 
ments with infant chimpanzees are the 
most relevant here and should be con- 
sulted for methods of control and 
testing (see Chapter 7). 

A few studies of the effect of a 
restricted environment on human in- 
fants are available, though they contain 
little information about discrimination. 
Dennis and Dennis (1941) raised a pair 
of twins from the beginning of the 
second month to the end of the four- 
teenth in relative confinement. The in- 
fants never left the nursery (except 
for medical check-up), were cared for 
solely by two adults, and remained in 
their cribs except for necessary care. 
There were no pictures or decoration, 
a screen was placed between the two
cribs, and they were given no toys until
the forty-ninth week. For the most
part their development was normal, as 
far as could be observed, but they were 
retarded in visually directed reaching. 
This ability developed very rapidly 
when practice was allowed. The authors 
thought that learning played a part in 
the development of “autogenous” 
responses—those that appear without 
training but are practiced by the child 
himself. 

Spitz’s study (1945) of “hospitalism” 
compared children in two institutions 
during the first year. Again, there is 
little information in regard to dis- 
criminatory abilities. It is not clear how 
much of the children’s retardation was 
caused by inadequate “mothering” and 
how much by the restricted environ- 
ment. 

A recent report by Dennis and 
Najarian (1957) of infants reared in a 
foundling home in Beirut in a very 
limited environment discusses the prob- 
lems of testing these subjects and in- 
terpretation of the results. Restriction 
of learning opportunities resulted in a 
depressed developmental index in the 
last 9 months of the first year but ap- 
parently did not result in a generally 
poor performance by 4½ to 6 years. It
is to be hoped that future studies of such 
groups can be planned to study per- 
ceptual development with specialized 
laboratory methods as well as a routine 
test schedule. 

Another type of environmental re- 
striction is found in children with 
organic impairment of sight and hear- 
ing. Education and development of 
blind and deaf children has long been 
a subject of special study and is too
voluminous to treat in detail. It was at
one time supposed that deprivation of 
one sense brought about “compensa- 
tion” or extraordinary superiority of 
discrimination by means of others. That 
any such compensation is probably the 
result of training is now a general 
opinion (Hayes, 1934). How superior
acuity is achieved in such cases is a
matter of extreme interest for perceptual
development and deserves
further study. The studies of obstacle
avoidance in the blind by echolocation 
(Griffin, 1958, Ch. 12) are an impres- 
sive example of the knowledge to be 
obtained by expertly planned research 
in this area. 

Cases of early blindness, later blind- 
ness (occurring after four years), and 
sighted S’s were compared by Drever
(1955) to discover whether early deprivation
is a more serious handicap
than later. The S’s were tested in tactual 
recognition of shapes, orientation and 
forms on a peg board, and perception 
of straightness of a curved rule. Results 
differed for the tests. Sighted S's and, 
next, late blind were better at recogni- 
tion of shapes, but the blind S’s had a 
more objective perception of tactual 
straightness.

Enrichment of Stimulation. The 
effect of a specially enriched environ- 
ment or extra opportunities for practice 
on the development of discrimination is
not only interesting theoretically but of 
obvious importance educationally. The 
idea of developing “sensitivity” by pro- 
viding special opportunities for prac- 
tice is an old one. The “Montessori 
Method” specialized in such practice, 
and an account of it is instructive to 
those interested in perceptual learning,
though it was a teaching not a research
program. The basis of the program was 
the “education of the senses,” which 
had as its aim “the refinement of the 
differential perception of stimuli by
means of repeated exercises” (Montessori,
1912, p. 173). The exercises consisted
of observing and ordering series
of graded stimuli, such as sandpapers
of graduated roughness, weights, pieces
of wood of varying thickness and 
height. 

The color series was most interesting. 
It consisted of eight “tints” (black, red, 
orange, yellow, green, blue, violet, and 
brown), with eight gradations of in- 
tensity for each. First the child was 
asked to find duplicates for only three 
“strong colors.” Later, he found matches 
for all eight. Then the lighter tones 
would be presented, and the child was 
shown how to arrange them in order of 
gradation. The procedure was con- 
tinued until the child could order all 64 
samples. Children of three were said to 
be able to “put all of the tints into 
gradation.” Some of the principles 
given are interesting in the light of 
current research. The procedure was 
always from “few stimuli, strongly con- 
trasting, to many stimuli in gradual dif- 
ferentiation, always more fine and 
imperceptible” (p. 184). Activity and 
self-correction on the part of the child 
were thought very important. Transfer 
from simple kinds of preliminary train- 
ing was stressed; when forms and letters 
were taught, for instance, the child be- 
gan by following the contours with 
his finger before attempting visual rec- 
ognition. The importance of early ex- 
perience, so much stressed nowadays, 
was also emphasized. “It is necessary to
begin the education of the senses in the
formative period” (p. 221). 

The validity of these techniques was
not tested with controlled experiments,
but such experiments can be done and 
are beginning to be. An example of an 
experiment on transfer from pretraining 
is one by Jeffrey (1958) with tonal 
frequencies. His criterion task was 
learning to press a button on the left in 
response to a low tone (128 cycles per 
second), one on the right to a high 
tone (1152 cycles per second), and, 
after mastering this problem, to do the 
same for a closer pair (256 and 384 
cycles per second). Three groups of 
kindergarten children took part. Group 
I did only the criterion task. Group
II had pretraining consisting of trying 
to sing the two tones and were then 
passed to the criterion task. Group III 
had pretraining by striking the ap- 
propriate key on a piano (the two keys 
were marked with red tape) and were 
then passed to the criterion task. Only 
one of the seven S’s in Group I was able 
to learn the criterion task in 72 trials.
In Group II five of seven learned it, 
and in Group III all did. One S in 
Group II and one in Group III trans- 
ferred to the closer pair of tones. None 
in Group I did. The immediate com- 
parison between a response-produced 
tone and the stimulus tone accentuates 
the correctness of a response and also 
helps “define the continuum,” Jeffrey 
thought. 
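
With groups of seven, the reader may wonder how much weight such counts can bear. One way to check, offered here only as an illustration and not as an analysis reported by Jeffrey, is an exact one-sided test on a 2 by 2 table of learners and non-learners. The function name is hypothetical; the counts are those given above for Groups I and III.

```python
from math import comb


def fisher_one_sided(a, b, c, d):
    """Exact probability, with the table margins fixed, that the first
    row contains `a` or fewer successes.

    Table layout: [[a, b], [c, d]] = [[row 1 successes, row 1 failures],
                                      [row 2 successes, row 2 failures]].
    """
    row1, row2 = a + b, c + d
    successes = a + c
    total = row1 + row2
    smallest = max(0, successes - row2)  # fewest successes row 1 can hold
    favorable = sum(
        comb(row1, k) * comb(row2, successes - k)
        for k in range(smallest, a + 1)
    )
    return favorable / comb(total, successes)


# Group I: 1 of 7 learned the criterion task; Group III: 7 of 7 learned it.
p = fisher_one_sided(1, 6, 7, 0)
print(round(p, 4))  # 0.0023
```

The same function can be applied to any other pair of groups by substituting their counts.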

Research in this area will certainly 
increase. It is to be hoped that future 
experiments will include tests of dis- 
crimination other than learning or nam- 
ing. It would be interesting, in fact, to 
compare the effects of various types
of training on discrimination measured
in a number of different ways.
Ordering tests are a promising pos- 
sibility, since they can be extended to 
almost any length and degree of dif- 
ficulty. One problem to be pondered by 
the investigator is, as always, whether he 
is actually measuring “discriminative
ability” or whether the real variable is
the child’s comprehension of the task. 


Summary and Evaluation 


A brief final appraisal of methods for 
studying perception in children leads 
to the conclusion that there is no dearth 
of techniques or proper conventions for 
using them. Looking at the area as a 
whole, one is struck by the fact that its 
weaknesses are not methodological; they 
seem to arise, rather, from a theoretical 
lack. The methods are rich enough; 
sometimes an experimenter’s ingenuity 
in devising substitutes for too “adult” 
methods is astonishing. 

With the simpler discriminations, 
such as color comparisons, there is a 
good array of facts, derived from a 
variety of methods. Success is already
notable or is just around the corner.
With the more complex discriminations, 
such as physiognomic qualities and in- 
terpersonal relations, methods are need- 
ed. The difficulty of describing the 
stimuli in these cases makes the problem 
very hard. Good research will have to 
cope with this difficulty. 

As for the real faults in this field, we 
might place first the rarity of the truly
developmental approach. At least half
the studies the writers have been 
through were just experiments that hap- 
pened to have been done with children 
—perhaps because the children were 
conveniently at hand in a nursery 
school or institution. The methods em- 
ployed in such cases are not tools for 
the solution of a problem in development.
The choice instead is likely to be
merely opportunistic. Often, in a given
area of research, there are many facts 
and interesting methods, but they seem 
to add up to a motley, rather hit-or- 
miss collection. This is obviously not 
the fault of any one person who made a 
contribution. However, it warns the
would-be experimenter that a careful
and systematic definition of his prob- 
lem should come first, lest another bit 
of unrelated information be the result. 

Finally, this field would benefit very 
much by programmatic research. Long- 
range planning of a program in which 
the experiments are related and the 
designs thought out in terms of all the 
significant variables and dimensions 
would begin to solve problems, not just
create them. Piaget’s laboratory, with
its long series of integrated studies, one
developing from another and yet
stemming from a prearranged pattern, 
is an impressive example here. The ex- 
periments have their faults, no doubt. 
One wishes for fewer words, more 
data, and an occasional test of sig- 
nificance, but developmental psycholo- 
gists in this country should certainly let 
themselves profit from the systematic 
nature of his program in planning their 
own research.


Research Methods
in Children’s 
Learning 


Charles C. Spiker 
State University of Iowa 


After nearly two decades of neglect, 
child psychologists currently recognize 
the empirical study of learning in chil- 
dren as one of the urgent needs for an 
understanding of child behavior and 
for the advance of general psychology. 
The lack of emphasis on children’s 
learning, from the 1930's to the 1950's, 
resulted from several factors. The most 
decisive was surely the difficulty in de- 
veloping and adapting appropriate ap- 
paratus, methods, and procedures. It is 
to be hoped that an anthology of tech- 
niques, which have been used in such 
studies, will alleviate some of the prob- 
lems of potential investigators and in- 
directly stimulate further research in 
this area.

Although the purpose of the present 
chapter is not to extol the significance
of research on the learning of children, 
perhaps it is not entirely inappropriate 
to summarize the main concerns of 
previous investigators. Their primary
interest was to provide normative, developmental
data on children’s learning.
These investigators recognized that behavior
changes resulting from learning
are among the most obvious phenomena 
accompanying growth and develop- 
ment. They subscribed to the belief that 
any serious attempt to describe the 
behavioral changes accompanying increase
in chronological age must include
behavioral changes resulting from
learning. Perhaps the immensity of the
task they set for themselves discouraged
continuation of this approach to chil- 
dren’s learning. At any rate, the litera- 
ture contains data on only a few of the 
vast number of tasks that children 
normally learn. Principles governing the 
learning of children are still required 
to fill in the developmental picture. 

Somewhat later the belief that child- 
hood learning has profound effects on 
adult behavior and personality be- 
came prevalent among general and child 
psychologists. This assumption has been 
crucial not only in the comprehensive 
accounts of human behavior advanced 
by psychoanalytic writers but also in 
the more restricted formulations of 
stimulus-response behavior theorists. 
Although this conviction did not lead 
to immediate investigations of children’s 
learning, it has contributed to the recent 
renewal of interest in the area. 

Many recent experiments in children’s 
learning have been concerned with at- 
tempts to extend theories of behavior 
based primarily on infrahuman and 
adult human behavior. Some investiga- 
tors appear to have selected children for 
subjects because the study of children 
enables them to test certain principles 
of the theories to which they adhere. 
They do not assume that their task is
merely to demonstrate that principles
derived from studies of other subjects
are applicable to children. Rather, they
seek to discover additional variables and
laws and new combinations of old
ones which are required for a more
complete understanding of behavior.
They are simultaneously concerned
with discovering the limitations of principles
based on infrahuman behavior
and with methods of extending these
principles to human behavior. This approach
promises to place the study of
children’s learning within a framework
that extends beyond that of traditional
child psychology to include that of general
psychology.


PROBLEMS 
IN DEFINING LEARNING 


Whether one discusses the method- 
ology for studying learning or the 
results of learning studies, some con- 
sideration must be given to the meaning 
of “learning.” The term appears both as 
a “chapter heading” and as a technical 
term. As a chapter-heading word, it 
provides a convenient name for the 
variety of phenomena that the text- 
book writer, for reasons of his own, has 
decided to discuss in a single place. As a 
technical, “theoretical” concept, its 
meaning is usually about as explicit as
are those of the other concepts that 
appear in the theory. Its meaning will 
vary somewhat from one theory to 
another, and a definition acceptable to 
one theorist will probably be unac- 
ceptable to others. Contiguity theorists, 
for example, prefer to define learning 
as the strengthening of a stimulus-response
association, which results when
the response is made in the presence of 
the stimulus. Reinforcement theorists 
define learning as the strengthening of 
the association when the response is 
made in the presence of the stimulus 
and is followed by a reward. Other 
theorists (e.g., Lewin, 1954, p. 931) 
consider learning a lay concept that has 
no place in psychological theory and 
even consider its use as a chapter head- 
ing more misleading than helpful. The 
different meanings that have been given 
learning in different theories result also 
in disagreement about the classification 
of empirical phenomena. Insightful or 
sudden solution of a problem, for ex- 
ample, would be considered new learn- 
ing by some psychologists; others would 
explain it as the transfer of previous 
learning to relatively new situations. 
Evaluation of the methodology of a 
given study is made difficult by the 
different theoretical meanings of learn- 
ing. Consider the following situation. 
The learning concept in Hullian theory 
(Hull, 1943) is “habit strength,” defined
in classical conditioning as a positive
function of the number of paired
presentations of the conditioned (CS) 
and unconditioned stimulus (US). 
Habit strength is assumed to be posi- 
tively related to the relative frequency 
of conditioned responses (CR’s) to the 
CS. But the theory also asserts that 
other variables, such as drive and in- 
hibitory potential, affect the relative 
frequency of CR’s. Consequently, a 
Hullian investigator is required to control
the other theoretical variables if he
wishes to determine the exact relationship
between habit strength and frequency
of CR’s. The implication is that
at present a study of learning can be
methodologically sound within the 
framework of one theory but method- 
ologically faulty within the framework 
of others. 
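
The point about uncontrolled variables can be made concrete with a small numerical sketch. The negatively accelerated growth function, the rate constant, and the simplified combination rule (drive multiplies habit strength, and inhibitory potential is subtracted) are assumptions adopted here for illustration in the general spirit of Hullian theory; they are not a statement of Hull’s own equations or constants.

```python
import math


def habit_strength(n_pairings, rate=0.1, ceiling=1.0):
    """Assumed growth curve: rises with the number of CS-US pairings
    and levels off at `ceiling`."""
    return ceiling * (1.0 - math.exp(-rate * n_pairings))


def effective_potential(n_pairings, drive, inhibition=0.0):
    """Assumed combination rule: drive multiplies habit strength and
    inhibitory potential is subtracted from the product."""
    return drive * habit_strength(n_pairings) - inhibition


# The same 10 pairings (identical habit strength) under two drive levels.
low_drive = effective_potential(10, drive=1.0)
high_drive = effective_potential(10, drive=2.0)
print(high_drive > low_drive)  # True
```

Under these assumptions two groups with identical training would differ in response frequency if drive differed between them, which is why such variables must be held constant before frequency of CR’s is read as a measure of habit strength.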

The learning concept has been used 
in many different ways in psychological 
literature, and this lack of agreement 
has been inconvenient. Of far more 
serious consequence than this incon- 
venience are the fruitless controversies 
that have arisen from a writer’s failure 
to state explicitly how the term was to 
be used in a given context. 

Despite the disagreement about the 
exact meaning that “learning” ought to 
have, there is a considerable com- 
monality of meaning. Most psychol- 
ogists agree that learning involves a 
change in behavior, but they exclude 
changes resulting from drugs, fatigue, 
increase in age, and changes in external 
stimulation. Fortunately, this commonality
makes possible a useful discussion
of learning without a precise preliminary
definition. The present writer
has adopted a statement by McGeoch 
and Irion (1952, p. 5): “Learning, as 
we measure it, is a change in per- 
formance which occurs under the con- 
ditions of practice.” Although this 
statement justifies the inclusion of each 
study mentioned in the present chap- 
ter, it does not provide criteria for 
evaluating these studies within each of 
the several theoretical frames of reference.
But then no reasonable allotment
of space would be adequate for
such a task. Such methodological treatment
must be left for articles specifically
concerned with theory construction and 
evaluation. 


SCOPE OF THE CHAPTER


This chapter is concerned with
methods that have been used in researches
on children’s learning. “Methods”
has been interpreted to include
procedures, apparatus, materials, and, in
some cases, the experimental design. To
conserve space, without unduly restricting
the number of methods presented,
three principles were used to govern
the selection of material. First, methods
that are highly similar to those used
with adults and infrahuman subjects are
discussed primarily with respect to their
adaptations to children, and the reader 
is referred to other methodological discussions
already available (Hilgard and
Marquis, 1940; McGeoch and Irion, 
1952; Hovland, 1951; Brogden, 1951; 
Spence, 1956; and Hilgard, 1951). 
Second, minor variations are not ex- 
tensively treated. References to articles 
in which such variations were used are 
included, however, and the reader is 
urged to examine them carefully. 
Finally, many details of apparatus and 
procedure have been omitted. Although 
no detail of method is too minor to 
warrant careful consideration, a com- 
plete description of the methods utilized 
in each study would seriously restrict 
the number of learning situations ex- 
amined. 

As in other areas of science, the 
simpler situations have received the 
most careful attention. Methods for 
studying classical and instrumental con- 
ditioning, discrimination learning, and 
rote learning are more advanced and 
more rigorous than those for the study 
of complex motor learning, the learning
of academic content, and reasoning.
The present discussion emphasizes the 
stal procedures, on the assumption 
that rapid development of new methods
for the study of complex learning will
soon make present procedures obsolete.


Conditioning 


Conditioning is generally regarded as
the simplest form of learning. Although
several types and subtypes have been
defined, all have one characteristic
in common: during the course of conditioning,
the subject begins to make a
response, the CR, to a stimulus, the CS,
which did not previously elicit that
response. The conditions under which
the association between the CS and the
CR is established provide the criteria for
distinguishing among different types of
conditioning experiments. Hilgard and
Marquis (1940, p. 27) distinguished between
classical conditioning and instrumental
conditioning. In classical
conditioning the CS is presented to the 
subject, followed shortly by the un- 
conditioned stimulus (US). Thus in 
eyelid conditioning a flash of light (CS) 
is presented approximately one half 
second before an air puff (US) is de- 
livered to the eye. The air puff elicits 
an eyelid closure, the unconditioned re- 
sponse (UR). After several CS-US 
presentations, the eyeblink begins to 
occur after the CS onset but before 
the US onset; that is, the response be- 
gins to occur during the CS-US in- 
terval. The anticipatory eyeblink is 
called a conditioned response (CR). 
The interval between the CS and US is
usually held constant, with both stimuli
occurring on each trial. The occurrence 
of the CR does not result in any change 
in procedure. 
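
The timing rule just described, by which a response counts as a CR only if it falls within the CS-US interval, can be sketched as follows. The function name and the latencies are hypothetical, and the half-second default simply follows the eyelid example above; practical scoring rules usually add further criteria that are omitted here.

```python
def classify_eyeblink(latency_s, cs_us_interval_s=0.5):
    """Classify one eyeblink by its latency measured from CS onset.

    A blink before the CS is unrelated to the trial; a blink after CS
    onset but before US onset is scored as a conditioned response (CR);
    a blink at or after US onset is scored as the unconditioned
    response (UR) to the air puff.
    """
    if latency_s < 0:
        return "before CS"
    if latency_s < cs_us_interval_s:
        return "CR"
    return "UR"


# Hypothetical latencies (seconds from CS onset) on three trials.
for latency in (0.62, 0.58, 0.35):
    print(latency, classify_eyeblink(latency))
```

The relative frequency of trials scored "CR" over a block of trials is the kind of measure referred to above as the frequency of CR’s.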

In instrumental conditioning the CS is 
presented, and, if the subject makes the 
CR, he is given a reward or is permitted 
to escape or avoid a noxious stimulus.
A conventional example concerns the 
rat in a box containing a lever, near 
which is a food cup. The sight of the 
lever constitutes the CS and pushing the 
lever is the CR. If the rat pushes the 
lever, a pellet of food is delivered into 
the food cup. Thus the CR is instru- 
mental in obtaining the food reward. 
An analogous situation may be arranged 
for children. The child may be placed 
in a room in which there is a lever that 
can be pressed in order to obtain a toy, 
a piece of candy, or some other attrac- 
tive object. 

The experimenter ordinarily arranges 
the instrumental situation so that the 
initial occurrence of the CR to the CS 
is highly probable. Thus the experi- 
menter may place food on the lever to 
increase the likelihood that the rat will 
press the lever. An attractive toy may 
be tied to the child’s lever to speed the 
child’s manipulation of the lever, or the 
experimenter may simply instruct the 
child to push the lever “to see what 
happens.” These procedures serve a 
function analogous to that of the US 
in the classical conditioning experiment. 
The difference is mainly one of de- 
gree—the US of classical conditioning 
is selected because it elicits the UR with 
a high reliability; the experimenter’s 
techniques in instrumental conditioning 
are not usually so effective in eliciting
the response at the beginning of the experiment.

Both classical and instrumental con- 
ditioning have been further subdivided, 
according to whether the stimulus that 
follows the CR satisfies an appetitional 
need or is a noxious stimulus (Spence, 
1956, p. 38ff.). Thus classical condition- 
ing is called classical reward condition- 
ing if the US is food or some other goal 
object needed by the subject and classi- 
cal defense conditioning if the US is a 
noxious stimulus. Similarly, instrumental 
conditioning is called instrumental re- 
ward conditioning if the CR is followed 
by a goal object, instrumental escape 
conditioning if the CR permits the 
subject to escape a noxious stimulus, 
and instrumental avoidance condition- 
ing if the CR results in the subject's 
avoidance of the noxious stimulus. 

All conditioning involves the estab- 
lishment of an association between the 
CS and the CR if the CS is a stimulus 
that did not originally elicit the CR. At 
first glance, it might appear that it 
would be easy to demonstrate condi- 
tioning in a given situation. For ex- 
ample, in the classical conditioning of 
the eyeblink the CS could be presented 
to the subject and followed by an air 
puff to the eye. If the subject began 
to make anticipatory eyeblinks after 
several trials, one might be tempted to 
conclude conditioning had been estab- 
lished. The demonstration would not be 
conclusive, however. It is conceivable 
that the subject might begin to blink 
at the CS after a series of presentations 
of the CS alone. The subject might also 
blink at the CS after a succession of 
presentations of the air puff alone. 
Neither of these results would be attributed
to conditioning. Some of the
earlier studies of conditioning failed to
control for pseudo-conditioning. It is
now customary to provide such a control.
In classical conditioning, for example,
one group receives the experimental
conditions, a second group may
receive the CS alone, and a third group,
the US alone. The “backward conditioning”
control is also frequently
applied. With this procedure, the control
group receives the CS and US on
each trial, but the US is presented first.
This temporal arrangement of the
stimuli results in little or no conditioning,
and it controls simultaneously for
the two factors that otherwise require
two groups of subjects. Regardless of
the method of control used, the experimental
group must be compared to
the control group in order to ascertain
that conditioning was established.
The comparison requires the application
of statistical procedures.

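
The text does not say which statistical procedure is to be used for
the comparison. As one possible illustration only, the sketch below
applies an exact one-sided permutation test to the number of CR's
made by each subject in an experimental and a control group. The
function name, the choice of test, and the sample scores are all
assumptions made for the example, not part of the procedures
described above.

```python
from itertools import combinations


def permutation_p_value(experimental, control):
    """One-sided exact permutation test on the difference in group means.

    Returns the proportion of all possible regroupings of the pooled
    scores whose mean difference (experimental minus control) is at
    least as large as the difference actually observed.
    """
    if not experimental or not control:
        raise ValueError("both groups need at least one score")
    pooled = list(experimental) + list(control)
    n_exp = len(experimental)
    n_ctl = len(control)
    total = sum(pooled)

    def mean_difference(exp_sum):
        return exp_sum / n_exp - (total - exp_sum) / n_ctl

    observed = mean_difference(sum(experimental))
    regroupings = 0
    as_extreme = 0
    for indices in combinations(range(len(pooled)), n_exp):
        regroupings += 1
        exp_sum = sum(pooled[i] for i in indices)
        # Small tolerance guards against floating-point rounding.
        if mean_difference(exp_sum) >= observed - 1e-12:
            as_extreme += 1
    return as_extreme / regroupings


# Hypothetical CR counts on test trials for three subjects per group.
p = permutation_p_value([5, 6, 7], [1, 2, 3])
```

The exact enumeration is practical only for small groups; with larger
samples a random subset of regroupings, or a conventional test, would
ordinarily be substituted.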
Analogous controls for instrumental 
conditioning are not as readily attained. 
Indeed, one of the present concerns of 
students of instrumental conditioning 
is whether the increase with trials in the 
relative frequency of the CR (or in the 
speed of the CR) represents condition- 
ing or an increase in motivation. Con- 
sider a child who pushes a button when 
a light (CS) is presented in order to 
obtain a reward. Does the increase in 
his speed of button-pushing reflect a 
strengthening of the association between 
the CS and the button-pushing re- 
sponse? Or does it reflect an increase in 
the child’s motivation as he becomes 
more certain that the reward follows the 
button-pushing? Since learning is involved
in either case, the question is not
whether learning occurs; it has to do
with what is learned. The question,
therefore, is not merely methodological;
it is a matter for empirical research.


CLASSICAL CONDITIONING 


The first investigations of conditioning
in children were those of Krasnogorski
(1925) in which he used a
modification of the Pavlovian pro- 
cedures. He observed the motor com- 
ponent of the total reaction to food 
(swallowing and mouth movements) as 
well as the glandular reactions (salivation).
The early American interest in
classical conditioning in children centered
around the question of the conditionability
of neonates (Marquis,
1931; Wenger, 1936). Many of the 
methods that have been developed are 
more appropriate for infants than for 
older children. Although the early
studies provide many useful method- 
ological suggestions, few of them would 
serve as models for contemporary con- 
ditioning experiments. 

General Considerations. There are 
some general requirements for classical 
conditioning that obtain, regardless of 
the specific experiment. One of the 
more important of these requirements 
concerns the selection of the US. As 
previously noted, the US determines 
whether the experiment is to be classical 
reward or classical defense. Care should 
be taken to select a US that will elicit 
the UR consistently and that will not 
permit the subject to adapt to it in 
the later stages of conditioning. The 
most frequently used US's in classical 
conditioning experiments have been 
such potent stimuli as food for hungry
subjects, air puffs to the eye, acids or
salts in the mouth, and electric shock. 
The selection of the US also affects the 
selection of the CR. The response to 
the US is rarely a single, discrete re- 
sponse; usually it is a constellation of 
responses, and the experimenter must 
decide which of these components of 
the UR he wishes to observe and record 
as CR’s. For example, presentation of 
food to the infant elicits salivation, 
chewing movements, lip and mouth 
movements, swallowing, and sucking. 
It may also bring about a reduction in 
the frequency and intensity of crying, 
general bodily activity, and orienting 
behavior (Kantrow, 1937). The ex- 
perimenter must decide which of these 
behavioral changes he wishes to observe 
and record. His decision will be 
governed in part by the type of apparatus
and materials he has available
and by the degree of precision his 
purpose requires. It will also be gov- 
erned by psychological considerations. 
He will note that not all behavioral changes
that occur to food in the mouth are
possible as anticipatory responses to
food. The subject cannot literally
swallow or chew food or suck milk into 
his mouth in the absence of food or 
milk, although he may make swallow- 
ing, chewing, and sucking movements 
similar to those made with food. The 
experimenter will also note that some 
responses are more sensitive to change 
than others. For example, since infants 
do not cry incessantly, even when 
hungry, noncrying would not be a 
sensitive measure of conditioning. 

The selection of the CS is also important
and is made only after consideration
of the type of subject to be used.
Assurance that the subjects normally
perceive the CS can be obtained from 
preliminary evidence that they make 
consistent responses to it. This is not 
always easily demonstrated in the case 
of inarticulate subjects such as infants. 
The CS is also selected so that it does 
not initially elicit that aspect of the UR 
which is recorded as the CR. The 
elicitation of all or part of the UR by 
the CS is particularly likely if the CS 
has one or more dimensions in common 
with the US. For example, if the CS
is a tactile vibrator and the US is a 
moderate shock, there could be some 
question as to whether anticipatory 
responses to the CS represented con- 
ditioning or merely generalization of 
the UR from the shock to the vibrator.

Although the interval between the 
onset of the CS and the onset of the 
US (the CS-US interval) affects the 
rate of conditioning, the exact relationship
is not known for each type of conditioning
situation (Hovland, 1951, p. 615;
Brogden, 1951, p. 575). Since one
ordinarily wishes to select the interval 
that is optimal for conditioning, the 
choice of the appropriate interval poses 
a problem. The interval that is optimal 
for conditioning, as evidenced by the 
subsequent resistance of the CR to ex- 
tinction or by the promptness with 
which it begins to appear on test trials 
when the US is omitted or delayed, may 
not maximize the probability of ob- 
taining anticipatory CR’s on each trial. 
This is to say that a CS-US interval that 
maximizes the rate of conditioning may 
be shorter than the normal latency of 
the CR. Current evidence suggests that 
the shorter CS-US intervals generally 
produce faster conditioning (Brogden,
1951, pp. 575ff.). However, if the typical 
latency of the CR is longer than the
selected CS-US interval, it is necessary 
to obtain measures of the CR on trials 
other than the conditioning trials, e.g.,
by presenting the CS alone periodically
during the conditioning phase, by interspersing
trials in which the US is
delayed, or by determining resistance 
to extinction following conditioning. 

The type of subject used in classical
conditioning is undoubtedly of great
importance. Classical conditioning of
human subjects has not been extensive
thus far, except for conditioning of the 
galvanic skin response (GSR) and the 
eyeblink. The GSR is regarded as involuntary,
meaning that its occurrence
or nonoccurrence is not normally subject
to instructional control. The eyeblink
is classified as semivoluntary,
since it is subject to instructional con- 
trol under some conditions but not 
under others. Numerous complications 
arise in the conditioning of voluntary 
responses in articulate subjects. Even 
in eyelid-conditioning experiments, 
complex criteria have been devised to 
eliminate voluntary responses and vol- 
untary responders. 

The deliberate study of the effects 
of verbal processes on conditioning is, 
of course, a legitimate problem. But if 
one wishes to study human condition- 
ing in order to formulate laws compar- 
able to those obtained from infrahuman 
subjects, one must attempt to prevent 
the contamination of the data by the 
verbal processes. There are several pos- 
sibilities for accomplishing such con- 
trol. The experimenter may use human 
infants, on the assumption that they do 
not have extensive verbal control over
their responses. He may also use older
subjects but must restrict his observations
to responses that are normally involuntary
(e.g., GSR, salivation, pupillary
reflex, and heart rate). He may
also observe responses in older subjects, 
which, although voluntary, have a 
minimal proprioceptive feedback (e.g., 
minor movements of the fingers, meas- 
ures of muscle potential, and eyeblink). 
The latter possibility requires the use
of highly refined methods for measuring
the responses. The writer’s experience
with preschool children
suggests that when such responses as
button-pushing, lever-pulling, and bulb-squeezing
occur as anticipatory responses
they tend to be viewed as
erroneous by the subjects and inhibited. 
This voluntary inhibition masks the de- 
velopment of conditioned responses that 
can be observed under more subtle ex- 
perimental arrangements. 

Response Measures. One of the most 
frequently used response measures in 
classical conditioning is the number of 
CR’s that occur during the CS-US 
interval. In some cases it is desirable to 
correct this measure for spontaneous 
occurrences of the response. Thus, if 
the experimenter has recorded the num- 
ber of sucking responses that occur in 
a CS-US interval of five seconds, it is 
advisable to compare this measure with 
the number of sucking responses that 
occur in five-second intervals during 
which the CS is not presented (Kant- 
row, 1937). This procedure is par- 
ticularly appropriate when different 
experimental conditions result in dif- 
ferent probabilities of spontaneous 
responding. Thus, in a study designed 
to compare the effects of different CS-US
intervals on conditioning, the number
of “CR’s” for subjects given the
longer intervals may be inflated by 
spontaneous responses (McAllister, 
1953). 
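
The correction for spontaneous responding can be sketched in a few
lines. The example assumes that the correction is made by subtracting
the mean number of responses observed in control intervals of the
same length from the number observed in the CS-US interval; the text
says only that the two measures should be compared, so the
subtraction, the function name, and the sample counts are
illustrative assumptions.

```python
def corrected_cr_count(cs_us_count, control_counts):
    """Responses in the CS-US interval minus the mean number of
    responses in equal-length intervals without the CS."""
    if not control_counts:
        raise ValueError("at least one control interval is required")
    baseline = sum(control_counts) / len(control_counts)
    return cs_us_count - baseline


# Hypothetical record: 6 sucking responses in a five-second CS-US
# interval; 1, 2, and 3 responses in three five-second control intervals.
score = corrected_cr_count(6, [1, 2, 3])
```

A score near zero would indicate that responding in the CS-US
interval is no more frequent than spontaneous responding.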

The latencies of anticipatory re- 
sponses have also been used as measures 
of the strength of conditioning (Hilgard 
and Campbell, 1936). The practice 
is apparently based on the finding that 
the latencies of anticipatory responses 
that occur late in conditioning are 
shorter than those that occur early. 
Pavlov’s work, however, indicated that 
the latency of the anticipatory response 
is dependent on the length of the CS- 
US interval. The subject learns to delay 
the anticipatory response until the end 
of the CS-US interval if the interval 
is relatively long (Pavlov, 1928, p. 149). 
Perhaps, then, the latency of the an- 
ticipatory response should be considered 
an index of the strength of conditioning 
only if the CS-US interval is slightly 
longer than the average latency of the 
CR. Even under this restricted condition,
Spence (1956, p. 70ff.) reports
latency data from eyelid conditioning 
that show marked irregularities as a 
function of the number of condition- 
ing trials. 

Response magnitude, measured by 
scales of such quantities as volume of 
saliva secreted, length of a movement, 
amount of electrical resistance gen- 
erated, and amount of pressure exerted, 
has also been used as an index of 
strength of conditioning. Studies of 
GSR conditioning have also relied 
heavily on amplitude measures of some 
type. Since most eyelid conditioning 
experiments report the results in terms 
of frequency of the CR’s, the amplitude
measure has apparently not been found
highly useful. It is obvious that the 
amount of instrumentation required to 
obtain reliable indices of either response 
latency or response amplitude is greater 
than that required to determine the 
occurrence or nonoccurrence of the 
response. This fact probably has con- 
tributed to the greater popularity of 
the response frequency measure. 

If the experiment is designed to relate 
the strength of conditioning to the 
number of conditioning trials, one of 
the aforementioned measures may be 
used. For each trial, or block of trials, 
the average of the number, latencies, 
or amplitudes of the CR’s may be 
computed and plotted to obtain a con- 
ditioning curve for all subjects com- 
bined. If interest is in the exact form 
of the conditioning curve, however, 
care must be taken not to average in- 
dividual curves so heterogeneous that 
the group curve will not represent the 
individual curves on which it is based. 
One useful procedure is to divide the 
subjects into subgroups, which condi- 
tion at slow, fast, and medium rates, 
and to examine the curves for these subgroups
(Spence, 1956, p. 60ff.).
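
The computation of a conditioning curve by blocks of trials, and the
division of subjects into subgroups, may be sketched as follows. Each
subject's record is taken to be a list with one score per trial (here
1 for a CR and 0 for no CR). Ranking subjects by their total number
of CR's is only one assumed way of defining slow, medium, and fast
conditioners; the function names are likewise hypothetical.

```python
from statistics import mean


def block_means(trial_scores, block_size):
    """Average one subject's per-trial scores over consecutive blocks.
    A final short block is averaged over the trials it contains."""
    return [mean(trial_scores[i:i + block_size])
            for i in range(0, len(trial_scores), block_size)]


def group_curve(subjects, block_size):
    """Average the block means across subjects, block by block."""
    curves = [block_means(scores, block_size) for scores in subjects]
    return [mean(values) for values in zip(*curves)]


def split_by_rate(subjects, n_groups=3):
    """Rank subjects by total score and split them into up to
    n_groups subgroups of roughly equal size, slowest first."""
    ranked = sorted(subjects, key=sum)
    size = -(-len(ranked) // n_groups)  # ceiling division
    return [ranked[i:i + size] for i in range(0, len(ranked), size)]


# Hypothetical records for three subjects over four trials.
subjects = [[0, 0, 0, 1], [0, 1, 1, 1], [1, 1, 1, 1]]
combined = group_curve(subjects, block_size=2)
subgroup_curves = [group_curve(g, 2) for g in split_by_rate(subjects)]
```

Comparing the subgroup curves with the combined curve shows whether
the group average misrepresents the individual records.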

Another measure of the strength of 
conditioning is the degree to which the 
conditioned subject resists extinction 
during a period when the CS is pre- 
sented without the US. Following con- 
ditioning, the experimenter presents the 
CS alone and counts the number of 
trials necessary before the CR fails to 
occur or the number of CR’s the subject 
makes in a set number of trials. Re- 
sistance to extinction measures are often 
used when it is undesirable or impractical
to obtain measures of anticipatory
responses during the conditioning phase.
Since the number of conditioning trials
is generally positively related to the
number of responses during extinction,
the latter is taken to be an index of the
strength of conditioning. The extinction
measure does not permit assessment of
the growth of conditioning during the
administration of the paired CS-US
presentations, since extinction is not begun
until all conditioning trials have
been given. For this reason, the experimenter
will often intersperse single
presentations of the CS (test-trials) 
among paired CS-US presentations dur- 
ing conditioning. The test-trials provide 
an index of the growth of the CR dur- 
ing conditioning. The extinction meas- 
ure and the test-trial procedure may 
be used when the different experimental 
conditions result in different prob- 
abilities of anticipatory CR’s; for 
example, when the experimental condi- 
tions differ in the length of the CS-US 
interval (McAllister, 1953). Recent 
practice in eyelid conditioning provides 
an extra long CS-US interval on test 
trials rather than an omission of the 
US altogether. This procedure pre- 
sumably maintains the level of motiva- 
tion and minimizes the development of 
extinction effects. 
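
The two resistance-to-extinction measures mentioned above, the number
of trials before the CR first fails to occur and the number of CR's in
a set number of trials, can be computed from a simple record of the
extinction trials. The sketch assumes that the record is a list of
true/false entries, one for each presentation of the CS alone; the
function names and the sample record are hypothetical.

```python
def trials_before_first_failure(cr_on_trial):
    """Number of CS-alone trials given before the CR first fails to
    occur. Returns the full number of trials if the CR never fails."""
    for n, occurred in enumerate(cr_on_trial):
        if not occurred:
            return n
    return len(cr_on_trial)


def crs_in_first_n(cr_on_trial, n):
    """Number of CR's made in the first n extinction trials."""
    return sum(1 for occurred in cr_on_trial[:n] if occurred)


# Hypothetical extinction record for one subject.
record = [True, True, False, True, False, False]
first_failure = trials_before_first_failure(record)
total_in_five = crs_in_first_n(record, 5)
```

The same functions can be applied to interspersed test trials if the
record is restricted to those trials.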

Classical Reward Conditioning. The 
Pavlovian experiments are the proto- 
type of classical reward conditioning. 
Classical reward conditioning in older 
children has been studied by Krasno- 
gorski, by Chuchmarev, and by Lenz. 
The elaborate laboratories, methods, 
and procedures of these investigators 
have been described in detail by Razran 
(1935) and are not repeated here. 
Razran also gives an extensive review
of the findings of these experiments. In
general, the procedures do not differ
markedly from those used by Kantrow
(1937), except that in many experiments
measures of salivation were directly
obtained. Her methods and procedures
may still serve as a useful model
for this type of conditioning. Kantrow 
used infants ranging in age from 1½ to
4 months at the beginning of the ex- 
periment. The US was the presentation 
of the nipple of the feeding bottle into 
the infant’s mouth, and the CS was the 
onset of a buzzer. The CS was presented
for five seconds before the feeding
bottle, and the CS and US overlapped
for fifteen seconds. The infants
were fed on a four-hour schedule, but 
the experiment was conducted only 
during the four daytime feedings. Sev- 
eral paired presentations of the CS and 
US were given at each feeding. 

A linen harness was placed under the 
infant’s chin and was mechanically connected
by strings to a recording polygraph
pen. A second pen recorded the
onset and duration of the CS. The experimenter
manually recorded on the
polygraph tape the time of administering
the bottle. Thus each movement of
the infant’s chin during sucking was 
recorded graphically, with its temporal 
position designated in terms of the on- 
set of the CS and US. 

Kantrow’s measure of conditioning 
was the number of sucking responses 
made in the CS-US interval, corrected 
by subtracting the number of sucking 
responses occurring during a five-sec- 
ond critical control period immediately 
preceding the conditioning trial. The 
critical control period was that period 
of five consecutive seconds, in the 25
to 75 seconds immediately preceding
the onset of the CS, which contained 
the largest number of sucking responses. 
The critical control period was designed 
to control for random or spontaneous 
sucking responses. Since the period se- 
lected was the one having the maximum 
number of sucking responses, the con- 
trol seems to be a conservative one. 
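
The critical control period can be located mechanically once the
times of the sucking responses are read from the polygraph record.
The sketch below assumes that response times are expressed in seconds
relative to the onset of the CS (negative values precede it) and that
the control period is sought in the 75 seconds before CS onset; the
text gives the pre-CS span as 25 to 75 seconds, so the limits are left
as parameters. The function names and sample times are hypothetical.

```python
def max_count_in_window(times, lo, hi, width=5.0):
    """Largest number of responses in any window of `width` seconds
    lying wholly inside the period from lo to hi."""
    if hi - lo < width:
        raise ValueError("search period is shorter than the window")
    inside = sorted(t for t in times if lo <= t < hi)
    # The best window can always be taken to start at a response,
    # or to end exactly at the close of the search period.
    starts = [t for t in inside if t + width <= hi] + [hi - width]
    return max(sum(1 for t in inside if s <= t < s + width)
               for s in starts)


def corrected_sucking_score(times, cs_us_interval=5.0,
                            control_lo=-75.0, control_hi=0.0):
    """Responses in the CS-US interval minus the count in the
    critical control period."""
    anticipatory = sum(1 for t in times if 0.0 <= t < cs_us_interval)
    control = max_count_in_window(times, control_lo, control_hi,
                                  cs_us_interval)
    return anticipatory - control


# Hypothetical response times for one trial.
times = [-60.0, -59.0, -58.5, -30.0, -2.0, 0.5, 1.0, 2.0, 3.0, 4.5]
score = corrected_sucking_score(times)
```

Because the control period is the busiest five seconds available, a
positive score is harder to obtain than it would be with an average
baseline, which is the sense in which the control is conservative.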

Kantrow did not use a group to con- 
trol for increase in sucking behavior to 
the CS, which might have occurred 
without the CS-US pairings. The de- 
cline in the anticipatory responses that 
occurred during extinction suggests that 
the paired presentations of the CS and 
US were essential to their development 
and maintenance. The use of the critical 
control period provides some control 
for pseudo-conditioning, but her design 
would have been more convincing if a 
pseudo-conditioning control group had 
been included. 

Investigators seeking to extend the 
findings of Kantrow’s study would
probably wish to consider shorter CS-US
intervals. Research by Krasnogorski,
summarized by Razran (1935), com- 
pared the secretory and the motor re- 
sponses (swallowing and sucking). The 
latencies of the motor responses were 
found to be somewhat less than one 
second, whereas the salivary responses 
had average latencies of about five sec- 
onds. Thus the use of shorter CS-US 
intervals would still permit the swal- 
lowing and sucking responses to occur 
as anticipatory responses. It might also 
be the case that the rate of condition- 
ing would be faster, and the final level 
more stable, if shorter intervals were 
used.

Classical Defense Conditioning. The
procedures used in defense conditioning
in the classical situation have been more 
varied than those in appetitional con- 
ditioning. Classical defense conditioning 
should be distinguished from instru- 
mental escape or instrumental avoid- 
ance, which is discussed in a later sec- 
tion. In classical defense conditioning 
the subject receives the noxious US on 
each trial except test trials, and the UR 
does not result in escape from the 
noxious stimulus. Moreover, if the CR 
begins to appear anticipatorily, it does 
not result in avoidance of the US. 

Aversive conditioning procedures 
with children have not been popular. 
The US must be noxious enough to 
elicit the UR reliably. On the other 
hand, since experimenters prefer the 
subject’s voluntary cooperation, the 
US must not be so noxious as to arouse 
strong fear reactions and tendencies to 
escape the experimental situation. The 
delicate balance of these two factors is 
difficult to achieve. With older chil- 
dren and adults, cooperation of the 
majority of subjects can be achieved 
even though relatively noxious stimuli 
are utilized. With infants, it is possible 
to select a US that is noxious enough 
to promote conditioning without harm- 
ing the child or arousing emotional re- 
actions that interfere unduly with the 
purpose of the experiment. 

An investigation of classical defense 
conditioning in infants was reported by 
Wickens and Wickens (1940). An elec- 
tric shock was applied to the foot of 
the infant. Approximately 250 millisec- 
onds prior to the application of the 
shock the CS (buzzer) was sounded. 
The paired CS-US presentations were 
given 12 times a day for three days.
On the third day 12 presentations of
the buzzer were also administered without
shock. The experimenters observed
the number of leg flexions that occurred
on the test trials. Two control groups
were included. In one group the CS
was presented without shock for the
same number of trials given the experimental
group. In the second control
group the US was presented without
the CS. Both control groups received
12 test trials on the third day.
The results verify the need to control
pseudo-conditioning, since the group
given only shock gave as many leg
flexions to the buzzer as the group
given paired CS-US presentations. The
control group given only the CS made
significantly fewer responses than either
of the other two groups.

Wickens and Wickens depended for 
their basic data on the observations of 
the experimenters. Wenger (1936) de- 
veloped methods for the graphic re- 
cording of the onset of the CS and US 
and of the infants’ CR’s. His records 
permitted the study of foot and toe 
movements as well as the grosser limb 
withdrawal. The Wickens’ method- 
ology would have been considerably 
improved had they used the record- 
ing techniques developed earlier by 
Wenger. 

Eyelid conditioning procedures, com- 
parable to those used so successfully 
with adults, have not been developed 
for children. Although the adult pro- 
cedures could be adapted rather easily 
to upper elementary and high-school 
children, special problems may be in- 
volved with younger children. Modern 
techniques utilize sensitive potentiometers
fastened to the eyelid, and best
results are obtained when the subject
remains quiet with the head relatively 
immobile (Spence, 1953). The adapta- 
tion of such procedures to younger 
children entails obvious difficulties. 
One study of eyelid conditioning 
with older infants, however, suggests 
that if proper conditions can be 
achieved this type of defense condi- 
tioning will be profitable. Morgan and 
Morgan (1944) presented as the US a 
puff of air to the eye, administered 
manually with a hand syringe. The CS 
was the sight of the syringe being 
brought to the infant’s eye. The ex- 
perimenters did not use any graphic or 
other mechanical means of recording, 
depending on the experimenter’s judg- 
ment to determine whether the blink 
was anticipatory or random. They report
a fairly high level of conditioning
as having been obtained in a consider- 
able number of infants between the 
ages of 6 and 10 weeks. Unfortunately, 
adequate controls for random or spon- 
taneous blinking were not included. 
An early study of the conditioning 
of the GSR was reported by Jones 
(1930). He was able to demonstrate 
sizable increases in the magnitude of 
the GSR to stimuli after they had been 
paired with shock. Effective use of the
GSR procedure requires elaborate ap- 
paratus as well as unusually good con- 
trol over extraneous stimulation. The 
GSR has been found to be sensitive to 
a wide variety of stimuli and produces 
reliable stable measures only when the 
subject is relatively inactive. Adapting 
the procedure to unselected samples of 
younger children poses some technical 
problems that remain to be solved. 
The conditioning of “overt emotional
responses” in young children has
been investigated by Watson and 
Rayner (1920) and Jones (1931). 
These experimenters paired neutral 
stimuli with noxious stimuli that elic- 
ited crying and escape responses and 
noted the subsequent tendencies for 
these responses to occur with neutral 
stimuli. Extensive laboratory investiga- 
tions of such conditioning have not 
been made. Further investigation seems 
warranted only if response measures 
are developed that are sufficiently pre- 
cise to permit fruitful study of such 
variables as intensity of the US, length 
of temporal intervals, intensity of the 
CS, range of generalization, and the 
like. If these variables can be as satis- 
factorily studied with the GSR, the 
elicitation of overt emotional responses 
in young children appears unnecessary. 

Although the study of aversive con- 
ditioning in the classical framework 
would undoubtedly add considerably 
to our knowledge of learning in chil- 
dren, particularly to our understanding 
of the learning of fear and anxiety, the 
practical difficulties of developing ade- 
quate procedures are not to be under- 
estimated. It is likely that fruitful study 
of this type of conditioning in children 
must be preceded by a thorough un- 
derstanding of the phenomenon in in- 
frahuman organisms. 


INSTRUMENTAL CONDITIONING 


Historically, instrumental condition- 
ing is probably an outgrowth of the 
trial-and-error experiments of C. L. 
Morgan and E. L. Thorndike (Hilgard 
and Marquis, 1940, p. 53). Indeed,
Spence (1956, p. 37ff.) characterizes instrumental
conditioning as a limiting
case of trial-and-error learning in which 
the occurrence of competing responses 
is minimized. It is convenient to discuss 
the methodology of classical and in- 
strumental conditioning separately be- 
cause different apparatus and proce- 
dures have been employed. The sepa- 
rate discussion, however, does not 
necessarily imply a conviction that dif- 
ferent laws govern behavior in the two 
types of situations. 

General Considerations. Two proce- 
dures for instrumental conditioning, 
distinguished by the manner in which 
a trial is defined, are currently recog- 
nized and applied. The free-operant 
method, developed and intensively 
studied by Skinner (1938), places the 
subject in a situation in which occur- 
rence of a given response results in a 
reward. The subject is free to make
as many of these responses as the time 
allows. The course of conditioning is 
measured by an increase with succes- 
sive periods in the rate of responding 
in the conditioning situation. The 
“trial” is thus an arbitrary period of 
time set by the experimenter. The CS 
presumably includes internal cues, aris- 
ing from processes inside the organism, 
and the external situation, particularly 
the response manipulandum. 

The discrete-trial method resembles 
classical conditioning. The trial is de- 
fined in terms of the presentation of the 
CS. The subject is ordinarily allowed 
an arbitrary maximum time for the re- 
sponse to occur. The trial is terminated 
when the CR occurs or when the maxi- 
mum time has elapsed. The CS is spec- 
ified as it is in classical conditioning 
and is equally subject to experimental
control. The free-operant and discrete-trial
methods have also been combined
to develop the “controlled-operant”
method. The CS is presented on each
trial for a set period of time during
which the subject is free to respond
repeatedly. Here, the usual response
measure is the number of CR’s per trial.

Free-operant and discrete-trial conditioning
have been employed in the
investigation of somewhat different
types of problems. Free-operant conditioning
has been widely used in the
study of motivational and incentive
variables as well as in the study of
reinforcement schedules. The discrete-trial
method has been preferred in the
study of those problems having to do 
with characteristics of the CS, such as 
stimulus generalization, variation in CS 
intensity, and differential conditioning. 
The discrete-trial procedure has also 
been preferred by psychologists con- 
cerned with the development of S-R 
learning theories, since the manner of 
specifying a conditioning trial is more 
consistent with learning principles in 
these theories. 

One of the major problems of the 
investigator of instrumental condition- 
ing is the selection of an appropriate 
response to be conditioned. Before con- 
ditioning can occur, the CR must be 
elicited. After it occurs in the presence 
of the CS, the experimenter may in- 
crease its probability of occurrence on 
the next trial by an immediate admin- 
istration of a reward. It is usually de- 
sirable to select a response that has a 
high probability of occurring in the 
experimental situation. Several proce- 
dures have been used to elicit the CR. 
The bar-pressing response in rats has
been facilitated by increasing the distinctiveness
of the response mechanism,
differentiating it from the background
stimuli. Usually, food is smeared on the
bar during the early trials to elicit approach
and manipulative responses. The
normal exploratory behavior of rats is
exploited in the instrumental conditioning
of a running or approach response
in the simple runway situation.
With children, instructions to pull a
lever, press a button, squeeze a bulb,
etc., have been given to increase the
probability of occurrence of the CR.
As noted previously, these arrangements
of the experimenter serve as a
substitute for the US of the classical
conditioning experiment. It is, of
course, desirable to select a response 
that can be observed and recorded with 
a high degree of reliability. 

Response Measures. The response 
measures available in instrumental con- 
ditioning vary somewhat from one pro- 
cedure to another. Several measures 
based on the frequency of the CR have 
been used. In discrete-trial condition- 
ing the number of subjects responding 
on each trial or the number of CR’s 
given by each subject in a block of 
trials may be determined. With the 
usual procedures for instrumental con- 
ditioning, however, neither of these 
measures proves to be highly satisfac- 
tory. The experimenter ordinarily at- 
tempts to arrange the situation so that 
the subject responds on each trial. As 
noted earlier, this is accomplished by 
pretraining, by instructing human sub- 
jects, or by arranging the experimental 
situation so that there is a high prob- 
ability of occurrence of the response. 
Thus a conditioning curve based on 
frequency of CR’s would begin at or 
near 100 per cent and show little change 
thereafter. Spence (1956, p. 77) has 
shown that a frequency measure re- 
flecting the course of conditioning can 
be obtained whenever the latencies of 
the CR’s are also recorded. By his pro- 
cedure, an arbitrary latency criterion 
may be set so that a response is counted 
as a CR only if its latency is equal to 
or less than the criterion. In tests of 
generalization the standard frequency 
of response measure may still be a sat- 
isfactory index of the strength of gen- 
eralized conditioning (Shepard, 1956). 
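
The latency-criterion measure attributed above to Spence can be illustrated with a short sketch. The function name, the block size, and the latency values below are hypothetical and serve only to show the arithmetic; they are not taken from Spence's data.

```python
def percent_cr_by_block(latencies, criterion, block_size):
    """Percentage of trials per block counted as CRs.

    A trial counts as a CR only if a response occurred and its latency
    (in seconds) is equal to or less than the arbitrary criterion.
    Use None for a trial on which no response occurred.
    """
    percentages = []
    for start in range(0, len(latencies), block_size):
        block = latencies[start:start + block_size]
        hits = sum(1 for lat in block if lat is not None and lat <= criterion)
        percentages.append(100.0 * hits / len(block))
    return percentages


# Hypothetical latencies for one subject who responds on every trial.
example = [2.4, 1.9, 1.6, 1.2, 0.9, 0.8, 0.7, 0.7]
print(percent_cr_by_block(example, criterion=1.0, block_size=4))
```

Although the subject in this example responds on every trial, the criterion-based frequency rises across blocks, which is the property that makes the measure useful when the raw frequency stays near 100 per cent.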

The free-operant conditioning pro- 
cedure typically provides the rate-of- 
response measure. The total time that 
the subject is in the conditioning situa- 
tion may be divided into a number of 
time intervals. The rate of responding 
in any interval is given by dividing the 
number of responses by the amount of 
time. The controlled-operant proce- 
dure provides the number of responses 
occurring during each CS presentation. 
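
The rate-of-response computation described for the free-operant procedure may be sketched as follows. The sketch assumes that each response has been recorded as a time in seconds from the start of the session; the function name and the example times are illustrative only.

```python
def response_rates(response_times, session_length, interval_length):
    """Responses per second in each successive interval of the session.

    The session is divided into whole intervals of interval_length
    seconds; the rate in an interval is the number of responses in it
    divided by the length of the interval.
    """
    n_intervals = int(session_length // interval_length)
    counts = [0] * n_intervals
    for t in response_times:
        index = int(t // interval_length)
        if 0 <= index < n_intervals:
            counts[index] += 1
    return [count / interval_length for count in counts]


# Hypothetical response times over a two-minute session, one-minute intervals.
print(response_rates([1, 2, 3, 61, 119], session_length=120, interval_length=60))
```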

Several time measures may profitably 
be used to show the course of condi- 
tioning. In the discrete-trial method the 
time between the onset of the CS and 
the occurrence of the response is often 
designated as the response latency. 
Strictly speaking, this measure will ap- 
proximate the response latency only if 
the response has a very short duration. 
The distinction between response la- 
tency and response duration can be 
exemplified in the runway situation in 
which the rat is placed in the starting 
box behind an opaque door leading into 
the alley. The time between the open- 
ing of the opaque door (the CS pres- 
entation) and the rat’s beginning to 
traverse the alley is the latency of the 
CR. After the rat has started, the time 
taken to traverse the alley and enter 
the goal box may be designated the re- 
sponse duration. A measure that in- 
cludes the time between the opening 
of the door and the rat’s entry into the 
goal box is composed of the response 
latency and duration. The reciprocal of 
the response latency is called the re- 
sponse evocation speed, whereas the 
reciprocal of the response duration is 
the response speed. It is, of course, 
quite possible that experimental vari- 
ables may affect these two measures 
differently, with the consequence that 
they are not highly correlated. Re- 
sponse evocation speeds, for example, 
seem to be more sensitive to distracting 
stimuli and to variations in the subject’s 
orienting behavior than response speeds. 
Therefore, it is advisable to analyze the 
total response time into the two com- 
ponents unless the response duration is 
quite brief. 
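
The separation of total response time into its two components can be shown with a brief sketch. It assumes three recorded times per trial (CS onset, beginning of the response, completion of the response); the function name and the values are hypothetical.

```python
def evocation_and_response_speed(cs_onset, response_start, response_end):
    """Return (response evocation speed, response speed) for one trial.

    Evocation speed is the reciprocal of the latency (CS onset to the
    start of the response); response speed is the reciprocal of the
    duration (start of the response to its completion).
    """
    latency = response_start - cs_onset
    duration = response_end - response_start
    if latency <= 0 or duration <= 0:
        raise ValueError("times must satisfy cs_onset < start < end")
    return 1.0 / latency, 1.0 / duration


# Runway example: door opens at 0 s, rat starts at 2 s, enters goal box at 6 s.
print(evocation_and_response_speed(0.0, 2.0, 6.0))
```

Keeping the two values separate allows the experimenter to check whether a variable affects one measure and not the other.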

The free-operant procedure does not 
permit the determination of response 
latencies, since the CS is not under the 
experimenter’s control. If the responses 
are graphically recorded, however, re- 
sponse-duration times may be obtained 
in either the free-operant or controlled- 
operant situations. In the controlled- 
operant situation the time between the 
onset of the CS and the occurrence of 
the first response may also be obtained. 

Several measures of resistance to ex- 
tinction have been developed, all of 
which are obtained during nonrein- 
forced trials following conditioning. 
Two measures are typical of the dis- 
crete-trial method. The experimenter 
may count the number of CR’s made 
in a set number of extinction trials. Or 
he may administer the nonreinforced 
CS until the subject fails to respond, 
on one or more successive trials, within 
a predetermined period of time; his 
measure, then, is the number of CS 
presentations required to reach the 
criterion. The same general procedures 
apply in the controlled-operant 
situation, in which the number of 
responses per trial is the basic 
measure. Free-operant conditioning, of 
course, provides the rate-of-responding 
measure during extinction as well as 
in conditioning. Furthermore, the 
experimenter may record the total 
number of responses the subject makes 
before failing to respond in a 
predetermined period of time. 
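
The two discrete-trial extinction measures may be sketched as follows. The sketch assumes one true/false entry per extinction trial, indicating whether the subject responded within the allowed time, and it assumes that the trials on which the subject failed to respond are included in the count of CS presentations; both the names and the counting convention are illustrative.

```python
def crs_in_fixed_trials(responded, n_trials):
    """Number of CRs made in the first n_trials extinction trials."""
    return sum(1 for r in responded[:n_trials] if r)


def trials_to_criterion(responded, consecutive_failures):
    """CS presentations needed to reach a run of consecutive failures.

    Returns the number of the trial on which the criterion run is
    completed, or None if the criterion is never reached.
    """
    run = 0
    for trial_number, r in enumerate(responded, start=1):
        run = 0 if r else run + 1
        if run == consecutive_failures:
            return trial_number
    return None


# Hypothetical extinction record for one subject.
record = [True, True, False, True, False, False, True]
print(crs_in_fixed_trials(record, 5), trials_to_criterion(record, 2))
```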


Although amplitude of the CR has 
been used as an index of response 
strength (Holton, 1956; Haner and 
Brown, 1955), it is questionable 
whether or not these measures provide 
a good index of strength of condition- 
ing in the instrumental situation. Hull 
(1943, p. 304ff.) cites evidence that a 
rat learned to respond with a particular 
amount of pressure on a bar. Holton’s 
experiment showed no consistent 
increase with practice in the pressure 
preschool children applied to the response 
mechanism. For these reasons, response 
amplitude in instrumental situations 
should be carefully explored before it 
is assumed that it reflects the strength 
of conditioning. 

Instrumental Reward Conditioning: 
Discrete-Trial Method. One of the sim- 
pler arrangements for instrumental re- 
ward conditioning has been described 
by Shepard (1956) in connection with 
her study of stimulus generalization. The 
apparatus and procedure were adapted 
to children of preschool ages (3⁄4 to 6 
years). The apparatus consisted of a 
box with an aperture in which a colored 
light (the CS) was presented, a push- 
button on a small panel as the response 
mechanism, and an interval timer to 
control the duration of the CS. The CS 
was presented for four seconds and the 
subject was required to respond within 
that time. Shepard used verbal praise 
as rewards for responses that occurred 
within the interval. 

Unfortunately, the only response 
measure available with this procedure 
is the frequency of occurrence of the 
CR. The rapid learning of preschool 
children under these conditions makes 
this measure relatively insensitive to 
variations in the experimental variables. 
Although the number of responses that 
occur during an extinction session could 
be obtained, this measure tends to be 
quite variable for children when the 
reward has been experimenter approval 
and the experimenter remains in the 
room during extinction. Two modifica- 
tions of the Shepard apparatus would 
overcome these difficulties. First, a re- 
ward dispensing unit may be incorpo- 
rated into the apparatus so that the ex- 
perimenter need not remain in the 
room. Second, an electric clock may 
be wired into the circuit so that it will 
start with the onset of the CS and be 
stopped by the subject’s response. This 
provides a measure of response latency, 
since response duration is brief, that 
may be used to obtain evocation speeds 
or to derive a response frequency meas- 
ure based on an arbitrary latency cri- 
terion. 

Stimuli other than lights may be 
used; other visual stimuli may be 
presented in a visual stimulus 
presentation device (e.g., a memory 
drum), and 
auditory stimuli may be employed. 
Similarly, any objective instrumental 
response may be substituted for the 
button pushing. The only requirement 
is that the response be simple, discrete, 
and capable of operating an electric 
switch. Responses requiring the devel- 
opment of a complex motor skill are 
probably to be avoided, since measures 
of such responses reflect the develop- 
ment of the skill rather than simple 
conditioning. Anything that appeals to 
the subjects may be given as a reward 
if it can be reliably dispensed auto- 
matically. 

Longstreth (1957) adapted to pre- 
school children the instrumental run- 
way situation frequently used with rats. 
At one end of a large room he placed 
a wooden screen, approximately 6 feet 
high and 4 feet wide, behind which 
were located the controls and record- 
ing equipment. During the experiment 
the experimenter remained behind the 
screen viewing the child through a one- 
way mirror. On the subject’s side of 
the screen was a small door with spring 
hinges arranged so that the pushing of 
a button under the door caused the 
door to spring open to expose a reward 
compartment. Directly above the door 
was a single stimulus aperture through 
which colored lights could be filtered. 
On the floor at the base of the screen 
was a switch-mat. Extending away from 
the screen was a rubber hall-carpet ap- 
proximately 12 feet long. A small chair 
was placed at the end of the carpet. 
Two electric clocks were wired into 
circuits, so that the first clock started 
when the CS was presented. When the 
child rose from the chair, he activated 
a switch that stopped the first clock and 
started the second. Finally, when he 
stepped on the switch-mat at the end 
of the “alley,” he stopped the second 
clock. The first clock thus recorded the 
response latency and the second, the 
running time. 

Although Longstreth studied the ef- 
fects of incentives on the children’s 
behavior, his procedure may also prove 
fruitful in the study of such variables 
as amount of training, generalization, 
stimulus intensity, etc. It is necessary 
in this situation, as in others involving 
time measures, to minimize extraneous 
distracting stimuli and/or to present 
dynamic stimuli (e.g., change in room 
illumination, flashing lights, raucous 
buzzers) in order to obtain reliable 
response measures. 

Instrumental Reward Conditioning: 
Controlled-Operant Method. The mod- 
ified operant procedure used with in- 
frahuman subjects (e.g., Guttman and 
Kalish, 1956) has been adapted to pre- 
school children (Spiker, 1956a, 1956b). 
The controlled-operant method pro- 
vides the experimenter with better con- 
trol of the CS than the free-operant 
method and at the same time retains 
much of the latter’s reliability of 
response measure. Spiker’s procedure 
involved the presentation of a stimulus 
light in a flashed opal glass aperture for 
a three-second period. The subject was 
required to pull a lever. Each pull of 
the lever resulted in the automatic 
delivery of a marble and was recorded 
on an electric impulse counter. 
Responses made between presentations of 
the CS were neither rewarded nor 
recorded. The response measure was the 
number of responses occurring during 
each CS presentation (trial). White 
(1958) used a similar procedure, 
except that visual stimuli (colored 
patches) were presented by means of 
a device serving the same function as a 
memory drum. 
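
The controlled-operant response measure can be illustrated by a small sketch that counts only those responses falling within a CS presentation. It assumes that response times and CS onsets are available in seconds from a common clock; the function name and the example values are hypothetical and do not reproduce Spiker's recording apparatus, which used an impulse counter.

```python
def responses_per_trial(response_times, cs_onsets, cs_duration):
    """Number of responses made during each CS presentation.

    Responses made between presentations of the CS are ignored,
    just as they were neither rewarded nor recorded in the procedure.
    """
    counts = []
    for onset in cs_onsets:
        in_window = [t for t in response_times if onset <= t < onset + cs_duration]
        counts.append(len(in_window))
    return counts


# Two hypothetical three-second CS presentations beginning at 0 s and 10 s.
pulls = [0.5, 1.0, 4.0, 10.2, 11.0, 12.9, 13.5]
print(responses_per_trial(pulls, cs_onsets=[0.0, 10.0], cs_duration=3.0))
```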

An interesting fact concerning this 
type of conditioning is that if the sub- 
jects are pretrained to pull the lever, 
without presentation of the CS, the 
rate of responding to the CS during 
conditioning shows little increase. This 
fact suggests that the change in the rate of 
responding that occurs without such 
pretraining largely represents skill 
learning rather than an increase in the 
strength of conditioning. Essentially, 
the same thing may be true of the free- 
operant situation. A disadvantage of 
both free-operant and controlled-op- 
erant methods is that the experimenter 
has no direct control over the rate of 
responding and thus has no control over 
the amount of fatigue developed by the 
individual subject. 

Instrumental Reward Conditioning: 
Free-Operant Method. In an exploratory 
study, Warren and Brown (1943) 
adapted the free-operant method to 
children of preschool ages. The ap- 
paratus consisted of a mechanism to 
deliver a piece of candy down a chute 
when the lever projecting into the ex- 
perimental room was depressed. After 
adaptation sessions, the subjects were 
placed in the room with the door 
closed and with two or three toys in 
addition to the lever. A kymographic 
record of each response was obtained, 
providing the data to determine the 
rate of lever pressing. Acquisition 
under continuous and partial 
reinforcement and extinction were 
studied. The acquisition curves 
generally indicated that the children 
responded at relatively 
constant rates after the first few trials. 

Whereas Warren and Brown left the 
child to discover the lever and the 
consequences of pressing it, Siegel and 
Foshee (1953) oriented their subjects 
to the lever by instructions. They 
counted the number of nonrewarded 
responses during extinction as the 
measure of the strength of the 
conditioning and reported a positive 
correlation between number of 
previously reinforced responses and 
resistance to extinction. 

An experiment by Screven (1954) 
qualifies as a free-operant study. The 
turning of a crank by the subject en- 
ergized in succession 12 jeweled lights 
placed on a vertical panel. When the 
lights reached the top of the column, 
a marble was automatically ejected to 
the subject, ending the trial. The re- 
sponse measure was the rate of crank 
turning—number of turns per second 
—in each trial. Although the rate of 
crank turning had no effect on the rate 
of progression of the lights, there were 
significant increases in the rate of turn- 
ing as a function of the number of pre- 
ceding trials. The speed of turning was 
read directly from a speed indicator. 
In a second study by Screven and 
Nunis (1954) the crank was attached 
to a small generator, and the voltage 
produced by crank turning was re- 
corded by the sweep-pen of a volt- 
meter on a polygraph tape. The crank- 
turning response in the free-responding 
situation was also utilized by Lambert, 
Lambert and Watson (1953) in the 
study of the effects on behavior of re- 
ward cessation. 

Another type of operant condition- 
ing has been described in detail by 
Bijou (1955, 1957a) and utilized in the 
study of patterns of reinforcement 
(1957b). In its simplest form it is prob- 
ably best adapted to children of pre- 
school and kindergarten ages, although 
older children might be used for certain 
types of problems. The child takes a 
ball from a receptacle and places it in 
a hole in the front face of the apparatus. 
A motor-driven bicycle chain with 
small trinkets attached is arranged so 
that a trinket may be released into a 
chute and delivered to the child. Plac- 
ing the ball into the hole activates a 
polygraph pen that records the number 
of responses and the rate of their emis- 
sion. Depending on the reinforcement 
schedule prearranged by the experi- 
menter, the response may also activate 
the reward dispenser. 

Bijou (1958) also describes a “labo- 
ratory on wheels,” which was devised 
to circumvent some of the difficulties 
inherent in experimenting with children 
in immobile laboratories. The mobile 
laboratory, constructed in a house- 
trailer, can be taken by the experi- 
menter to the children’s homes or 
schools. It thereby eliminates the prob- 
lems involved in transporting the chil- 
dren to and from the laboratory, and, 
if the subjects are to be drawn from 
several different schools, it provides the 
experimenter with the same experimental 
situation in which to run the subjects. 

Operant conditioning of infants 
approximately 12 weeks of age has been 
reported by Rheingold and Gewirtz 
(1957). First, the experimenters ob- 
tained a baseline of vocalization by re- 
cording the number of vocalized re- 
sponses occurring in set periods of time 
during which the experimenter would 
bend over the infant’s crib with an ex- 
pressionless face. After the baseline was 
established, each vocalization by an in- 
fant was followed by a clucking sound, 
a smile, and a patting of the infant's 
abdomen by the experimenter. Finally, 
an extinction period was begun during 
which the experimenter reverted to the 
procedure of the baseline period. The 
investigators report a significant increase 
in rate of vocalization during the con- 
ditioning phase, followed during ex- 
tinction by a significant decrease to- 
ward the level of the baseline. The use 
of this procedure to reinforce those 
sounds that are relatively rare in young 
infants (Irwin, 1947) would provide 
some interesting results for the 
development of speech sounds. 

One of the major advantages of free- 
operant conditioning is that it provides 
a reliable, objective response measure. 
The measure has proved to be sensitive 
to variations in the number of previous 
rewarded responses, in the value of the 
incentives given, in the motivational 
level of the subject, and in the rein- 
forcement schedule. The difficulties of 
the free-operant situation include the 
lack of experimental control over the 
administration of the CS, the complica- 
tions of response chaining, and the in- 
ability of the experimenter to control 
the amount of work done by the sub- 
ject. 

Instrumental Escape Conditioning. 
Since instrumental escape conditioning 
with child subjects has not been 
reported, the method is illustrated with 
an infrahuman study. A rat may be 
placed in one of two compartments 
with a low barrier between them. The 
floors of each compartment contain 
grids through which shock to the 
animal’s feet may be administered. The 
onset of shock stimulates activity on the 
part of the rat, in which jumping over 
the barrier is the terminal response. The 
grid of the second compartment is not 
activated, so the subject escapes the 
shock. He remains in the second 
compartment until shock is administered 
again, eliciting a sequence of behaviors 
until the barrier is jumped again. A 
repeated series of trials results in the 
gradual reduction in the time between 
the onset of shock and the jumping of 
the barrier. The latency of the jumping 
response constitutes the primary meas- 
ure of conditioning. 

The difficulties encountered in ad- 
ministering noxious stimulation to chil- 
dren probably account for the paucity 
of research in this type of learning. 
Moreover, the most convenient experi- 
mental arrangements for children, such 
as shock to a limb, would probably not 
result in changes in the latencies of 
simple withdrawal responses. It should 
be noted that in instrumental escape 
conditioning there is no CS preceding 
the noxious stimulus. The onset of the 
noxious stimulus is the subject's first 
cue to perform the escape response. 

Instrumental Avoidance Condition- 
ing. Instrumental avoidance condition- 
ing is similar to escape conditioning, 
except that a CS precedes the noxious 
stimulus, and if the subject makes the 
CR prior to the onset of the noxious 
stimulus there is complete avoidance of 
the effects. Frequently, to produce 
avoidance conditioning, preliminary 
training in escape conditioning is given 
to establish the response to the noxious 
stimulus at a high level. Instrumental 
avoidance conditioning starts out as 
identical to classical defense 
conditioning. As the subject begins to 
make anticipatory CR’s, however, the 
procedure changes from classical defense 
conditioning to a partial reinforcement 
or extinction condition of classical de- 
fense conditioning. 

Razran (1935) summarized the results 
of several studies of conditioned 
finger withdrawal in children 
conducted in the laboratory of Bekhterev. 
A similar procedure has been used by 
Wickens (1938, 1939, 1943) with 
adults. The selection of highly volun- 
tary responses as CR’s complicates the 
interpretation of results. Instructions 
designed to prevent subjects from in- 
hibiting or deliberately making antici- 
patory responses are given. It is ques- 
tionable whether laws obtained from 
such situations will resemble those ob- 
tained from conditioning studies in 
which the subject is unaware of inter- 
relations of the stimuli and the response. 

Higher-Order Instrumental Condi- 
tioning. The procedure originally de- 
veloped by Ivanov-Smolensky (1927) 
for use with children is an example of 
higher-order instrumental conditioning. 
It involves the setting up, by training 
or instruction, of an instrumental 
response to a stimulus (S1) which did 
not originally elicit that response. After 
this preliminary training or instruction, 
S1 serves in a role analogous to that of 
the US in classical conditioning. A 
second stimulus (S2) is then introduced 
in a role analogous to the CS of 
classical conditioning. That is, S2 is 
presented prior to the onset of S1, and the 
basic measure is the number of times 
that the response occurs anticipatorily 
to S2. 

Ivanov-Smolensky’s procedure has 
been described in considerable detail by 
Razran (1935) and Munn (1954). The 
experimenter arranges a sloping tube 
extending from the experimenter’s com- 
partment into the subject’s compart- 
ment. On the top surface of the sub- 
ject’s end of the tube is a glass window. 
When the experimenter releases a piece 
of candy into his end of the tube, the 
subject sees the candy as it rolls past 
the window. The subject is instructed 
to squeeze a bulb in order to eject the 
candy from his end of the tube. Thus 
the sight of the candy moving down 
the tube serves as the US eliciting the 
squeezing response. The experimenter 
precedes the insertion of the candy into 
the tube by presenting the CS—a bell, 
buzzer, or flash of light. After a few 
trials, the subject begins to anticipate 
the sight of the candy in the tube by 
squeezing the bulb at the onset of the 
CS. The time of occurrence of the CS, 
the US, and the subject’s response are 
recorded for each trial on a polygraph 
tape. 
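
The basic measure in this procedure, the number of anticipatory responses, can be derived from the recorded times of the CS, the US, and the response. The following sketch assumes that each trial is represented by those three times in seconds, with None when no response occurred; the function name and the example values are hypothetical.

```python
def count_anticipatory(trials):
    """Count trials on which the response anticipated the US.

    Each trial is (cs_time, us_time, response_time). A response is
    anticipatory if it occurs at or after CS onset and before US onset.
    """
    return sum(
        1
        for cs_time, us_time, response_time in trials
        if response_time is not None and cs_time <= response_time < us_time
    )


# Hypothetical record: CS at 0 s and US (sight of the candy) at 2 s.
record_of_trials = [
    (0.0, 2.0, 2.6),   # response after the US: not anticipatory
    (0.0, 2.0, 1.4),   # anticipatory
    (0.0, 2.0, None),  # no response
    (0.0, 2.0, 0.9),   # anticipatory
]
print(count_anticipatory(record_of_trials))
```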

Dernowa-Yarmolenko (1933) fol- 
lowed essentially the same procedure 
in the conditioning of a voluntary hand 
movement. The experimenter instructed 
the subjects, as a group in the class- 
room, to raise their hands when the 
experimenter raised his (US). The 
experimenter would then tap a pencil on 
the desk (CS) two seconds before rais- 
ing his hand. Anticipatory hand-raising 
to the pencil tap was noted. 

Dernowa-Yarmolenko’s finding that 
the number of anticipatory responses 
decreased with age of the subjects sug- 
gests that the occurrence of anticipa- 
tory voluntary responses is viewed as 
incorrect by the subject and the re- 
sponses tend to be voluntarily inhib- 
ited. A decrease in the time interval 
between the CS and US might serve 
to reduce the subject's sensitivity to the 
“false” responses. 


GENERALIZATION 
OF CONDITIONING 


The simplest form of transfer of 
training has been studied in the context 
of classical and instrumental condition- 
ing. Although the basic idea is simple, 
the methodological problems are nu- 
merous. When a response has been 
conditioned to a given stimulus (S1), 
it is often found to occur when some 
other stimulus (S2) is presented, even 
though the response has never been 
conditioned to S2. If the response 
occurs to the new stimulus (S2), it is said 
to have generalized. Generalization is 
studied by presenting generalized or 
test stimuli (GS) to the subject fol- 
lowing conditioning. Suppose, for ex- 
ample, the experimenter has trained a 
group of children to get a reward by 
pulling a lever each time a light is pre- 
sented with a brightness of 5000 foot- 
candles (f.c.). The subjects have re- 
ceived 25 conditioning trials, and they 
now pull the lever promptly each time 
the CS is presented. Suppose the test 
stimuli are lights of 500, 50, and 5 f.c. 
When the experimenter presents one of 
the GS’s, he must either reward or not 
reward the child if the latter pulls the 
lever. If the reward is given, then a 
conditioning trial has been 
administered, and the child’s subsequent 
tendency to pull the lever to that GS has 
been strengthened. If the reward is not 
given, an extinction trial has been ad- 
ministered, and the tendency to make 
the response subsequently has been 
weakened. If the experimenter wishes 
to determine the relation between the 
strength of the response tendency and 
the similarity of the GS to the CS, 
following a certain amount of 
conditioning, then a GS can be presented to a 
given child only once. From available 
evidence (e.g., Hovland, 1937a,b), it 
seems safe to conclude that the amount 
of generalization is an increasing func- 
tion of the similarity of the CS and GS. 

It also appears that the extinction 
effects built up to a stimulus through 
nonreinforcement generalize to similar 
stimuli (Hovland, 1937c; Bass and Hull, 
1934). This fact further complicates 
the study of the generalization gradient. 
If a given subject is presented with 
more than one GS, the extinction ef- 
fects that result from the nonreinforce- 
ment of one GS will generalize to 
others, thereby distorting the gradient 
of generalized conditioning. Hence, if 
one wishes to obtain a reliable measure 
of generalized conditioning, uncon- 
taminated by extinction effects and the 
generalization of these effects, it is nec- 
essary to administer each GS to a siz- 
able group of subjects and to test each 
subject only once. 
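
The independent-groups design recommended here may be sketched as follows. Each subject contributes a single test record: the brightness of the one GS presented and whether the response occurred. The function name and the data are hypothetical; the brightness values merely echo the 500, 50, and 5 f.c. test lights of the example above.

```python
def generalization_gradient(records):
    """Proportion of subjects responding to each GS.

    records holds one (gs_value, responded) pair per subject, so no
    subject is tested more than once. The result is ordered from the
    GS nearest the 5000 f.c. CS to the GS farthest from it.
    """
    totals = {}
    for gs_value, responded in records:
        n, k = totals.get(gs_value, (0, 0))
        totals[gs_value] = (n + 1, k + (1 if responded else 0))
    ordered = sorted(totals.items(), reverse=True)
    return {gs_value: k / n for gs_value, (n, k) in ordered}


# Hypothetical single-test records, two subjects per GS.
tests = [(500, True), (500, True), (50, True), (50, False), (5, False), (5, False)]
print(generalization_gradient(tests))
```

In practice each group would need to be sizable, as the text notes, since each subject supplies only one observation.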

The typical practice has been to 
sacrifice some precision in estimating the 
shape of the gradient. Sometimes each 
GS is repeatedly administered to a 
different group of subjects, and sometimes 
each GS is repeatedly administered to 
each subject. There are many problems 
that can be attacked with these ap- 
proximate procedures, but the law re- 
lating generalized response strength to 
the similarity of the GS and the CS 
must await a highly precise experiment. 

Generalization can be assessed by 
any response measure that is sensitive 
to change or conditioning. The shape 
of the obtained generalization gradient 
may well depend on the particular 
response measures employed in the 
experiment. Because the experimenter 
does not have adequate control of the 
CS in free-operant conditioning, the 
study of stimulus generalization in this 
situation is difficult. 

Experimental study of stimulus gen- 
eralization in instrumental conditioning 
is reported by Mednick and Lehtinen 
(1957) and by Spiker (1956a,b). White 
(1958) reported generalization data 
with variation in two dimensions of the 
CS. Systematic investigation of chil- 
dren’s generalization tendencies in clas- 
sical conditioning remains to be con- 
ducted. 


Discrimination Learning 


Discrimination-learning problems re- 
quire the subject to respond differen- 
tially to different simultaneously or 
successively presented stimuli. A sub- 
ject responds differentially when he 
makes a response to one stimulus and 
does not make that response to others, 
as well as when he makes a different 
response to each stimulus. He may be 
rewarded for making the correct re- 
sponse and punished and/or nonre- 
warded for making the incorrect re- 
sponse to a stimulus. The discrimina- 
tion may be developed in other ways, 
however. For example, the subject may 
be given a large reward if he responds 
to one stimulus and a small one if he 
responds to the other. Or, responding 
to the “correct” stimulus may result in 
immediate reward, whereas responding 
to the “incorrect” stimulus may result 
in reward only after considerable delay. 

Discrimination-learning situations may 
be classified in several ways. The the- 
oretical commitments of a writer often 
seem to dictate his preference for a 
given classificatory schema. In the pres- 
ent chapter the different types of dis- 
crimination-learning problems will be 
labeled according to conventional us- 
age, a practice that pleases no theorist 
but wears the cloak of custom. Differ- 
ential conditioning, also called condi- 
tioned discrimination, involves condi- 
tioning the subject to make a response 
to a given stimulus and extinguishing 
his tendency to make that response to 
other stimuli presented at other times. 
Simultaneous discrimination learning re- 
quires the subject to select or respond 
to a given one of two or more simul- 
taneously presented stimuli. Successive 
or patterned discrimination learning 
requires the subject to respond in one 
manner to a given stimulus and in a 
different manner to other stimuli pre- 
sented at other times. One of the major 
concerns of present discrimination- 
learning theories is to provide a 
satisfactory account of the empirical 
phenomena that have been observed in each 
of these situations. 


DIFFERENTIAL CONDITIONING 


Since differential conditioning may 
be studied in any of the conditioning 
situations previously described, there is 
no need for a detailed summary of the 
experimental arrangements necessary 
for its study. Differential conditioning 
involves the successive presentation of 
at least two different stimuli, to one of 
which the subject is rewarded for re- 
sponding, whereas response to the other 
is not rewarded or at least is followed 
by a less attractive reward. The dis- 
crimination is developed in one of two 
ways. With the successive phase, the 
subject is first consistently rewarded 
for responding to the positive stimulus 
until he responds to it each time. Then, 
for the first time, the negative stimulus 
is introduced without reward, and both 
stimuli are presented in a random se- 
quence until the subject consistently 
responds differentially to them. In the 
mixed phase both stimuli are presented 
alternately from the beginning. The 
mixed phase procedure, in its pure 
form, is rarely used with infrahuman 
subjects because the extinction effects 
generated from presentations of the 
negative stimulus generalize and make 
difficult the establishment of the re- 
sponse to the positive stimulus. With 
children, and especially older children, 
response to the positive stimulus may 
be satisfactorily established by instruc- 
tions so that the mixed-phase procedure 
does not result in the complete extinc- 
tion of the response. 

In differential conditioning the 
experimenter may obtain separate 
measures for responses made to the 
positive and to the negative stimuli. He 
may record whether or not the response 
occurs to each stimulus, as well as the 
speed and magnitude of the response. 
The development of the discrimination 
with successive trials is reflected by the 
divergence of response curves for the 
positive and negative stimulus 
presentations. A single curve reflecting 
the development of the discrimination 
may be obtained by plotting, for each 
block of trials, that percentage of the 
total responses made to the positive 
stimulus. This measure will reach 100 
per cent when the subject responds only 
to the positive stimulus. For 
controlled-operant conditioning, in 
which multiple responses are permitted 
on each trial, the 
number of responses to each stimulus 
presentation may be obtained as a meas- 
ure of the differentiation between the 
positive and negative stimuli (Spiker 
and White, 1959). 
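
The single-curve measure described above can be sketched in a few
lines of Python. This is only an illustration under assumed
conventions: the trial record format, the labels "pos" and "neg", and
the function name percent_to_positive are hypothetical and are not
taken from any of the studies cited.

```python
from typing import List, Optional, Tuple

# Assumed record format: (stimulus, responded), where stimulus is
# "pos" or "neg" and responded tells whether the response occurred.
Trial = Tuple[str, bool]


def percent_to_positive(trials: List[Trial], block_size: int) -> List[Optional[float]]:
    """For each block of trials, return the percentage of all responses
    in that block that were made to the positive stimulus.

    A block with no responses at all yields None, since the percentage
    is undefined there.
    """
    curve: List[Optional[float]] = []
    for start in range(0, len(trials), block_size):
        block = trials[start:start + block_size]
        to_positive = sum(1 for stim, responded in block if responded and stim == "pos")
        total = sum(1 for _, responded in block if responded)
        curve.append(100.0 * to_positive / total if total else None)
    return curve


# Hypothetical data: the subject stops responding to the negative
# stimulus in the second block of four trials.
example = [
    ("pos", True), ("neg", True), ("pos", True), ("neg", True),
    ("pos", True), ("neg", False), ("pos", True), ("neg", False),
]
curve = percent_to_positive(example, block_size=4)
```

With these invented data the measure rises from 50 to 100 per cent,
the ceiling reached when responses are made only to the positive
stimulus.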

Whereas differential conditioning has 
been extensively studied with infrahu- 
man subjects, particularly in instrumen- 
tal situations, comparable studies of 
children are not yet available in the 
literature. Kasatkin and Levikova
(1935a,b) differentially conditioned the
sucking response in infants to visual and
auditory stimuli. Spiker and White
report an experiment in which preschool
children were differentially conditioned
in the controlled-operant situation.

Although the preceding paragraphs 
have referred to appetitional condition- 
ing situations, differential conditioning 
could be carried out in aversive
conditioning situations as well. In this
case the subject would learn to make a
defense, escape, or avoidance reaction to
the reinforced stimulus and not to make
such a response to the nonreinforced
stimulus.


There is one important advantage of
differential conditioning in the study of
discrimination learning. Since the
positive stimulus is presented alone, the
subject's response to it is not
immediately affected by his tendency to
respond to the negative stimulus and vice
versa. Thus conditioning to the positive
stimulus and extinction to the negative
stimulus may be studied without their
being confounded by the necessity for
the subject to choose between the two
stimuli. The next section contains a
discussion of differential conditioning
in simultaneous discrimination problems.
When differential conditioning and
simultaneous discrimination learning are
studied in the same situation, it is
possible to investigate the relationship
between the relative strengths of
response tendencies to different stimuli
and the choice behavior that occurs when
the different response tendencies are
simultaneously aroused.


SIMULTANEOUS DISCRIMINATION 
LEARNING 


The positive and negative stimuli are 
presented at the same time in simulta- 
neous discrimination learning. The sub- 
ject must choose among the two or 
more stimuli presented on each trial. 
The choice of a given stimulus, there- 
fore, does not simply reflect the abso- 
lute strength of the tendency to re- 
spond to that stimulus; it reflects the 
strength of that tendency relative to 
the strength of tendencies to respond
to the other simultaneously presented
stimuli. The typical response measures 
are the number of choices of the posi- 
tive or correct stimulus or, conversely, 
the number of errors in selecting the 
negative stimulus. If the subject is al- 
lowed to correct on each stimulus set- 
ting, two different error measures are 
frequent. One is the total number of 
errors made in reaching a performance 
criterion and the other is the number 
of trials or settings on which the first 
response was an error. 
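
The two error measures used with the correction procedure can be
computed from a simple record of choices. The following Python sketch
assumes a hypothetical record in which each stimulus setting is a list
of the subject's successive choices (True for correct, False for an
error); the function name and the data are illustrative only.

```python
from typing import Dict, List


def error_measures(settings: List[List[bool]]) -> Dict[str, int]:
    """Compute two error measures for a correction procedure.

    settings[i] holds the choices made on setting i, in the order they
    were made (True = correct, False = error).
    """
    # Every incorrect choice counts toward the total number of errors.
    total_errors = sum(1 for choices in settings for choice in choices if not choice)
    # A setting counts once if its first response was an error.
    initial_error_settings = sum(1 for choices in settings if choices and not choices[0])
    return {
        "total_errors": total_errors,
        "initial_error_settings": initial_error_settings,
    }


# Hypothetical record: two errors then a correct choice, an immediately
# correct choice, and one error followed by a correct choice.
measures = error_measures([[False, False, True], [True], [False, True]])
```

The two measures diverge whenever a subject makes repeated errors on
the same setting, as on the first setting of this invented record.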

The simplest simultaneous problem 
involves a spatial discrimination. Thus 
the subject may learn to select con- 
sistently the left one of two identical 
boxes. In more complex spatial dis- 
criminations the subject may, for ex- 
ample, learn to select the box fourth 
from the left in an array of nine
identical boxes. Stevenson and Zigler (1958)
used a three-choice problem for pre- 
school children. Three response knobs 
were placed on the front of the ap- 
paratus. A signal light was above the 
knobs, and an aperture through which 
rewards (marbles) were delivered to 
the subjects was below the knobs. The 
left, right, or middle knob was assigned 
as correct to a given subject. From the 
Stevenson and Zigler results, it is ap- 
parent that this problem is quite easy 
for preschool children, unless it is made 
more difficult by a partial reinforce- 
ment of the correct response. 

Gewirtz and Baer (1958a,b) have 
developed a procedure for the study 
of spatial discrimination that resembles 
the free-operant conditioning procedure
described by Bijou (1955, 1957a).
The subject is placed before an
apparatus that contains two holes into
which the subject can put marbles. The
experimenter arbitrarily designates one 
of the two holes as correct; he rewards 
the subject for placing marbles in that 
hole and gives no reward for marbles 
placed in the other hole. The total ses- 
sion is divided into equal time units, 
and the response measure is the ratio 
of correct responses to total responses 
per unit of time. The problems studied 
by Gewirtz and Baer required them to 
administer social rewards (verbal ap- 
proval), but the apparatus could readily 
be adapted to the administration of ma- 
terial rewards. 
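
The response measure described above, the ratio of correct to total
responses within each equal unit of session time, might be computed
as follows. This Python sketch assumes a hypothetical log of
(time in seconds, correct or not) entries; the function name and
parameters are not from Gewirtz and Baer.

```python
from typing import List, Optional, Tuple


def correct_ratio_per_unit(
    events: List[Tuple[float, bool]],
    session_length: float,
    n_units: int,
) -> List[Optional[float]]:
    """Divide the session into n_units equal time units and return, for
    each unit, correct responses divided by total responses.

    A unit in which no response was made yields None.
    """
    unit = session_length / n_units
    correct = [0] * n_units
    total = [0] * n_units
    for time, is_correct in events:
        # A response exactly at the end of the session goes in the last unit.
        index = min(int(time // unit), n_units - 1)
        total[index] += 1
        correct[index] += int(is_correct)
    return [c / n if n else None for c, n in zip(correct, total)]


# Hypothetical two-minute session scored in one-minute units.
log = [(5.0, True), (20.0, False), (70.0, True), (100.0, True)]
ratios = correct_ratio_per_unit(log, session_length=120.0, n_units=2)
```

Because the ratio is taken within each time unit, it does not depend
on how fast the child happens to be dropping marbles.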

A more difficult simultaneous prob- 
lem utilizes both spatial and nonspatial 
cues. The procedure is nicely illustrated 
in an experiment by Kuenne (1946). 
In this experiment the nonspatial cues 
were two stimulus plaques, differing in 
size, placed on the fronts of two goal 
compartments. The child found a re- 
ward in the compartment behind the 
small plaque, regardless of the compart- 
ment on which it was placed. On half 
the trials the positive cue was placed 
on the right compartment, and on the 
other half, on the left compartment. 
The problem could not be solved sat- 
isfactorily on the basis of position. 

Schaeffer and Gerjuoy (1955) de- 
vised a simpler arrangement. Rewards 
(marbles) were hidden in wooden 
cubes with holes in their bases. The 
subject was simply required to pick up 
the cube of the correct hue in order 
to receive the reward consistently. 
Jeffrey (1958) and Terrell and Ken- 
nedy (1957) place push-buttons di- 
rectly in front of their three-dimen- 
sional stimuli. The subject then signals 
his choice by pushing the corresponding
button. This technique has the advantage
that a discrete response permits graphic
or other relatively permanent recording.

The simultaneous problem can be 
adapted to children of different ages 
by varying certain parameters that
determine problem difficulty. Problem
difficulty can be increased by increasing
the similarity of the stimuli to be
discriminated. Thus a black-white
discrimination can be made more difficult
by using light-gray and dark-gray
stimuli. An increase in the number of
stimuli to be discriminated will also
increase problem difficulty. The subject
might be required to choose one of five
rather than one of two discriminable
stimuli, regardless of spatial position.
The difficulty of the task may also be
increased by increasing the
number of simultaneous discriminations 
the subject is required to learn concur- 
rently. For example, the subject might 
be required to choose the white stimu- 
lus of a pair of black and white circles 
and to choose the red stimulus if a red 
and a green circle were presented. In 
this case he would be learning concur- 
rently a brightness and a hue discrimi- 
nation, although on different trials of 
the task. 

An experiment by Weiss (1954) 
demonstrated that the preliminary in- 
structions given the subjects also affect 
the difficulty of the task. He found 
that preschool children who had been 
told that one of the stimulus boxes 
always contained the reward learned 
significantly faster than children who 
had not been apprised of the experi- 
menter’s consistency. His experiment 
also suggested that calling the children’s
attention to the relevant dimensions of
the discriminanda subsequently facili- 
tated their learning. The marked dif- 
ferences between the rates of learning 
in the Kuenne experiment and in a 
similar one by Alberts and Ehrenfreund 
(1951) may be the result, in part, of 
differences in instructions. 

Logan (1952) developed a useful
procedure in which differential
conditioning was studied within the
framework of a simultaneous
discrimination problem. Although Logan
studied rats, Lipsitt and Castaneda
(1958) used a similar procedure with
preschool children. Each block of trials
contains some
on which only the positive stimulus is 
presented and some on which only the 
negative stimulus is presented. Each 
block also contains at least one trial on 
which both stimuli are presented. The 
latter constitutes a free-choice trial. In 
this way, the subject’s choice behavior 
can be related to the relative speeds of 
his response to the positive and negative 
stimuli when they are presented alone. 
The procedure has been used success- 
fully in the study of theoretical prob- 
lems dealing with the competition or 
conflict of habits in the choice situa- 
tion. If it is feasible for the positive 
and negative stimuli to be presented 
both separately and simultaneously and 
if the subjects can respond in the same 
manner to the single and to the joint 
presentations, the Logan procedure may 
be profitably employed. 




SUCCESSIVE DISCRIMINATION 
LEARNING 


If the child were required to select
the left one of two white boxes and
the right one of two black boxes, with
each pair of boxes presented on a
different trial, the problem would be
called a successive or patterned
discrimination problem. Thus the
successive problem can be described as
one in which the nonspatial cues are
never present at the same time and the
subject must choose the response to make
on each trial. Regardless of current
theoretical attempts to integrate the
simultaneous and successive types of
discrimination learning (e.g., Spence,
1952; Weise and Bitterman, 1951), the
performances of subjects on the two
types of task differ markedly. It is
therefore important to analyze each
discrimination task to determine which
of the two it represents. Auditory,
gustatory, and olfactory discriminations,
of course, can be studied satisfactorily
only in successive discrimination
situations.

The literature does not yet contain 
many examples of the learning by chil- 
dren of successive problems. Perkins, 
Banks, and Calvin (1954) arranged a 
successive problem by presenting one 
of two discriminanda between two 
identical cups on each trial. The loca- 
tion of the correct cup on each trial 
depended on which of the two dis- 
criminanda was in place. This attempt 
to isolate one of the variables that 
affects the relative difficulty of the 
simultaneous and successive problems is 
typical of the current theoretical and 
research interest in discrimination 
learning. 

Gerjuoy (1953) used a successive 
discrimination problem with elemen- 
tary-school children. In this case the 
stimuli were three lights, differing only
in their positions on a stimulus panel.
The responses involved pushing one of 
three buttons to each presentation of a 
light. Pushing the correct button turned 
out the stimulus light, and the subject 
was told that the object of the task 
was to learn to turn out the stimulus 
light with the first response on each 
trial. This type of task may also be 
referred to as motor paired-associate 
learning. The fact that it can be con- 
ceptualized either as successive dis- 
crimination or paired-associate learning 
suggests the possibility of integrating 
the extensive findings in each of these 
two areas. A more detailed discussion 
of these tasks is given in a later section 
on motor paired-associate learning (see
p. 408).

The difficulty of the successive dis- 
crimination problem may be varied in 
much the same way as that of the si- 
multaneous. The number of stimuli to 
be discriminated may be varied, the 
similarity of these stimuli may be in- 
creased or decreased, the time allowed 
for the choice may be varied, the sub- 
ject may be allowed to correct on each 
trial or he may be informed only that 
his first choice is right or wrong, and 
so on. The number of stimuli may be 
varied with the number of response 
choices held constant, the number of 
response choices may be varied with 
the number of stimuli held constant, or 
the number of stimuli and response 
choices may be varied. The findings of 
Perkins, Banks, and Calvin (1954) sug- 
gest that the successive problem is quite 
difficult for children. However, this 
apparent difficulty may be largely the 
result of peculiarities of instructions 
and procedures, which may be eliminated
upon further study of the problem.

It is apparent that successive
discrimination is a common occurrence in
the everyday lives of children. Learning
to read, for example, requires the child
to make a different verbal response to
each of several similar words without
having all the words simultaneously
presented. The need to study this type
of discrimination in relatively simple,
well-controlled situations is obvious.


RELATIONAL DISCRIMINATION
LEARNING


A subject could perform correctly
in both simultaneous and successive
problems without attending to all
stimuli on every trial. In the
simultaneous problem he could select the
white stimulus (positive) without having
observed the black stimulus on that
trial. Although the subject cannot
observe both white and black on the same
trial in the successive problem, he can
observe “white on the left” and approach
this compound without observing the
other compound, “white on the right.”

Relational discrimination learning is
to be distinguished from these by the
fact that the subject, if he is to perform
perfectly, must observe all discriminanda
on each trial. Consider the following
arrangement: choice of the circle is
rewarded when it is presented with a
triangle, choice of the triangle is
rewarded when it is presented with a
square, and choice of the square is
rewarded when it is presented with the
circle. If the spatial arrangements of
the pairs of stimuli were systematically
counterbalanced, the reward would not
be consistently associated with any
given form, with either position, or
even with any given combination of
position and form. This type of learning
has also been called “transverse
patterning” (Spence, 1952). Only if the
subject attends to the negative stimulus,
as well as to the positive one, can he
receive adequate information to make
correct choices consistently. The
problem thus demands a comparative
orienting response on each trial.
Research indicates that such a problem is
difficult for preschool children. For
example, in a problem requiring the
subjects to select the middle-sized of
three stimuli on each trial, in which the
absolute size of stimuli varied from
trial to trial, only subjects for whom a
test had indicated understanding of the
concept “middle-sizedness” showed any
evidence of learning in 30 trials
(Spiker, Gerjuoy, and Shepard, 1956).
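
The claim that reward in the circle-triangle-square arrangement is
tied to no single form, position, or form-position compound can be
checked by enumeration. The Python sketch below builds the six
counterbalanced trials and computes how often a subject using each
simple cue would be rewarded. The data structures and function names
are hypothetical and serve only as an illustration.

```python
from typing import Dict, List

# Rewarded form for each pair, as in the arrangement described above.
REWARDED = {
    ("circle", "triangle"): "circle",
    ("triangle", "square"): "triangle",
    ("square", "circle"): "square",
}


def counterbalanced_trials() -> List[Dict[str, str]]:
    """Each pair appears once in each left-right arrangement."""
    trials = []
    for (a, b), winner in REWARDED.items():
        for left, right in ((a, b), (b, a)):
            trials.append({"left": left, "right": right, "correct": winner})
    return trials


def form_rate(trials: List[Dict[str, str]], form: str) -> float:
    """Proportion of trials containing the form on which choosing it pays."""
    present = [t for t in trials if form in (t["left"], t["right"])]
    return sum(t["correct"] == form for t in present) / len(present)


def side_rate(trials: List[Dict[str, str]], side: str) -> float:
    """Proportion of trials on which the stimulus on the given side is correct."""
    return sum(t[side] == t["correct"] for t in trials) / len(trials)


def compound_rate(trials: List[Dict[str, str]], form: str, side: str) -> float:
    """Proportion of trials with the form on the given side on which it is correct."""
    present = [t for t in trials if t[side] == form]
    return sum(t["correct"] == form for t in present) / len(present)


trials = counterbalanced_trials()
forms = ("circle", "triangle", "square")
sides = ("left", "right")
rates = (
    [form_rate(trials, f) for f in forms]
    + [side_rate(trials, s) for s in sides]
    + [compound_rate(trials, f, s) for f in forms for s in sides]
)
```

Every rate in the list is one half, so none of these cues does better
than chance; only a comparison of both stimuli on a trial identifies
the rewarded one.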

It is likely that experienced preschool 
children often solve even the simulta- 
neous discrimination problems by using 
a comparative orienting response. For 
example, if two stimuli differing in size 
are presented on each trial and the re- 
ward is consistently associated with the 
larger, the preschool child who has had 
considerable experience in such prob- 
lems often is observed to turn first to 
one, then to the other, and to repeat 
this sequence one or more times before 
making a choice. It is likely that once 
a subject has developed this type of 
orienting behavior the relational prob- 
lem can be solved with relative ease. 
In their study of monkeys’ solutions of
the oddity problem, Moon and Harlow
(1955) demonstrate that infrahuman
subjects can solve problems requiring
observation of all stimuli simultaneously
presented. Subjects possessing a high 
level of verbal facility could learn a 
preliminary set of instructions that 
would permit errorless performance on 
the relational problems, even without 
preliminary practice on the problem. 


TRANSFER OF TRAINING 
IN DISCRIMINATION LEARNING 


The transfer of discrimination learn- 
ing has been studied in several different 
contexts. Harlow (1949) reports evi- 
dence for the development of learning- 
to-learn or discrimination-learning set 
in preschool children as a function of 
the number of previous simultaneous 
discrimination problems they had 
learned. The rate of learning improved 
with the number of previous problems 
solved, even when the stimuli in the 
series of problems were unrelated. The 
same finding was reported by Shepard 
(1957) when her preschool subjects 
learned a series of successive discrimi- 
nation problems. 

The transfer of discrimination learn- 
ing within the context of the transposi- 
tion experiment provides another exam- 
ple of transfer of training. Kuenne 
(1946) showed that preschool-aged 
children transpose on a size
discrimination. That is, after the subjects had
learned to select the smaller of two 
stimuli, they would select the smaller 
of another pair of stimuli differing in 
size from the original pair. Whether 
such transfer is positive or negative de- 
pends on which stimulus of the second 
problem is correct. Similar findings 
were reported in the study of Alberts 
and Ehrenfreund (1951). 



Cantor (1955) demonstrated that 
preliminary training involving learning 
names for the stimuli will facilitate sub- 
sequent discrimination of the same 
stimuli in a simultaneous problem. 
Norcross and Spiker (1957) confirmed 
Cantor’s results in a similar experiment. 
Norcross (1958) showed facilitation, 
resulting from previous name-learning, 
in a successive discrimination-learning 
problem. The study by Lipsitt and 
Castaneda (1958), mentioned previ- 
ously, demonstrated transfer from pre- 
vious differential conditioning to a si- 
multaneous problem involving the 
same stimuli. 


GENERAL CONSIDERATIONS 
IN DISCRIMINATION LEARNING 


Although it is not difficult to devise 
discrimination-learning problems for 
children that are analogous to those 
used with infrahuman subjects, it is 
probably a serious error to assume that 
children learn such problems in the 
same way as infrahuman subjects. Dif- 
ferences in the performance of children 
in discrimination learning after they 
have learned names for the stimuli 
(Cantor, 1955; Gerjuoy, 1953; Shepard, 
1954; Norcross, 1958; and Smith and 
Goss, 1955) demonstrate conclusively 
that the language abilities of the sub- 
jects play an important role in their 
solution of discrimination problems. 
Although the restricted motility of pre- 
verbal children makes difficult the com- 
parison of their discrimination learning 
with that of infrahuman subjects, such 
comparative studies would provide 
much information on the role of
language in simple learning.

The preschool child’s ability to
conceptualize the goal of a discrimination
problem probably plays a crucial role
in his reactions to a nonrewarded
response. If the child has been instructed
and understands that the goal is to get 
a reward on each trial, a nonreward 
seems to serve as a punishment. This 
can be concluded from the fact that 
numerous nonrewarded trials early in 
learning result in a marked negative 
attitude toward the task, a result that 
also occurs when punishments and/or 
criticisms are given for early errors. 
On the other hand, if the goal of “per- 
fection” is not mentioned or is de- 
emphasized by the experimenter, the 
child will often settle for a reward on 
only a fraction of the trials. In the 
latter case learning may not occur un- 
less the rewards are highly pertinent to 
the child’s current needs. This ability 
to conceptualize a perfect performance 
may prove to be one of the factors that 
critically distinguish children’s discrimi- 
nation performance from that of infra- 
human subjects. However, reactions to 
failure, similar to those of children, 
have been observed in monkeys and 
chimpanzees who have had consider- 
able prior experience in discrimination 
learning tasks. Indeed, recent studies 
with rats suggest that the omission of 
a customary (“expected”) reward re- 
sults in marked evidence of emotional 
reactions (Amsel and Roussell, 1952; 
Roussell, 1952). 


Rote Learning 


Certain types of learning problems 
are customarily referred to as
rote-learning tasks. This practice, of
course, is no evidence that the laws
governing rote learning are fundamentally
different from those governing other
types of learning. Research on learning
is not yet adequate to answer this
question.

If a task requires a subject to make a
different response to each of several
stimuli and if there is no principle or
set of principles that systematically
determines which response goes with
which stimulus, the task is usually
labeled a rote-learning task. If the
responses the subject makes are verbal,
the task is called a rote-verbal-learning
task; if the responses are motor, it is
called a rote-motor-learning task.

On the other hand, if a task, for ex- 
ample, requires the subject to learn to 
respond to each stimulus word with its 
antonym, it would not be called a rote- 
learning task. In this case a single prin- 
ciple determines the stimulus-response 
pairs, and the subject's discovery and 
use of this principle will greatly
simplify the learning task. Similarly, a
concept-formation task in which the
subject is required to learn to make the 
same verbal response to each member 
of a class of stimulus items, in which 
the members of a class have common 
features (e.g., common functions, com- 
mon lenat, and similar shapes), 
would also be distinguished from rote- 
learning tasks (Metzger, 1958). 

Discrimination-learning tasks, in 
which the subject learns to select one 
stimulus that is arbitrarily designated as 
correct by the experimenter, are not 
ordinarily referred to as rote-learning 
tasks. Except for convention, however, 
there appears to be no reason to
distinguish such tasks from those that are
called rote-learning tasks. The laws that
govern the learning of rote tasks and 
discrimination tasks may ultimately be 
found to be similar or overlapping. 

Although convention is not entirely 
consistent on this point, the learning of 
prose and poetry is not ordinarily clas- 
sified as rote learning. This seems to be 
especially true if the subject “under- 
stands” the prose or poetry that he 
learns. On the other hand, learning to 
associate pairs of meaningful words is 
usually called rote learning, even when 
the pairs of words are highly associated 
to begin with. It is probably not surpris- 
ing that psychologists do not agree 
perfectly on the tasks to be classified 
as rote-learning tasks. The distinction 
between rote learning and more insight- 
ful types of learning is based on the 
degree to which the subject understands 
(or is likely to understand) the prin- 
ciples inherent in the task. The dis- 
covery of just how well the subject 
understands such principles is the object 
of many studies of learning. 


ROTE VERBAL LEARNING 


The use of rote-verbal-learning tasks 
in experimental psychology dates back 
to the early work of Ebbinghaus 
(1885). There are several types of 
verbal tasks that have been used with 
adult subjects as well as several different 
procedures for the study of each. Three 
types of rote verbal tasks are described 
here—list learning, serial learning, and 
paired-associate learning. 

List Learning. For this task, the sub- 
ject is presented with a list of items 
(e.g., words, nonsense syllables, and
numbers), is allowed to study them,
and is then asked to reproduce the list
in its absence. The subject is not re- 
quired to reproduce the list in the order 
in which it was originally presented. 
The list-learning procedure was used 
by Lahey (1937) to study the extent to 
which the learning of new lists affected 
elementary-school children’s recall of 
previously learned lists. 

There are two major techniques of 
presenting the list. In the complete pres- 
entation method the entire list is given 
to the subject, and he is allowed to 
study it for a specified period of time. 
After the list has been removed, the 
subject is asked to recall as many of the 
items as possible. Either the amount of 
study time or the number of study 
periods may be varied to determine the 
effect of practice on recall. With the 
second technique, each item is presented 
separately for a constant length of time 
on each trial. The second procedure is 
preferred, since it provides better con- 
trol over the amount of time the sub- 
ject spends on each item. 

For prereading subjects, a series of 
pictures or objects can be presented, 
which the subject is asked to recall in 
the same way as words, nonsense syl- 
lables, or digits (Bayley, 1926; Mc- 
Geoch, 1928). Auditory presentation of 
the materials permits the use of words, 
nonsense syllables, and digits with pre- 
reading subjects (Bayley, 1926; Hur- 
lock and Newmark, 1931). Although 
most studies involving such procedures 
have been concerned with recall fol- 
lowing a single presentation of the list 
(memory span), the procedures seem 
entirely appropriate for repeated pres- 
entation in the study of learning. 

Manual methods of presenting the
items may be used, but current practice
prefers the automatic presentation
procedures whenever possible. For verbal
materials (words, nonsense syllables,
digits, pictures, figures, and letters),
automatic presentation may be
accomplished by use of the memory drum,
or similar devices, which permit a
constant exposure time for each item as
well as a constant between-item interval.
Control of this interval is considered
important, since it is known that
between-item intervals are often used for
rehearsal. The auditory presentation of
verbal materials can be accomplished
by tape-recordings, in which each item
is equally spaced with respect to time.

Serial Verbal Learning. The major 
difference between list learning and 
serial verbal learning is that the latter
requires the subject to reproduce the list
in the order in which it was presented
to him. Serial-learning tasks have been
given to children primarily to study 
their memory spans. Whereas the litera- 
ture on serial verbal learning in adults is 
massive, the research with children is 
less impressive. It may be that the read- 
ing ability of elementary-school chil- 
dren usually has been considered 
inadequate for the efficient study of 
verbal learning. 

Serial verbal learning has been pre- 
sented by two methods. The method of 
complete presentation involves, as in 
list learning, the presentation of the 
entire list to the subject, followed by a
test trial in which the subject is asked
to reproduce the list in the proper order.
Lepley (1934) conducted an extensive
study of the serial learning of
elementary-school children with the
complete-presentation method. The
serial-verbal learning of adults is
currently studied primarily by the
serial-anticipation method (Hilgard,
1951). The materials are typically
presented in a memory drum. The items in
the list appear at a constant rate, and
the subject is required to attempt to
call out the next item in the list before
it appears in the drum. Once through the
list is considered a trial, and the
response measure is usually the number of
correct anticipations per trial. The
progress of learning is represented by a
plot of the number of correct
anticipations per trial or per block of
trials. Andrews, Shapiro, and Cofer
(1954) report the successful study of
serial learning in elementary-school
children (9 to 13 years of age) by the
serial-anticipation method. Their
anticipation interval was
the usual two seconds provided for 
adult subjects. It seems likely that the 
serial-anticipation method could also 
be used with prereading subjects if 
auditory or pictorial presentation of the 
materials were used and if relatively 
long anticipation intervals were pro- 
vided. 
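
Scoring by the serial-anticipation method reduces to counting, on each
trip through the list, the items the subject called out correctly
before they appeared. The Python sketch below assumes a hypothetical
record in which each trial is a list of the subject's anticipations,
one per list position, with None marking a failure to respond. It also
assumes that the first item is cued by a start signal, so that every
position can be anticipated. The words and function names are
invented.

```python
from typing import List, Optional, Sequence


def correct_anticipations(serial_list: Sequence[str],
                          responses: Sequence[Optional[str]]) -> int:
    """Number of positions on one trial at which the anticipation
    matched the item that then appeared."""
    return sum(1 for item, said in zip(serial_list, responses) if said == item)


def learning_curve(serial_list: Sequence[str],
                   trials: Sequence[Sequence[Optional[str]]]) -> List[int]:
    """Correct anticipations per trial, in trial order, ready for plotting."""
    return [correct_anticipations(serial_list, responses) for responses in trials]


# Hypothetical four-item list and three trials for one subject.
items = ["dog", "cup", "sun", "hat"]
record = [
    [None, "sun", None, "hat"],     # one correct, one misplaced item, two omissions
    ["dog", "cup", None, "hat"],    # three correct, one omission
    ["dog", "cup", "sun", "hat"],   # errorless trial
]
curve = learning_curve(items, record)
```

Averaging successive entries of the curve would give the
per-block-of-trials version of the same plot.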

One of the major limitations of serial 
learning is that it is impossible to vary 
the stimulus and response items inde- 
pendently. Each item serves both as a 
stimulus item and as a response item. 
The study of the effect of stimulus 
similarity on rate of learning, holding 
response similarity constant, could not 
be accomplished with a serial task. The 
study of the effect of response famil- 
iarity on rate of learning, holding con- 
stant stimulus familiarity, likewise 
would not be possible. Since there are a
number of important problems concerned
with the manipulation of stimulus
and response factors independently,
it is fortunate that the method of paired- 
associate verbal learning has been de- 
veloped. 

Paired-Associate Verbal Learning. 
In paired-associate verbal learning the 
subject is presented with several pairs 
of words and is asked to associate the 
pairs in such a way that, upon pres- 
entation of the first member, he will 
be able to call out the second. Two 
techniques for presenting the materials 
have been used. In the first, called the 
complete-presentation method, the pairs 
of words are exposed together, one pair 
at a time, until the list has been com- 
pletely presented. Then the first mem- 
ber of each pair is presented, and the 
subject is asked to respond with the 
second member. The amount of time 
given for the recall of each item varies 
from experiment to experiment. In some 
experiments the subject is prompted if 
he fails to recall; in others, he is not 
prompted. The order of presenting the 
pairs is varied from trial to trial to 
prevent serial learning. The paired- 
associate learning of elementary-school 
children was successfully studied by 
Postman and Murphy (1943) with the 
complete-presentation method. For- 
merly, verbal paired-associate learning 
was studied almost exclusively by this 
method. Currently, the paired-associate 
anticipation method is most frequently 
used. With this method, the first mem- 
ber of a pair is presented alone for a 
set period of time. It is followed im- 
mediately by the joint presentation of 
both members. The subject is asked to 
call out the response member before 
it appears. The usual rate of
presentation to adult subjects is such that the
subject has approximately two seconds
in which to anticipate the response. 
Thus, on each presentation of a pair, 
the stimulus word is immediately fol- 
lowed by information as to which 
word is correct for that stimulus.¹

It is probable that the development 
and increasingly prevalent use of the 
paired-associate method in recent years 
has been due, in part, to certain limita- 
tions of the serial learning method. 
With the paired-associate method, the 
stimulus and response items can be 
independently manipulated. Paired- 
associate procedures also permit an ex- 
tensive analysis of the various types of 
error that occur during learning. When 
the anticipation procedure is used, the 
experimenter may obtain the number of 
intralist errors, extralist errors, and 
omissions, in addition to the number of 
correct anticipations. Intralist errors, 
which occur when the subject responds 
to one stimulus with a word that is 
correct for another, are particularly 
sensitive to variations in the similarity 
of the stimulus items. Intruded errors 
—response words that are not in the 
list—may be of particular interest when 
the experiment is concerned with the 
effects of interpolated learning tasks. 
Failures to respond within the anticipa-
tion interval (omissions) constitute
another major source of error which
may be of interest in investigations of
negative transfer or interference due to
blocking.

¹It may be noted that the paired-associate
anticipation method resembles the procedure
in classical conditioning. The stimulus word
may be thought of as the CS, the response
word as the US, the anticipation interval as
the CS-US interval, the subject’s reading of
the response word when it is exposed as the
UR, and his overt anticipation of this word
as the CR. It is tempting to describe paired-
associate verbal learning as the concurrent
classical conditioning of several responses.
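
The four response categories obtained with the anticipation procedure
(correct anticipations, intralist errors, extralist or intruded errors,
and omissions) can be tallied once the list and the subject's responses
on a trial are recorded. The sketch below is only an illustration under
stated assumptions: the function name `score_trial`, the sample words,
and the use of `None` to mark a failure to respond within the interval
are hypothetical and are not drawn from any study cited here.

```python
def score_trial(pairs, responses):
    """Classify each anticipation on one trial.

    pairs     -- dict mapping stimulus word -> correct response word
    responses -- dict mapping stimulus word -> word the subject gave,
                 or None if nothing was said within the interval
    """
    list_responses = set(pairs.values())
    tally = {"correct": 0, "intralist": 0, "extralist": 0, "omission": 0}
    for stimulus, correct in pairs.items():
        given = responses.get(stimulus)
        if given is None:
            tally["omission"] += 1      # no response in the interval
        elif given == correct:
            tally["correct"] += 1
        elif given in list_responses:
            tally["intralist"] += 1     # correct for another stimulus
        else:
            tally["extralist"] += 1     # word not in the list at all
    return tally


if __name__ == "__main__":
    pairs = {"dog": "lamp", "cup": "tree", "hat": "boat", "pen": "shoe"}
    responses = {"dog": "lamp", "cup": "boat", "hat": None, "pen": "moon"}
    print(score_trial(pairs, responses))
```

Keeping the four counts separate, rather than recording only the number
correct, preserves the distinctions that the analysis of similarity and
interference effects depends on.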

Several procedures have been devised
to adapt paired-associate learning to
subjects who either do not read well or
do not read at all. Lund (1927) used a
task in which 5-year-old children were
required to say the names of color
stimuli as rapidly as possible. Failures
to respond or erroneous responses were
followed by oral prompting. Jersild
(1932) used a similar task for 7- and 8-
year-old children.

Norcross and Spiker (1958) adapted
the paired-associate anticipation method
to kindergarten and first-grade children.
These investigators presented pictures
of single objects for both stimulus and
response items. The subjects were in-
structed to call out the names of the
pictured objects. Anticipation intervals
considerably longer than those used for
adults were selected in order to avoid
too frequent failures. The between-
item intervals were kept relatively brief
to avoid boredom and lack of attention.
The between-trial intervals were some-
what lengthened to prevent the devel-
opment of excessive fatigue.

Still a different method of presenting
paired-associate materials by the an-
ticipation procedure was used by Reese
(1958). He studied children from the
fourth, fifth, and sixth grades whose
reading ability was relatively poor. The
stimulus items were lights differing in
hue and/or brightness and the response
items were nonsense words. A bell
signaled the end of the anticipation in-
terval, at which time the experimenter
orally prompted the subject, whether
or not the subject had correctly an-
ticipated. Reese’s procedure should
also be useful with older elementary-
school children if the response items
were nonsense syllables that were dif-
ficult to read and pronounce.

The paired-associate verbal task is
nicely suited to the study of children’s
learning because of the ease with which
its difficulty can be adapted. The task
is made much easier if one- or two-
syllable nouns naming familiar, con-
crete objects are used. The subject’s
attention to the materials can be im-
proved if he is required to pronounce
both the stimulus item and the response
item on each presentation. It may also
be desirable to present independently 
the stimulus and response items prior to 
administering the paired list in order to 
determine any individual idiosyncrasies 
in pronunciation and to familiarize the 
child with the response items (Norcross 
and Spiker, 1958). Finally, the number 
of pairs in the list can be reduced or 
increased to adapt the difficulty of the 
tasks to the subjects used. Paired-as- 
sociate verbal learning has been studied 
in elementary-school children of in- 
termediate grades with essentially the 
same procedures used with college un-
dergraduates (Andrews et al., 1954;
Spiker, 1960). 


ROTE MOTOR LEARNING 


The sharp distinction that is often 
made between “verbal” and “motor” 
learning tasks would be difficult to 
defend. It is difficult to conceive of a 
motor task for human subjects, the 
learning of which is not subject to the 
influence of verbal processes. At one
extreme, consider a task in which the
subject is required to press a different 
response button to each of several 
stimulus lights differing in hue. If the 
buttons can be identified by distinctive 
names, e.g., by numbers, an adult could 
conceivably learn the task without 
touching a single button. That is, he
could learn “yellow-button 1,” “blue- 
button 2,” etc., prior to any practice on 
the task. Following the verbal learning, 
he could perform on the “motor” task 
without error, although subsequent 
practice might show improvement in 
speed of the responses, the smoothness 
with which they were executed, etc.
(Baker and Wylie, 1950). It is likely 
that most adult human subjects, even 
if not given the prior verbal practice, 
would soon introduce verbalizations to 
facilitate learning. At the other extreme, 
there are tasks that do not seem to be 
so subject to verbal control. Throwing 
a basketball through the hoop with 
the consistency of the professional bas- 
ketball player is a skill that is not ac- 
quired merely by memorizing verbal 
instructions. Improvement in such a skill 
is dependent on the individual’s at- 
tention to the proprioceptive feed- 
back from his responses. There is pres- 
ently no language that permits the 
description and discussion of the 
proprioceptive stimuli that are crucial 
in the development of this type of skill. 
Nevertheless, many of the preparatory 
responses that facilitate the development 
of the skill are subject to instructional 
control. The prospective basketball 
player can be instructed as to the angle 
at which he should face the hoop; he 
may be told the height at which he 
should hold the basketball, and so on.
Despite these considerations, tasks re-
quiring the subject to make nonverbal,
skeletal responses are frequently desig- 
nated as “motor tasks.” But, once again, 
it is well to remember that performances 
on differently named tasks are not 
necessarily governed by different laws. 
Indeed, one of the most important en- 
deavors of the student of human learn- 
ing is to determine whether or not such 
phenomenally different types of learn- 
ing can be understood in terms of the 
same principles. 

Paired-Associate Motor Learning. It 
was noted previously that certain types 
of discrimination tasks may be con- 
sidered as paired-associate motor tasks. 
Numerous other tasks that have been 
studied with children seem to qualify 
as paired-associate motor tasks. Among 
them are the familiar coding or sub- 
stitution tasks of mental tests, such as 
the Wechsler (1944) Digit Symbol 
Test; clerical tasks such as typing and 
telegraphy; and perhaps even the form- 
board tasks often given younger pre- 
school children (e.g., Skeels, 1933). In 
each of these tests, the subject is re- 
quired to associate a different motor 
response with each stimulus. Most 
studies of children’s performance on 
such tasks have been concerned with 
obtaining normative data rather than 
with the study of variables that affect 
the learning of the tasks. 

The task used in Kirkwood’s (1926) 
study of children’s learning also belongs 
to this class. She had preschool children 
associate blocks to picture stimuli, 
where the block and picture of each 
pair had a slight resemblance to each 
other. The experimenter showed the 
child a picture and asked him to select
the block that went with it. The re-
sponse measure was the number of
correct associations the child gave each
time through the 20 pairs. Kirkwood’s
task has the advantage that the strength
of associations between stimuli and re-
sponses can be manipulated by varying
the degree of within-pair similarity, a
convenient means of varying the dif-
ficulty.

Spiker and Holton (1958) developed
a paired-associate motor task analogous
to paired-associate verbal learning with
the anticipation method. The apparatus
permits the presentation, one at a time,
of six stimulus lights differing in hue.
To each light, the subject is required
to push a different one of nine push-
buttons, arranged in a semicircle. Beside
each button is a small pilot lamp. The 
subjects were given two seconds to 
select the correct button (the anticipa- 
tion interval). At the end of this in- 
terval, the pilot lamp beside the correct 
button was turned on and, with the 
stimulus light, remained on for two sec- 
onds (the joint-presentation interval). 
At the end of the joint-presentation in-
terval, both the stimulus light and the
information light were turned off for 
two seconds (the interstimulus in- 
terval). In a similar procedure Norcross 
(1958) used automatically presented 
pictures as stimuli. 
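
The timing of a trial on this apparatus can be laid out as a sequence
of events. The sketch below assumes only the three two-second intervals
described above (anticipation, joint presentation, interstimulus); the
function name `trial_timeline`, the event labels, and the hue names are
hypothetical and serve only to make the sequence explicit.

```python
def trial_timeline(stimuli, anticipation=2.0, joint=2.0, interstimulus=2.0):
    """Return (time_in_seconds, event) tuples for one pass through the list.

    Each stimulus occupies anticipation + joint + interstimulus seconds.
    """
    events = []
    t = 0.0
    for hue in stimuli:
        events.append((t, f"stimulus light {hue} on"))
        t += anticipation            # subject must select a button by now
        events.append((t, f"pilot lamp for {hue} on"))
        t += joint                   # stimulus and information light together
        events.append((t, f"stimulus light {hue} and pilot lamp off"))
        t += interstimulus           # dark period before the next stimulus
    return events


if __name__ == "__main__":
    for time, event in trial_timeline(["red", "blue", "green"]):
        print(f"{time:5.1f} s  {event}")
```

With the default values each stimulus takes six seconds, so a list of
six lights would occupy 36 seconds per trial, exclusive of any
between-trial interval.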

The motor paired-associate task has 
several advantages over its verbal coun- 
terpart. First, it is easy to equate the 
difficulty of the stimulus-response con- 
nections, since given stimuli do not 
tend to elicit particular button selec- 
tions. In some experiments it is desirable 
to equate two halves of a task so that 
the same subject may perform under
both experimental and control condi-
tions. To equate or control the initial
differences in the associative strengths
of the stimulus-response pairs of verbal
tasks requires elaborate experimental
or statistical procedures. Experimental
control requires preliminary research
to select pairs with equal associative
strengths. Statistical control necessi-
tates the preparation of several word
lists, which must then be used in a
counterbalancing design. The statistical
procedure often results in an equivocal
interaction between lists and experi-
mental conditions. Comparable control
in the motor task can be achieved with
fewer sets of stimulus-response pairings
(“lists”) and with less likelihood of an
ambiguous interaction. Second, pushing
a button is a clear-cut discrete response, 
which makes observation and recording 
easy. Third, the response provides a 
mechanical energy, which permits con- 
venient measurement of response times. 
The use of the voice key to accomplish 
the same purpose in verbal learning is 
complicated by stammering and stutter- 
ing. Finally, since the buttons are avail- 
able to the subject at the beginning 
of the task, the learning that occurs
reflects the acquisition of specific stimu- 
lus-response associations and is uncom- 
plicated by the subject’s learning which 
responses are appropriate to make in 
the situation. In the usual verbal paired- 
associate procedure the subject must 
first learn which words he can use as 
response words and then associate them 
with the appropriate stimuli. The de- 
gree to which this list learning affects 
performance is not yet known. 
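
The counterbalancing of lists against conditions mentioned above can be
illustrated with a simple rotation. This is a sketch of one common
arrangement (a cyclic Latin square), offered under the assumption that
there are as many lists as conditions; it is not the design of any
particular study cited here, and the names `counterbalance`, "A", "B",
"experimental", and "control" are hypothetical.

```python
def counterbalance(lists, conditions):
    """Assign lists to conditions for each subject group by rotation.

    Returns one dict per group, mapping condition -> list. Across the
    groups every list serves once under every condition.
    """
    if len(lists) != len(conditions):
        raise ValueError("need as many lists as conditions")
    n = len(lists)
    return [
        {conditions[c]: lists[(c + group) % n] for c in range(n)}
        for group in range(n)
    ]


if __name__ == "__main__":
    for number, plan in enumerate(
        counterbalance(["A", "B"], ["experimental", "control"]), start=1
    ):
        print("group", number, plan)
```

Because each list appears under each condition equally often, a
difference between conditions cannot be attributed to a particular
list, although a lists-by-conditions interaction may still appear in
the analysis.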

One of the major limitations of the 
motor paired-associate task is the re-
stricted number of response possibilities.
If one wished to give a series of tasks, 
with different stimuli and different re- 
sponses in each task, the foregoing task 
would be inappropriate. A task like 
that of Kirkwood (1926) would be 
better for this type of problem, since 
a number of variations in the charac- 
teristics of the blocks could be manipu- 
lated. 

Maze Learning. The maze learning of 
children has been extensively studied.
These studies include children’s per-
formance on the ball-rolling maze
(Mattson, 1933), the finger-maze
(Wieg, 1932), the slot- or stylus-maze 
(McGinnis, 1929), and the body-maze 
(Wenger, 1933). These mazes usually 
consist of several choice points, with 
either two or three choices available at 
each point. Selection of the correct 
path at each point results in arrival at 
the goal or at the end of the maze. 

The use of the maze, either the small 
hand-maze or the body-maze, permits 
the study of important variables. The 
effect of different types of incentive on 
maze performance was studied system- 
atically by Abel (1936). Jones (1945) 
compared the effectiveness of different 
events to signal correct responses. 
Holodnak (1943) studied the effects 
of signaling correct responses as op-
posed to signaling incorrect responses.
Melcher (1943) compared visual and 
manual guidance in learning the maze. 
McGinnis (1929) and Jones and Yoshi- 
oka (1938) have studied transfer of 
training from one maze pattern to 
another. It would also be possible to 
vary the number and type of cues
the subject receives at the choice point. 
With a slot-maze, in which the blind
alleys are hidden beneath the surface
of the maze, it would be possible to vary 
the visual cues at each choice point, 
increasing or decreasing the distinctive- 
ness of the choice points. 

The usual maze for the study of 
children’s learning can be viewed as a 
sequence of spatial discrimination prob- 
lems, with the choice at each point de- 
termined by previous choices. As such, 
it will be recognized as considerably 
more complex than the corresponding 
paired-associate problem, in which each 
pair may be learned relatively inde- 
pendently of the other pairs. It is prob- 
ably this complexity of the maze task 
that has caused it to lose favor among 
recent investigators of simpler types 
of learning. 

Motor Serial Learning. The experi- 
mental study of motor tasks analogous 
to the verbal serial learning tasks de- 
scribed previously has not yet been 
reported in the literature. Superficially, 
it might appear that the maze parallels 
the verbal serial task. It will be noted, 
however, that only two or three dif- 
ferent responses are involved in maze 
performance. By contrast, many dif- 
ferent responses are used in the typical 
verbal serial task. A motor task more 
nearly analogous to the verbal task 
would involve, for example, ten re- 
sponse-buttons. The subject would have 
to learn a sequence of ten button-push- 
ing responses, utilizing each button only 
once. It would be interesting to dis-
cover whether the phenomena com-
monly observed in verbal serial learning
(e.g., the serial position effect, remote 
associations, and savings on derived 
lists) also appear in the motor analogue. 
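
The proposed ten-button task lends itself to the same scoring used for
verbal serial lists. The sketch below is hypothetical throughout, since
no such experiment is reported: `make_sequence` draws a sequence that
uses each button once, and `errors_by_position` counts errors at each
serial position over a set of attempts, which is the tabulation needed
to look for a serial position effect.

```python
import random


def make_sequence(n_buttons=10, seed=0):
    """Return a random order of buttons 1..n_buttons, each used once."""
    rng = random.Random(seed)
    sequence = list(range(1, n_buttons + 1))
    rng.shuffle(sequence)
    return sequence


def errors_by_position(target, attempts):
    """Count, for each serial position, how many attempts were wrong there."""
    errors = [0] * len(target)
    for attempt in attempts:
        for position, (wanted, pressed) in enumerate(zip(target, attempt)):
            if wanted != pressed:
                errors[position] += 1
    return errors


if __name__ == "__main__":
    target = make_sequence()
    print("sequence to be learned:", target)
    # Three hypothetical attempts at a short three-button sequence.
    print(errors_by_position([3, 1, 2], [[3, 2, 1], [3, 1, 2], [1, 3, 2]]))
```

A bowed curve in the resulting counts, with more errors at the middle
positions than at the ends, would parallel the serial position effect
found with verbal lists.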

Complex Motor Skills. In all the tasks
considered thus far it has been assumed
that the responses involved are already
in the repertoire of the subject. In such
tasks what is learned, then, is to make
the appropriate response to each stimu-
lus. The present section considers motor
tasks in which the responses must be to
some degree developed, either com-
pletely or by the integration of ele-
mental responses already existing in the
subject’s repertoire. The literature on
children’s learning contains numerous
studies of the learning of motor skills,
although few of the skills have been
studied systematically. For example, the
effects of practice have been investi-
gated with such tasks as drawing, writ-
ing, climbing, cube manipulation,
scissor cutting, ring tossing, throwing
at targets, performance with mirror re-
versals, and so on (Munn, 1954, pp.
387-407). In most cases the experi-
menter’s major interest appears to have
been whether or not children’s per- 
formances on his particular task were 
subject to improvement with practice. 
Typically, the variables studied were 
individual difference variables (age, sex, 
intelligence, school grade, etc.) rather 
than variables subject to experimental 
manipulation (e.g., type of instruction, 
type, amount, and frequency of reward, 
and type, amount and distribution of 
practice). Consequently, although some 
normative and developmental data are 
available for performance on these types 
of task, relatively little is known about 
the experimental variables that affect 
the learning of them. In connection with 
the older studies, it would seem that the 
primary criterion for selection of 
the type of motor task studied was its 
relatively frequent occurrence in every-
day life and/or its social significance.
Although the psychologist may often
make experimental use of these “natural
situations,” it is probably more fre-
quently the case that his hypotheses
require less complex, better-controlled
situations.

The student of children’s learning
who is interested in motor skills would
do well to examine some of the motor
tasks conventionally used with adults.
For example, Langhorne (1933) studied
performance on the Renshaw-Weiss
pursuitmeter as a function of age with
subjects as young as 7 years. More
recently, Ammons, Alprin, and Am-
mons (1955) investigated performance
on the rotary pursuit apparatus with
third-grade children. Similar adapta-
tions could be made for the Star Dis-
crimeter (Duncan, 1953), the two-hand
coordinator (Shephard and Lewis,
1950), and the modified Mashburn ap-
paratus (Lewis and Shephard, 1950).
Children normally find such tasks in- 
teresting and challenging, and compari- 
son of their performances on these 
devices with those of adults would 
provide a significant addition to our 
knowledge of human learning. 


Concept Formation 


It is difficult to identify the common 
features of all experiments that have 
been referred to as concept formation 
studies. As Vinacke (1951) implies in 
his thorough review of methods and 
results of concept formation experi- 
ments, there are some profound dif- 
ferences within this group of studies 
that make questionable the practice of
referring to all of them by the same
name. Most such studies, however, can 
be characterized by the fact that the 
subject learns to make the same response 
(motor and/or verbal) to each member 
of a set of stimuli and learns not to make
this response to stimuli not belonging 
to the set. In most studies the subject 
is given several sets of stimuli concur- 
rently and is required to learn a 
different response to each set. In addi- 
tion, a concept is said to be attained 
only when the subject makes the ap- 
propriate response to stimuli he has not 
previously observed in the experiment. 
Thus the members of a set of stimuli
must have some feature in common
(color, form, texture, size, function,
name, material, etc.), and the members
of different sets must be discriminable
in some respect. The latter requirement
excludes tasks in which the subject 
learns the same name for each member 
of a set of stimuli arbitrarily and ran- 
domly assembled by the experimenter, 
since the subject could not be expected 
to make the response to a new stimulus 
without having previously learned to 
do so. 

Although some students appear to re- 
strict concept formation to situations 
involving verbal responses of human 
subjects, others (e.g., Vinacke, 1951; 
Long, 1940) would include the non-
verbal responses of preverbal children.
In terms of available data, there is no 
good reason to exclude infrahuman sub- 
jects if preverbal human subjects are 
to be included. If these considerations
are accepted, the formation of con- 
cepts in animals has already been dem- 
onstrated in transposition experiments. 
Thus, when chimpanzees are trained to
select the smaller of two particular
stimuli and subsequently choose the
smaller of a new pair of stimuli, they
have met the criterion of concept at-
tainment.

When the two-year-old child is told
on successive occasions that his wagon, 
his mother’s dress, the catsup, the stop 
light, certain of his blocks, etc., are 
“red,” he is presumably learning the 
concept “red.” When he points to the 
first fire-truck he has ever seen and says 
“red,” he is demonstrating some learn- 
ing of the concept. If he consistently 
responds to other red things with “red” 
and does not use the term for non-red 
things, he demonstrates an understand- 
ing of the concept. When reference is 
to this type of original learning, the 
term concept learning is used. 

Now consider a set of cards consisting 
of several instances of triangles, circles, 
and squares, with several instances of 
each form colored red, green, or blue. 
If an adult is asked to sort a deck of 
such cards, he may do it in terms of 
color, form, or combinations of both 
color and form. If the experimenter 
consistently rewards him when he sorts 
on the basis of color, the subject will 
eventually sort the entire deck consist- 
ently on this basis. The subject discovers, 
through experimenter prompting, the 
concept the experimenter considers 
relevant. The learning that takes place 
in this type of situation is called concept 
discovery. In concept discovery the 
subject learns to attend to dimensions or 
features of the stimuli, for which he
has already learned names, in order to
solve the problem the experimenter has
devised for him.
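
The card deck in this example, and the alternative bases on which it
may be sorted, can be written out explicitly. The following sketch is
illustrative only; the number of copies of each card and the names
`make_deck` and `sort_deck` are assumptions, not details of any study
cited here.

```python
from collections import defaultdict
from itertools import product

FORMS = ("triangle", "circle", "square")
COLORS = ("red", "green", "blue")


def make_deck(copies=2):
    """Build a deck with `copies` instances of every form-color combination."""
    return [
        {"form": form, "color": color}
        for form, color in product(FORMS, COLORS)
        for _ in range(copies)
    ]


def sort_deck(deck, dimension):
    """Sort the deck into piles according to one dimension of the cards."""
    piles = defaultdict(list)
    for card in deck:
        piles[card[dimension]].append(card)
    return dict(piles)


if __name__ == "__main__":
    deck = make_deck()
    by_color = sort_deck(deck, "color")
    by_form = sort_deck(deck, "form")
    print({color: len(pile) for color, pile in by_color.items()})
    print({form: len(pile) for form, pile in by_form.items()})
```

Both sortings are equally consistent with the deck itself; only the
experimenter's reward tells the subject which dimension is the relevant
one.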

Although something is learned in
both concept learning and concept dis-
covery, there are obvious differences
in what is learned in the two situations.
The child learns, in effect, that objects
having “that property” are called “red.”
The adult learns that the experimenter
wishes him to call the red objects
“meef” or to push the left button when
red stimuli are presented. It would be
surprising, indeed, if exactly the same
laws governed learning in the two situa-
tions.

Clear-cut studies of concept learning
are rare in the literature, and the sys-
tematic study of it has scarcely begun.
The excellent study of children’s con-
cept learning by Long (1940) is still
a prototype that has not been sub-
sequently utilized. Experiments by
Welch (1938) and Welch and Long 
(1940a,b) also provide examples of the 
study of concept learning. Most con- 
cept formation experiments with chil- 
dren, however, have not been concerned 
with the teaching of concepts. They 
have been concerned with testing to 
determine the concepts children have 
already acquired at different ages (e.g., 
Hicks and Stewart, 1930; Thrum, 1935).
In this respect, they resemble the con- 
cept discovery experiments conducted 
with adults (e.g., Heidbreder, 1946, 
1947; Buss, 1950). The early experiment 
by Hull (1920) is one of the few sys- 
tematic investigations of concept learn- 
ing in adults. The referents of Hull’s 
concepts were Chinese characters, un-
familiar to his subjects, imbedded in
complex stimulus patterns. Thus his
subjects were required to learn new
concepts.


METHODS FOR STUDYING 
CONCEPT FORMATION 


The Paired-Associate Method. The
procedure used by Hull was subse-
quently modified and improved by
Heidbreder in a series of experiments
with exemplary methodology (Heid-
breder, 1946, 1947; Heidbreder,
Bensley, and Ivy, 1948; Heidbreder and
Overstreet, 1948). The procedure is
similar to that of paired-associate learn-
ing by the anticipation method. Several
series of stimuli are prepared, each of
which contains a different member (in- 
stance) of each of several classes of 
stimuli (concepts). The subject is given 
the first series of these stimuli and is 
required to learn a different nonsense 
syllable to each stimulus in the series. 
When the first series has been learned, 
the subject is given the second; when 
the second series has been learned, he is 
given the third; and so on. The subject 
learns the same nonsense syllable to each 
instance of a given concept. Thus, to 
the circular stimulus in each series, he 
learns the syllable “Fard,” to the 
stimulus in each series having exactly 
two identical components, he learns 
“Ling,” and so on. The primary meas- 
ure of the speed of concept attainment 
is the number of the series on which the 
subject begins to anticipate consistently 
the correct nonsense syllable for a given 
concept. This method is appropriate for 
the study of either concept discovery or 
concept learning. 
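
The primary measure described above, the number of the series on which
the subject begins to anticipate consistently, can be computed from a
record of successes and failures. The sketch below rests on two
assumptions that the text does not settle: that one anticipation per
series is scored for each concept, and that "consistently" means
correct on that series and on every later one. The function name
`series_of_attainment` is hypothetical.

```python
def series_of_attainment(correct_by_series):
    """Return the 1-based number of the series from which every
    anticipation for a concept is correct, or None if never attained.

    correct_by_series -- sequence of booleans, one per series, True when
                         the anticipation for this concept was correct
    """
    attained = None
    for number, correct in enumerate(correct_by_series, start=1):
        if correct:
            if attained is None:
                attained = number   # start of a possible unbroken run
        else:
            attained = None         # a later error cancels the run
    return attained


if __name__ == "__main__":
    # Hypothetical record for one concept over five series.
    print(series_of_attainment([False, True, False, True, True]))
```

A lower value indicates faster attainment; comparing the values
obtained for different concepts gives their order of attainment for a
subject.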


The Stimulus-Sorting Method. Heid-
breder (1949) also utilized a card-sort-
ing procedure. In this case the stimuli
were placed on cards, and the subject 
was asked to sort the deck of cards 
into a specified number of classes. Since 
Heidbreder was primarily interested in 
the order in which different types of 
concepts would be discovered, she per- 
mitted the subjects to classify the com- 
plete deck without prompting by the 
experimenter. The basic measure of the
primacy of the concept was the number 
of subjects who correctly sorted each 
concept. It would also have been pos- 
sible to prompt the subject after each 
card was classified, with the number 
of deck-sortings required to classify 
correctly as the measure of the speed of 
concept attainment. 

Simple Discrimination-Learning 
Method. Long (1940) developed a 
nonverbal method for the study of con- 
cept learning in children with relatively 
low verbal facility. It is essentially 
a simultaneous discrimination-learning 
procedure that is equally appropriate 
for infrahuman subjects. Long used two 
compartments with glass windows on 
the front. In one of the compartments 
he placed a rubber ball and in the other 
a three-dimensional cube. To obtain a 
reward, the subject learned to press 
the window of the compartment con- 
taining the ball, regardless of the 
compartment it occupied. After the sub- 
ject had reached a high performance 
criterion on this problem, other stimuli 
were placed in the compartments. If the 
subject continued to respond to an 
orange, a golf ball, a circle, the more 
circular of two ovals, etc., he had
learned the concept of roundness. The
necessary adaptations to extend this 
method to infrahuman subjects are 
obvious. 

A procedure resembling the succes- 
sive discrimination method has been 
used with adults by Buss (1950). Three- 
dimensional stimuli differing in height, 
width, and color were presented one at 
a time. The subject was rewarded for 
pushing one response key to the tall 
stimuli and a different one to the short 
stimuli. Since his subjects were college 
undergraduates, Buss’s study deals with 
concept discovery rather than concept 
learning. 

Interview-Questionnaire Method. 
This method is exemplified in its 
simplest form by Welch (1940), al- 
though it has been used extensively by 
Piaget (e.g., 1930). Welch’s subjects
were asked questions that contained 
the concepts under investigation. For 
example, a child might be asked, “What 
kind of food do you like best?” If the 
child’s answer referred to a specific 
food, the answer indicated that the 
child understood, at least to some 
degree, the concept “food.” The 
method is better adapted to the study 
of the concepts available to the subject 
than it is to the teaching of new con- 
cepts. 

A related procedure was used by 
Welch and Long (1940a, 1940b). In 
these experiments the subjects were 
given instructions that contained con- 
cepts previously taught to the subjects. 
The instructions required the children 
to perform certain acts with the relevant 
stimuli. If the children followed the 
instructions, it was accepted as evidence 
of concept attainment.


General Considerations


A review of the literature on research
methods in children’s learning indicates
that a wide variety of techniques is
presently available for the study of
simpler types of learning. Practical
methods have been developed for the
study of classical and instrumental con-
ditioning, discrimination learning, rote
verbal and motor learning, and concept
formation. Moreover, many procedures
utilized in the study of infrahuman and
adult human learning appear to be
readily adaptable for use with children.
Inadequate apparatus and procedures
are no longer a serious obstacle to the
intensive and extensive study of chil-
dren’s learning, although future re-
search will undoubtedly result in the
refinement of all methods.

Although the student of children’s 
learning may be encouraged by the 
variety of serviceable methods and 
procedures, the sparsity of application 
of these methods provides no cause for 
complacency. The systematic study of 
a given area of children’s learning is still 
rare. The majority of experiments are 
single-shot investigations rather than 
series of thorough studies of particular 
problems. It is to be hoped that the 
future will bring the systematic research 
with children that can already be ob- 
served in the study of the rat in the 
T-maze, the infrahuman primate in 
discrimination problems, the adult in 
eyelid conditioning, and the college 
sophomore in verbal learning tasks. 
There is a serious need to analyze the 
learning performance of children in 
the relatively complex rote learning and 
concept formation situations in an
attempt to apply the principles obtained
from the simpler conditioning and dis-
crimination-learning situations. System-
atic investigations of children’s learning
in these areas would do a great deal to
enhance our understanding of principles
of general psychology.

Many of the procedures that have
been discussed in the preceding pages
appear to be appropriate for applied re-
search as well as for the basic research
for which they were developed. The
recent research program of Skinner
(1958) and his associates provides an 
excellent illustration. These investiga- 
tors are bringing laboratory methods to 
the teaching of academic content. They 
have devised apparatus—teaching ma- 
chines—that permits the administration 
of carefully prepared teaching programs
to the learner. These machines enable
the experimenter to control and manip- 
ulate with laboratory precision such 
psychologically potent factors as 
amount of practice, immediacy of rein- 
forcement, stimulus presentation meth- 
ods, and problem sequence, in order to 
maximize the transfer of previous learn- 
ing to new situations. This approach 
promises to place the findings of basic 
research in learning at the disposal of 
those whose task it is to cause children 
to learn. How successful this attempt 
will be depends to a very great degree 
on the availability of knowledge ob- 
tained from basic research on children’s 
learning.


The Study of Problem Solving and Thinking


Bärbel Inhelder 
Benjamin Matalon 
University of Geneva 


The child’s solution of cognitive prob- 
lems depends on the one hand on the 
nature of the particular problem and on 
the other hand on his own intellectual 
equipment. Whereas most psychological 
studies of problem solving have been 
concerned with the outcome of these 
intellectual processes, others have stud- 
ied the nature of the processes by which 
the child attempts to adapt himself to 
new situations. We have selected for re- 
view those methods designed for the 
study of the processes rather than of 
their results. 


We are grateful to Miss Eleanor Duckworth 
for the translation of this chapter and to Dr. 
G. N. Seagrim and Dr. D. E. Berlyne for 
their advisory assistance.

These processes can be studied fruit-
fully by comparing them with animal
behavior on the one hand and with 
adult thought on the other. But although 
the techniques used are closely related 
to those used in animal studies, with an 
increasing use of language as the phe- 
nomena under study approach adult 
thought, they cannot be modeled too 
closely on them. Techniques used in 
studies of animals and adults have been 
found to be fruitful to the extent that 
they have been harmonized with the 
characteristics of the infantile processes 
themselves. 

We must recognize that the meth- 
ods applied to the study of problem 
solving in the child cannot be reduced 
to the study either of external behavior 
(as in the case of animals) or of lan- 
guage taken alone (as is frequently the 
case in the study of adult thought) but 
must equally take note of intellectual 
operations. These operations are con- 
sidered to proceed from actions (the 
behavioral aspect) but to be structured 
later in systems that, though based on 
language, involve coordinations more 
general than the mere linguistic con- 
nections. 

We have classed the methods into 
three groups, according to the phenom- 
ena to be studied: 


1. The sensori-motor processes. It is 
in this domain that the methods de- 
rived from animal psychology have 
been most employed. Whether they are 
accompanied by verbal elements or 
not, the role of language is reduced to 
a minimum, and it plays only an ac- 
cessory part here. 

2. Thinking proper. The methods 
used to study the particular characteris- 
tics of children’s thinking—including 
the discovery of a logical principle, the
acquisition of an experimental method, 
and the interpretation of phenomena of 
the physical world—must be adapted to 
different levels of child thought if they 
are to avoid the dangers of adulto- 
morphism. 

3. Concept formation. Although it is 
artificial to separate the formation of 
concepts, which are the fundamental 
instruments of thought, from thought 
elaboration itself, it seemed to us pref- 
erable to devote a special section to it 
because of the importance of the prob- 
lem and the specificity of the methods 
employed. These methods are closely 
bound up with the way in which dif- 
ferent authors envisage the concept- 
formation processes, some attributing 
fundamental importance to the cultural 
verbal models, others to the structuring 
activity of the subject in the elaboration 
of the concepts. 


Since we have been asked to write 
from our own experience as experi- 
menters, the methods that are most fa- 
miliar to us have necessarily received 
more than their share of emphasis in the 
discussion that follows. 


Sensori-motor Processes 


LEARNING VERSUS INSIGHT 


With infants, and sometimes even 
older children, problem solving and 
learning have rarely been formally dis- 
tinguished, at least from the point of 
view of the experimental design. It is 
striking that in Carmichael’s Manual of 
Child Psychology problem solving or 
puzzle solving is discussed only by 
Munn (1954) in the chapter devoted to
learning. Munn justifies this by the following:


Every learning situation presents a problem
of one sort or another. Escaping [a]
maze, buttoning a coat... are problems
when confronted initially (p. 423).


Similarly, every problem, even the
most abstract, presents an aspect of
“learning” from perceived data, trial
and error, etc., which makes the distinction
difficult from the outset. Still, one
can say, by and large, that one speaks of
a “problem” when the situation permits
of a solution by logic or “insight,”
whereas one would more readily class as
“learning” an experience in which the
correct behavior can be attained only
gradually by successive trials. However,
terminology is likely to vary considerably
from one author to another.
This apparent confusion between 
two things which seem at first sight 
very different, is justified by the diffi- 
culty of distinguishing them in practice; 
the distinction is often made after the 
conclusion of the experiments and con- 
cerns the behavior of the subject more 
than the problem. As we will see later, 
this is especially the case with all ex- 
periments bearing on insight. 
Moreover, with very young children 
as with animals, one can only set the 
task and convey what is expected of 
the subjects through a process of train- 
ing. Even with older subjects verbal 
elements are to be avoided, since they 
present a problem in interpretation, 
both for the subjects trying to under- 
stand the instructions and for the ex- 
perimenter attempting to interpret their 
verbal responses. These experiments are 
thus conducted as if they were learning
experiments, in which the subject is led
to grasp the problem as a result of
his first successes or failures. The study
of sensori-motor or practical intelligence
obviously forms an excellent field
for such methods, for at this elementary
level sensori-motor behavior, thinking,
and learning are closely associated. A
great part of the research on these problems
originated in Köhler’s work on
the intelligence of apes. After his publication,
several workers tried to find
out whether the types of behavior he
observed (i.e., success through trial and
error or insight) are also found in children.
Most of the problems presented
to young children in this connection
involve a detour or an intermediary
(the construction of tools can itself
be considered a kind of detour).


USE OF AN INTERMEDIARY 


One kind of problem frequently 
studied involves placing within the 
child’s sight, but beyond his immediate 
reach, a lure consisting of a piece of 
candy, a toy, or some other attractive 
object. To obtain this reward, he must 
either draw the lure towards him, move 
towards it, or do away with the ob- 
stacle. 

Alpert (1928) studied very nearly 
the same situations as Köhler. In one 
of his experiments, for example, the 
reward was suspended from the ceiling 
in such a way that it could be reached 
only by moving over a nearby block of 
wood, and climbing onto it. 

Richardson (1932) placed a toy in 
the sight of the subject (28 to 52 weeks 
old) but too far away to be reached 
directly. In front of the child were
several strings, only one of which was 
attached to the object. The arrangement 
of these strings varied from one situa- 
tion to another: some of them ended 
near the toy without being attached to 
it; others went in different directions, 
and some of the correct strings made 
a detour just before reaching the toy. 
Like Alpert, Richardson noted the 
various attempts made by the child, any 
remarks he might make, and the time he 
took to reach the goal. He also took 
into account, as a criterion of insight, 
the facial expression and the suddenness 
of the response. No words were spoken; 
the experimenter merely attracted at- 
tention to the toy by showing or shak- 
ing it. 

The lever is another intermediary 
and requires a movement in the op- 
posite direction to the one that would 
be normal in order to bring the reward 
to the subject. This problem has been 
studied especially by Richardson (1934) 
and Moriya (1937). 

Finally, there is the detour proper, 
with locomotion towards a goal. In 
McGraw’s (1935) experiment the child 
had to step over a transparent partition 
in order to reach the object placed be- 
hind it. Lewin (1936) and Fajans 
(1933) simply used a barrier, which the 
subject had to walk around, so that for 
a moment he had to lose sight of the 
goal. Kendler and Kendler (1956) used 
a logical rather than motor intermediary 
to study what they call “inferential 
behavior.” The child was placed in 
front of a box with two apertures out 
of which came two strings, X and A. 
The experimenter had him pull one of 
these strings, X, and he was made to see 
that by so doing he pulled out a little
object, Y. Then by pulling A, he obtained
another little object, B. Finally,
he observed in the same way that by
pulling the object Y he could bring out
of the box a little car, much more 
attractive than B or Y. Each observa- 
tion was repeated two or three times. 
In the test phase of the experiment all 
objects were put back into the box; the 
child was then asked to make the car 
come out again. The subject had to 
understand that he must first obtain 
Y and therefore must pull X. Different 
groups of subjects made the three ob- 
servations in all possible orders. Other 
control groups made some of the ob- 
servations but not all. 
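
The inferential structure of this task can be made concrete with a small sketch. The Python fragment below is only an illustration and is not part of Kendler and Kendler's procedure: it records the three observed relations (X yields Y, A yields B, Y yields the car) and chains backward from the goal to the string that must be pulled first. The names `OBSERVED` and `first_action` are hypothetical.

```python
# Relations the child observes in training: pulling the key brings out the value.
OBSERVED = {"X": "Y", "A": "B", "Y": "car"}


def first_action(goal, observed):
    """Chain backward from the goal to the first thing that must be pulled."""
    # Invert the relations: for each outcome, what has to be pulled to obtain it?
    produced_by = {outcome: action for action, outcome in observed.items()}
    step = goal
    seen = set()  # guards against a circular set of relations
    while step in produced_by and step not in seen:
        seen.add(step)
        step = produced_by[step]
    return step


print(first_action("car", OBSERVED))  # X
```

A subject who pulls A in the test phase has, in these terms, failed to link the separately observed segments X to Y and Y to the car.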

Problems relating to practical in- 
telligence in older children (3 to 7) 
have been studied by Rey (1935). One 
of the problems involving the use of a 
tool and a detour is the following. The 
apparatus consists of a crate, three sides 
of which are transparent. One of these 
sides has an opening in it. On the 
bottom of the crate in the center is a 
sort of drawer, which is open on one 
side; it can pivot on an axle and thus 
take various positions. The apparatus is 
placed on a table. A 5-centimeter cube 
is placed in the crate, in the center of 
the drawer; two sticks are available, and 
the subject is to use them to take the 
cube out of the crate. The experimenter 
explains that the cube contains a piece 
of candy. He simply indicates with his 
hand the sticks that are necessary to 
obtain it. 

Bussmann (1946) examined problems
of “transfer” from the same point
of view. She presented to the child
problems for which the necessary tool
had to be constructed by combining
certain elements provided by the experimenter.
When the subject had succeeded,
spontaneously or with hints, a
similar problem was set him, but one in
which the dimensions of the material
made it necessary to combine greater
numbers of elements. For example, a
small token with a loop had to be
taken out of a high narrow receptacle
without turning the receptacle over.
The child was given hooks, each of
which was too short to reach the loop.
The only solution was to fit two
hooks together. In the transfer situation
the jar was deeper and five hooks
were needed to reach the token. Finally,
Bussmann studied the generalization of
the solution to a slightly different situation:
this time, instead of hooks, small
sticks had to be fitted together to form
a long rod.

Most of these experiments and others 
following the same pattern were mod- 
eled exactly on animal psychology. The 
role of language was reduced to a mini- 
mum, and the method of analysis was 
the same, since the aim was to compare 
the behavior of children and animals. 
The experimenters tried to see if they 
could find in the child the behavior that 
was recorded in animal experiments, 
such as trial and error or insight. Hence 
the experimental settings were as similar 
as possible, with the important excep- 
tion of the motivational factors. 

It is clear that an animal must be 
hungry at the time of the experiment, if 
food is the reward, whereas a child can 
understand without an immediate need 
the few words of the experimenter, 
which explain what is to be done. The 
attractiveness of the reward maintains 
interest but does not serve as a “reinforcement”
as it does for the animal.


TEMPORAL RELATIONS 
Another kind of problem requires
the subject to introduce temporal elements
into his responses and thus a more
complete internalization. This is the
case, for instance, in the
double-alternation problem, in which
the subject has to give one of the possible
responses twice in succession, then
the other twice in succession, and so on.
The problem demands a more thoroughgoing
internalization, since the
subject has to behave differently in the
same situation, according to the temporal
position of the trial.

Experiments on double alternation
fall into two categories. First, the tem- 
poral maze consists of a chamber di- 
vided into two parts, each closed by a 
door that can be controlled silently by 
the experimenter. The right-hand door 
is opened for two consecutive trials, 
then the left-hand door twice in succes- 
sion, and so on. This device, first used 
by Hunter (1913), has been widely used 
with animals, especially rats and mon- 
keys. 
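
The double-alternation schedule itself is simple to state exactly. The following Python sketch is offered only as an illustration; the function names `correct_side` and `score` are hypothetical, and trials are numbered from zero for convenience. It generates the right, right, left, left sequence described above and scores a series of responses against it.

```python
def correct_side(trial):
    """Return the rewarded side for a 0-based trial index under double alternation."""
    # Trials 0-1 are right, 2-3 left, 4-5 right, and so on.
    return "right" if (trial // 2) % 2 == 0 else "left"


def score(responses):
    """Count how many responses match the double-alternation schedule."""
    return sum(r == correct_side(i) for i, r in enumerate(responses))


schedule = [correct_side(i) for i in range(8)]
print(schedule)
```

A subject who merely alternates sides on every trial matches this schedule on exactly half of the trials, which illustrates why the correct response depends on the position of the trial in the series and not on the stimulus situation alone.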

Gellermann (1931) took up the same 
device with human beings, both chil- 
dren and adults. Each trial consisted of 
eight successive responses, separated by 
intervals of approximately fifteen sec- 
onds. In the discussion of the results, 
particularly in comparing animals and 
human beings, Gellermann emphasizes 
the fact that differences in experimental 
procedures for humans and animals are 
related to differences in motivation. 
Whereas the animals were strongly motivated
(by hunger or fear of electric
shock), human subjects did not antici- 
pate any reward and were merely stim- 
ulated, if necessary, by the experi- 
menter’s instruction to “keep going.” 
Because of this fundamental difference 
in motivation, the number of trials per 
session was varied: with human beings, 
it was possible to continue training until 
the task was learned during the course 
of a single session; with animals, their 
motivation being short-lived, it was 
necessary to employ several short ses- 
sions. 

The temporal maze takes up consid- 
erable space. Gellermann (1931), with 
monkeys, and Hunter and Bartlett 
(1948), with children, adopted a sec- 
ond, less cumbersome method. The sub- 
ject is placed in front of two boxes. 
One of them contains a reward, namely, 
the right-hand one twice running and 
then the left-hand one twice running. 
These boxes are placed in front of a 
screen that hides the experimenter and 
his movements. First the experimenter 
lets the subject familiarize himself with 
the apparatus by manipulating it freely, 
opening the boxes, etc. He then tells 
him that he is going to put a piece of 
candy in one of the boxes. The subject 
opens one of them, and if the candy is 
not in it he looks in the other. 

The authors, in conducting experi- 
ments of this kind, noted that it is important
that the subject see as little as
possible of the experimenter, since he
will tend to be distracted by searching 
for irrelevant cues other than the suc- 
cessive positions of the reward. 

Another means of studying temporal 
relations is by delayed responses. From 
among several alternatives, the right 
choice is indicated by a signal that disappears
a certain time before the subject
is allowed to react. As in the case of 
double alternation, the delayed response 
situation has been widely used in com- 
parative psychology, since it can be 
applied with rats, monkeys, etc., as 
well as children. The principal variables 
studied have been the interval between 
the disappearance of the signal and the 
response, the number of possibilities, 
and the nature of the different possible 
cues that the subject has to discriminate. 

In the first research of this type 
(Hunter, 1913) the subject had to 
choose one of three boxes, the correct 
one being indicated by a lamp, which 
was lighted a certain time before the 
subject had to make his choice. Later, 
the same author simply hid a toy in 
one of the boxes within view of the 
child. Miller (1934) considered the 
problem of the cues discriminated by 
the child, and he made use of the de- 
layed reaction as a means of isolating 
certain factors, such as position and 
color. The experimenter put a toy in 
one of the boxes and hid them behind 
a screen, which he withdrew after a 
fixed time to allow the child to make 
his choice. The experiment started with 
a series of trials intended to explain the 
situation to the child and during which 
the toy remained in the box in which 
the experimenter had placed it. Then, 
in the crucial trials, the boxes were 
changed, and each one now contained 
a toy. The cue that determined the 
choice (color or position) was noted. 
It is clear that this method goes beyond 
the simple study of memory, which is 
all it may seem to involve at first glance, 
and that it constitutes an instrument of 
wider scope applicable to the study of
the more general problem of discrimination.


SENSORI-MOTOR INTELLIGENCE 


Ever since the work of Preyer (1882)
on the beginning of mental activity in
the child, the psychology of early infancy
has constantly been enriched
by collections of observations. Three
methods have been used:

1. The monograph, which contains
information concerning the outstanding
facts about one child.

2. The inventory or atlas, which records
the presence of a behavior pattern
at a given stage of the development
in a sampling of children.

3. The longitudinal study, which ob- 
serves the same child at regular inter- 
vals in characteristic situations whose 
symptomatic value has previously been 
shown by the first two methods, espe- 
cially the second. 


Although these methods have greatly 
contributed to our knowledge of the 
development of motor and emotional 
functions or of the over-all course of 
mental development, they give us little 
information about the formation of 
cognitive functions or, in particular, 
about the acts of intelligence involved 
in problem solving. 

It is true that several scales of mental 
development include problem-solving 
items for the preverbal period. The 
“baby tests” of Bühler-Hetzer (1932/1935),
among others, make use of tests
such as K. Bühler’s (1918) string prob- 
lem. This test consists of placing a toy 
with a string outside the child’s immedi- 
ate reach but in such a way that the end 
of the string is accessible to him. It has
been shown that at first infants do not
know what to do with the string,
whereas 75 per cent of babies 11 months
or older are able to make use of it to
obtain the toy. These methods give
us information only about the presence
or absence of problem-solving behavior,
not about its formation. The important
question that arises is this: did this behavior
appear abruptly without antecedent,
or did it develop progressively
out of a more elementary pattern of
behavior—grasping an object, for instance?


This “genetic” approach to problem 
solving is characteristic of Piaget’s 
investigations and of his method, 
which we will call “developmen- 
tal.” In fact, Piaget has studied the 
question—whose bearing is as much 
epistemological as psychological—of 
the formation of one of the first in- 
variants in the child’s knowledge of the 
external world: the schema of the per- 
manent object. In his two monographs 
(1936/1953, 1937/1955) devoted to the 
development of the first acts of intelli- 
gence in his three children he made use 
of a mixture of observation and experi- 
ment, a sort of functional exploration 
in which experiments are suggested by 
previous observations. 

For instance, Piaget was intrigued by 
a curious observation: a 13-month-old 
child was playing with a ball, which 
first rolled under an armchair where 
he could easily find it, then under a 
couch where he could not reach it. 
After a short unsuccessful search under 
the couch, the child went back to look 
for his ball under the armchair. This 
observation led Piaget to create experimental
situations in which he could
observe systematically the behavior of 
children towards objects disappearing 
from the visual field or moving about 
within it: 

1. An object of interest (in this case 
a watch) is taken away from the child 
while he is handling it and is placed 
within his sight, a slight distance away. 

2. The object is hidden under a pil- 
low. 

3. It is hidden under pillow A, to the 
left of the child, then brought out again 
within his sight and hidden under pil- 
low B, to his right. 

4. The same procedure as in 3, but 
with the difference that when it is 
moved from A to B the object is 
wrapped in a cloth so that the displace- 
ment is visible but not the object itself. 


Between 6 and 18 months, the three 
children who were observed succeeded 
in solving progressively, in order of in- 
creasing complexity, all the items of the 
experiment, although they had failed 
them all at first. Their behavior toward 
the missing object changed completely. 
Although they were already capable of 
coordinating the movements of their 
eyes and hands to grasp objects, never- 
theless they made no effort at first to 
find an object when it disappeared from 
their visual field, and behaved as if it 
were no longer available. A few months 
later, they were able to locate an object 
whose position had been changed in any 
of the ways described above. 

This experiment was followed by 
one involving the rotation of an object: 
at feeding time, the experimenter turned 
the baby’s bottle 180 degrees and presented
it the wrong way round. If the
[...] by so-called schemata, which, for Piaget,
constitute the unities of behavior.
A schema is a mode of action elaborated
by the subject, a mode of action capable
of conservation, of transformation by
generalization, and of coordination with
other schemata. The functioning
of these schemata appears as “patterns of
behavior.” From this perspective it is
possible to note a continuous series of
schemata, from the grasping reflex to
the recognition of permanent objects
and their changes of position in space.
This series, continuous from the point
of view of functional activity (which
does not necessarily imply continuity
in the underlying neuromuscular maturation),
culminates in a profound transformation
of behavior. Whereas at the
beginning of cognitive development objects
have no meaning except as a
function of the infant’s own [...]
activity (e.g., being touched, fed,
[...] did not fit into a whole network of cog[...]

The distinction between sensori-motor
processes and the higher processes
involved in “thinking” is, of
course, difficult to maintain in all cases, 
especially since many of the authors to 
whom we have referred do not observe 
it. In certain cases the distinction can- 
not but be arbitrary. For example, 
among the studies of sensori-motor 
processes we mentioned experiments on 
double-alternation, keeping in mind the 
fact that research workers who made 
use of this technique were primarily 
interested in relations of elementary 
temporal order. Recently, Matalon 
(1959) made a further study of the 
double-alternation problem by modify- 
ing the usual method of presentation in 
order to reveal the activity of higher 
thought processes. The main modifica- 
tion was in the method by which sub- 
jects were informed of the correctness 
or incorrectness of their responses at 
each trial. One group of 8-year-old 
children only saw whether the side 
chosen was correct or incorrect. For 
the second group, when they made an 
incorrect choice they saw the reward 
on the opposite side. In addition, the 
trials were given in a continuous se- 
quence until the subject gave 12 succes- 
sive correct responses and not, as was 
the case in Gellermann’s study (1931), 
in isolated blocks of trials. 

The group which, after an incorrect 
choice, had to infer what the correct 
choice would have been took many 
more trials to find the principle, thus 
showing that even in such a simple 
problem inferential activity seems to
play a role.
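
A criterion of this kind, a fixed run of successive correct responses within a continuous sequence of trials, can be scored mechanically. The Python sketch below is an illustration only: the function name `trials_to_criterion` is hypothetical, and counting the criterion run itself in the total is an assumption, since the text does not say how the trials were tallied.

```python
def trials_to_criterion(outcomes, criterion=12):
    """Return the trial number on which the criterion run is completed.

    `outcomes` is a sequence of booleans, one per trial (True = correct).
    Returns None if the run of `criterion` successive correct responses
    is never reached. Assumption: the criterion trials count in the total.
    """
    run = 0
    for n, correct in enumerate(outcomes, start=1):
        # A single error resets the run of successive correct responses.
        run = run + 1 if correct else 0
        if run == criterion:
            return n
    return None


example = [False, True, False] + [True] * 12
print(trials_to_criterion(example))  # 15
```

Comparing this count between the two feedback conditions is one way to express the difference reported above.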

The processes involved are probably 
similar to those necessary for the discovery
of more complex principles,
which we shall now consider. 


THE DISCOVERY OF A PRINCIPLE 

The general scheme is the following: 
for each trial of the experiment the 
subject must choose between two, or 
sometimes several, elements shown to 
him. During his successive trials, he is 
able, by noting his successes and fail- 
ures, to discover the rule that would 
enable him to succeed on every trial. 

Heidbreder (1927, 1928) presented 
her subjects with two boxes, in one of
which was a doll. The experiment consisted
of three parts, each one involving
a different principle. In the first the doll 
was always in the box on the right, 
whether near or far. In the second the 
correct box bore the picture of a flower, 
whereas the other one bore only a sim- 
ple figure. The third principle was more 
complicated: some of the boxes had 
labels with red dots, others did not. If 
the boxes had dots on them, the nearer 
one was correct. The instruction was 
simply to find the doll; there was thus 
no mention of a principle that might 
lead to a solution. In order to obtain in- 
formation on the mechanisms brought 
into action, the experimenter asked the 
reason for each choice; after a wrong 
choice, the subject was allowed to open 
the other box to verify that the doll was 
there. Once again, as in the sensori- 
motor problems, it is evident that the 
behavior in such an experiment can be 
regarded as a learning process. More- 
over, the technique is similar to the one 
used in the study of discrimination 
learning in animals. 


In this experiment, the principle to
be discovered was quite arbitrary. But
other experiments deal with empirical
or logical laws. A method similar to
Heidbreder’s has been used by Wohlwill
(1959) in a study of the role of
perceptual factors in the formation of
number. The subject was placed before
a screen with three windows, which he
could open and on which the experimenter
could hang cards bearing a certain
number (4, 5, 6) of points, regularly
arranged. It was explained to the
subject that a token was hidden behind
one of these windows and that on each
trial he would be given a card indicating
which window it was behind. Each
sample card bore a number of dots, 
corresponding to the number of dots 
on one of the windows but arranged 
differently. The token was behind the 
window bearing the same number of 
dots as the card given to the subject. 
In another part of the test the sample 
card was replaced by small buttons. 
They were at first arranged in the 
same way as the dots on one of the 
windows; then the experimenter dis- 
ordered them and removed or added 
one within sight of the child. The cor- 
rect window was indicated by the final 
number of buttons. The subject’s task 
was to find the token, that is, to open 
the window that bore the number of 
dots equal to that on the model. 
Petersen’s experiment with a scale 
balance (1932) is one example of a situ- 
ation in which the principle to be dis- 
covered is an empirical law. After indi- 
cating to the subject that all the prob- 
lems he was going to set him involved 
the same general principle, the experimenter
presented one of them, gave the
solution, and showed experimentally 
that it was correct. Then he set another 
which the subject had to solve by him- 
self and continued to alternate in this 
way a problem whose solution was 
given by the experimenter with one that 
the subject had to solve. At the conclu- 
sion of the experiment, the subject was 
asked to state the principle. A correla- 
tion was computed between the number 
of correct responses given by a subject 
and his IQ. 

The same problem of balancing a pair 
of scales was studied in a quite different 
way by Inhelder and Piaget (1955/ 
1958). The child was placed in front 
of a scale balance and a series of weights 
which he could manipulate freely. He 
was asked to balance the scales while the 
experimenter interviewed him to find 
out how he went about it and noted 
how his trials proceeded. The experi- 
menter did not intervene actively, 
except to motivate the subject or to 
suggest a new situation when the sub- 
ject had achieved equilibrium in one 
situation without seeking to go any 
further. In analyzing the results, the 
emphasis was placed above all on the 
method of seeking the law, the system- 
atic nature of the trials, and the tech- 
nique of proof rather than on the re- 
sults actually obtained in each single 
case. 


VERBAL PROBLEMS 


With this last example, we come to 
a type of research that tries to grasp 
the subject’s thought through his verbal 
behavior; in the preceding examples 
verbal intervention was avoided as
much as possible on the part of the 
experimenter as well as on the part of 
the subject. 

The more developed the processes 
under study, the more difficult it is not 
to call upon spoken language. In fact, 
it is difficult to grasp logical reasoning 
in any other way, since formalized 
logic is more a matter of discourse than 
one of action. 

Wertheimer (1945) made an impor- 
tant contribution without using a real 
experimental method. He departs from 
simple observation only insofar as it is 
necessary for the posing of the problem 
(such as the celebrated one of calculat- 
ing the area of a parallelogram) and 
then, without intervening, observes the 
successive attempts at its solution. 

Apart from these few studies, little 
work has been done in the field of logi- 
cal thinking in the child, and what has 
been done has not involved strict ex- 
perimental methods. On the other hand, 
there is abundant data on the behavior 
of children of different ages confronted 
with tests of logical ability. Good ex- 
amples of such tests are to be found in 
Burt (1919-1920). It goes beyond the 
limits of this chapter to study these 
numerous school tests. Although they 
are useful for diagnosis, they furnish 
little information about the mechanisms 
that are brought into play or about the 
methods used in the solutions. 

This very problem already had occu- 
pied the school of Graz, Duncker, and 
Claparède (1934). They tried to deal 
with it by asking the subject to report 
verbally on his attempts and reflections. 
Apart from the theoretical criticisms to 
which this technique gives rise (lack of 
objectivity and accuracy in the reports,
the introspective aspect, etc.), it is obviously
next to impossible to apply it to
children. John and Miller (1957) tried
to fulfill the requirements of objectivity
by using an electronic apparatus that
records the subject's different attempts.
Even though they used it only with
adults and the problems set were too
complicated for children, it would seem
possible to extend its application without
difficulty to the logic of the child.
We shall return later to a discussion of
the way in which the authors analyzed
the data they gathered with this method.


THOUGHT CONTENTS 
OF THE CHILD 


The long-standing interest in the be- 
liefs and explanations of children had 
received renewed impetus from the 
field of anthropology in the study of 
the so-called primitive mentality of cer- 
tain societies, characterized by collec- 
tive animism, realism, and magic. In 
addition, the exploration of the un- 
conscious in civilized adults had re- 
vealed a system of projective mecha- 
nisms, which foster, among other things, 
a belief in causality through participa- 
tion. The question then arose whether 
such beliefs and explanations, which are 
also found in children, were due to social
and emotional factors alone or
whether cognitive factors were involved
as well, the conceptions in this
case being the authentic expression of 
a general, though transitory, stage in 
human thinking. 

This was the state of the problem 
when Piaget (1926/1929, 1927/1939) 
undertook to study systematically the
development of the different ways in
which children represent and explain,
in situations not involving conflict, a
number of everyday phenomena.
These investigations dealt with the
workings of simple machines, such as
the bicycle, as well as with more complex
notions derived from physical and
psychological experience, such as the
origin of names and their relation to
the objects they name, the movements
of stars and clouds, and the attribution
of consciousness and life to living beings
or objects. It is of much less importance
to know what significance the
child attributes to one specific event
than to know the common characteristics
of his conceptions of the world.

[...] validity of its results. Perhaps it is
worthwhile to specify the characteristics
of the “clinical” method used by
Piaget in exploring the beliefs of children.
However, let us point out that
the procedures used in his more recent
(and more analytical) studies bear a
closer resemblance to experimental
methods.

It is important to keep in mind the
aims of the method: (a) to establish
an inventory of children’s beliefs and
explanations; (b) to evaluate their authenticity;
(c) to distinguish the trends
followed by changes in children’s conceptions
of the world in the course of
their development.

The plan for each of the investigations
originated in spontaneous remarks [...]


The psychological study of the devel- 
opment of children’s beliefs has an 
epistemological bearing; it sheds light 
on the relations between the subject 
and the object. Thus the exploration 
of the successive types of explanation 
offered by children brought out new 
data to support the hypothesis that the 
traditional distinction between the two 
poles of knowledge—the subject with 
his thoughts, desires, and intentions and 
the object obeying physical laws—far 
from being primitive, is, on the con- 
trary, the result of a long process of 
development, based on experimentation 
and socialization. In elementary forms 
of thought the two appear undifferen- 
tiated, thus giving rise to confusions 
comparable to the adualism of the ego 
and the nonego postulated by Baldwin. 

This research, which was carried out 
quite some time ago, inspired a number 
of similar studies and raised a series of 
criticisms regarding the method and the 


of children. In the case of the relation 
between a name and its object, for in- 
stance, the point of departure was a 
conversation between two 6.6-year-olds 
as they were playing at building, a
conversation that seemed to reveal a
tendency toward nominal realism. 


X: And when there weren’t any names... 

Y: If there weren’t any names you’d be in 
trouble. You couldn’t make anything. 
How would you make things (if there 
weren’t any names)? 


Certain authors, for example, Isaacs 
(1930), have confined themselves to 
pure observation. Certainly, this method 
has the advantage that the reflections 
remain in their natural context, and it 
avoids the risk of influencing them un- 
naturally by questions that do not al- 
ways correspond to the thoughts with 
which the child is concerned at the mo- 
ment. However, pure observation does 
not enable us to form an exhaustive pic-
ture of all the possible ways of solving
a particular problem. Children do not 
generally express their beliefs because 
they think that everyone believes as 
they do, because they are afraid of mak- 
ing mistakes or, finally, because the 
ideas are not sufficiently systematized 
to be formulated. 

The clinical approach seeks not only 
to record beliefs that are already for- 
mulated but also to make explicit those 
that direct the child’s reasoning more or 
less implicitly. It provokes reflection 
with a set of questions that complete 
and supplement one another. The inter- 
rogation on nominal realism, for in- 
stance, is built around the following 
questions: 


1. What is a name? 

2. How did names start? 

3. How did we know the sun’s 
name? 

4. Where is the sun’s name? 

5. Do things know their names? 

6. Did the sun always have its name, 
or was it there without a name at first 
and got its name afterward? 

7. Why do we call the sun “the sun”? 

8. Your name is X and your friend’s 
name is Y. Could we have called the 
sun “moon” and the moon “sun”? 


Several experimenters took up Pia- 
get’s research into the mentality of the 
child, particularly in reference to ani- 
mism and magic, but using a set of 
standard questions based on Piaget’s, 
and without permitting themselves to 
follow up the answers in an exploratory 
fashion. This procedure makes it easier 
for the results to be scaled and for 
several judges to evaluate them; it also 
neutralizes the individual differences be-
tween interviewers. However, although
it introduced important controls in
areas that had already been explored,
this technique proved to be inadequate
for exploring new ground with as many
unknowns as the domain of children’s
beliefs. In the rigid application of a pre-
established schema, we run the risk of
imposing on the child an orientation
that is not his own. Moreover, without
a knowledge of their context the raw
results are often unintelligible, and there
is no way of estimating the amount of
confabulation, a frequent factor at that
age.

Piaget, aware of the disadvantages of
pure observation on the one hand and
standardized tests on the other, adopted
an intermediate method inspired by
the one used by psychiatrists. It con-
sists in conducting a free conversation
with the child, and directing it toward
the nonexplicit regions of his thought.
By exploring the context of each of the 
child’s reflections, and asking control 
questions, the interviewer tries to verify 
their authenticity. If a child believes 
that the names of things are part of the 
things themselves and that, for example, 
it is sufficient to stare hard at the moon 
in order to find its name, one can only 
ask how is it that a child speaking an- 
other language would use another word. 
The art of the “clinical” or, better, 
“critical” method consists in letting the 
child talk and at the same time check- 
ing the working hypotheses during the 
discussion itself and not only afterward. 
Clearly, the successful use of this deli- 
cate method requires both aptitude on 
the part of the diagnostician and a long 
period of training. 

In Piaget’s exploratory researches the
results are evaluated qualitatively: first,
they are classified according to the type
of belief they reveal. Here, for exam-
ple, are a few typical categories of re-
plies that show the progressive elimi-
nation of nominal realism:


1. Names have always been a part of
things; things know their names.

2. Names have always been a part
of things; things do not know their
names.

3. Names were given to things by
men; it is sufficient to look at things
to know their names. It is not possible
to change the names of things.

4. It is not sufficient to look at things
to know their names; one invents or
learns names. One can change them if
everyone agrees, for example, when one
goes to another country.


Then these ideas are ordered as they 
gradually appear and disappear with 
age. Finding this order constant for 
each of the beliefs studied, Piaget
was able to establish “stages” in the
conception of the world and of cau-
sality. The term “stage” has often
aroused dispute, since it suggests the 
erroneous idea of discontinuity in de- 
velopment. In actual fact the transi- 
tions between the successive types of 
conceptions of the world are gradual 
and continuous and reveal a trend in 
thought from nondifferentiation to dif- 
ferentiation of subject and object. 
(Names that are first located in things 
and are part of their essence are gradu- 
ally understood to be of human inven- 
tion and arbitrarily attached.) 

Of the many attempts at standardiza- 
tion, we may mention the ones that 
Russell and Dennis (1939) devoted to
animism. The technique of interroga-
tion with alternative questions mentions 
20 standard objects. Three independent 
judges, classifying the results, were in 
agreement 87 per cent of the time, and 
the test-retest correlation was 0.81. The 
authors were also concerned with sam- 
pling, a factor neglected in Piaget’s 
original work, which centered on gen- 
eral developmental mechanisms rather 
than on diversifying environmental fac-
tors. Russell (1940a) was able to show
the generality of primitive forms of ani- 
mism and their gradual decline, regard- 
less of sex and place of residence 
(urban, suburban, or rural). Russell, 
Dennis, and Ash (1940b) found the 
same beliefs in feeble-minded children, 
and these results were confirmed by 
Werner and Carrison (1944) in their 
study on brain-injured children; Dennis 
and Mallinger (1949) found a greater 
percentage of these “infantile” beliefs 
in a group of senile adults, as compared 
with other adults and older children.
Carrying out this research on Zuni and 
Hopi children (Dennis, 1943) they ob- 
tained results similar to those obtained 
in Switzerland. Their work, which thus 
confirms and extends the original in- 
vestigations, nevertheless puts us on 
guard against hasty generalization about 
the age limits of the stages. The ages 
seem the more likely to fluctuate the 
more the tests involve verbal-thought 
content because it is here rather than 
in the reasoning structures themselves 
that the forms of adult thought are re- 
flected. Moreover, the latter, far from 
being a homogeneous whole, always 
contains traces of the thought of the 
child. 

Interesting studies such as those of
Huang and Lee (1943, 1945) on Chinese
children and of Mead (1932) on the 
young Manus indicate, on the other 
hand, a low degree of animism. Since, 
however, their investigatory procedures 
differed from those of the other writers, 
it is somewhat difficult to decide how 
much of the difference is due to the 
method and how much to cultural in- 
fluence. We can see how valuable it 
would be to pursue comparative re- 
search into the structures as well as the 
content of children’s thought by using 
methods that are comparable but ad- 
justed to different cultural contexts. 


CONCLUSIONS 


Under the rather vague term of 
“thinking,” we have grouped a series 
of studies with two goals: the first is 
to determine the mechanisms of the 
child’s thinking, the intellectual in- 
struments he possesses, insofar as they 
can be distinguished (at least in analy- 
sis) from an immediate action, in con- 
trast to sensori-motor processes. Such 
a study might be limited to observing 
the differences of behavior or efficiency 
in children of different ages faced with 
a given task; or it might center on the 
developmental process itself, in an at- 
tempt to understand how the higher 
levels of behavior are constructed from 
the lower ones, by studying their filia- 
tion. These two approaches assume dif- 
ferent theoretical backgrounds and, of 
course, different experimental methods. 
If the difference between the thought 
of two children of different ages is con- 
sidered solely as quantitative and of the 
same nature as the difference between 
two subjects of the same age, the ex-
perimenter will devote his interest
above all to performances; the test will
have to be strictly standardized and
presented as far as possible in the same
way at all ages. On the other hand, if
the center of interest lies in the evolu-
tion of thinking and the differences be-
tween successive stages of development,
the method must be more supple,
capable of distinguishing qualitative
changes from one stage to the next.

The second goal would be a study of
the contents of the child’s thinking: his
beliefs, his interpretations of what he
observes. One important variable to be
taken into consideration is the socio-
cultural environment and particularly
the current beliefs. The manner in
which the child assimilates these beliefs
to his own cognitive structure could
then be the object of a specific study.


Concept Formation 


The study of concept formation and 
of the acquisition of word meanings 
embraces a vast field that can be ana- 
lyzed from two points of view. On the 
one hand, the development of concepts 
is closely tied from the start both to 
the acquisition of the symbolic function 
(which consists in relating significants 
to signified and which appears equally 
in play, in drawing, etc.) and to the 
development of the child’s judgments. 
On the other hand, since concept for- 
mation is a product of both social and 
intellectual factors, we must study the 
respective roles of language (which is 
acquired by social transmission) and of 
the elaboration of mental operations 
(which are derived from sensori-motor 
actions). 


To study the significance that the
young child attributes to the first words
of his vocabulary, the method of inte-
gral observation is recommended. It
consists in recording the spontaneous
vocabulary in its natural context and
relating it to the child’s activities as a
whole.

Thanks to the systematic findings of
C. and W. Stern (1927) and to the
judicious observations of Bühler (1918)
and Piaget (1945/1951), we have re-
liable data on the (developmental)
filiation characterizing the formation of
concepts. The first verbal expressions
such as “Mama” are still closely tied
to desires and actions and cannot be
understood independently of them. The
naming function, which generally ap-
pears during the second year, is con-
temporaneous with the beginnings of
symbolic activity. In fact, it is precisely
when the child attributes various sym- 
bolic significances, first to his own ges- 
tures and then to any object whatever, 
that one can observe his increasing in- 
terest in the names of things. However, 
these first names are not yet concepts in 
the sense of logical classes. They are 
preconcepts, whose significance is rather 
elastic, like the symbols that children
use in imaginative play. The onomat-
opoeia “wow-wow,” for instance, can
refer sometimes to one dog in particu-
lar or sometimes to any number of dif-
ferent objects. What is missing in these
preconcepts is the comprehension of
the genus-species relationship and the 
differentiation of the logical quantifiers
“one,” “some,” “all.” One child of 2.6
years, out for a walk, came across two
different snails, and each time her com- 
ment was “There is the snail.” Even 
when she went back to see the first, 
she could not decide if they were two 
members of the class snail or the same 
member. In another case the reaction 
of a 3.6-year-old shows an astonished 
interest in the significance of the “all” 
concept. She had just been told that all 
birds lay eggs, and for every bird she 
saw she asked, “That one too? That 
one too?” A child of 6.7 years, sorting 
different species of mushrooms, asked, 
“Is ‘mushrooms’ the name of all of 
them?”, thus showing the beginnings of 
an understanding of genus-species re- 
lationships. 

Questions like “Bob is a boy; are 
boys people?” appear about the same 
time. One 5-year-old established spon- 
taneously the distinction between the 
concepts “all” and “some” in the fol- 
lowing remark, “I’m not going to give 
you all my flowers, just some of them”; 
but the distinction is full of difficulties 
when it is presented in connection with 
a system of class-inclusion, as the fol- 
lowing test shows (Inhelder and Piaget, 
1959): The child is asked: “A boy gave 
me all his roses. I gave the bunch back 
to him and then I asked him, ‘Give me 
some of your flowers.’ He gave me the 
same bunch again. Was that right?” 
The solution of the problem assumes 
the understanding that all of a form 
some of b if a is included in b. 
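
The logical relation behind this test can be sketched with finite sets. The following is only an illustration and not part of the procedure of Inhelder and Piaget: the function names, and the assumption that the boy also owns flowers other than roses, are introduced here for the example. It shows that when class a (roses) is included in class b (flowers), handing over all of a is a legitimate way of handing over some of b.

```python
# Illustrative sketch only: the contents of the bunch are hypothetical.
roses = {"rose_1", "rose_2", "rose_3"}        # class a
flowers = roses | {"tulip_1", "daisy_1"}      # class b, which includes a


def is_some_of(given, whole):
    """'Some of whole': a non-empty collection drawn entirely from whole."""
    return len(given) > 0 and given <= whole


def is_all_of(given, whole):
    """'All of whole': exactly the members of whole."""
    return given == whole


bunch = set(roses)  # the boy hands over the same bunch both times

print(is_all_of(bunch, roses))     # the bunch is all of his roses
print(is_some_of(bunch, flowers))  # the same bunch is also some of his flowers
print(is_all_of(bunch, flowers))   # but it is not all of his flowers
```

The child who answers that the boy was wrong is, in these terms, treating "some of b" as if it excluded "all of a".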

These few examples show that even 
if it is true that only spontaneous re- 
marks can reveal the appearance and 
the significance of concepts, which 
change according to the context, none-
theless a systematic interrogation is
necessary each time we want to deter-
mine when and how the concepts be- 
come stabilized as veritable instruments 
of logical thought. 

After having noted the place of ob- 
servation in the study of spontaneous 
vocabulary, let us refer once again to 
methods designed to analyze the child’s 
understanding of adult language. There 
are two procedures capable of shedding 
light on this understanding, which be- 
comes more and more adequate as the 
child grows older. One involves defini- 
tion of words, and the other involves 
inference from cues given by the verbal 
context of the sentence. 

The definition tests introduced by 
Binet (1903, 1908) in his mental devel- 
opment scales are presented in the fol- 
lowing way: “You have seen a chair; 
tell me what a chair is.” They have 
revealed that young children tend to 
define by use (“it is for . . .”) before 
they can construct conceptual defini-
tions. Since conceptual definitions are
often formally taught, it is sometimes 
very difficult to dissociate the part 
played by schooling from the need the 
children themselves feel to define in 
terms of concepts. By means of a more 
clinical analysis, Piaget (1924/1926, 
1951) was able to reveal some typical 
difficulties encountered by young chil- 
dren, not only in the matter of defini- 
tion, but also in understanding concepts 
expressing reciprocal relations such as 
“brother” or “foreigner.” The intellec- 
tual egocentrism of young children, 
sometimes reinforced by emotional fac- 
tors, creates an obstacle to the compre- 
hension that they can themselves be 
brothers or foreigners from the point 
of view of other people. 


Werner and Kaplan (19..) con-
ceived and judiciously analyzed a test
for children of 8.6 to 13.6 years, a test
whose goal is to study experimentally
the processes underlying the acquisition
of word meaning through verbal con-
texts. An artificial word, “soldeve,”
whose meaning is to be guessed
(“wither” or “fade”), is used in six
different sentences.

1. The dinner was good, but the fruit
was soldeve.

2. When we were driving in the
evening, we did not feel safe because things
on the road seemed to soldeve.

3. The older you get, the sooner you
will begin to soldeve.

4. People like a blossoming plant better
than one that is soldeve.

5. Putting the dress on the sunny lawn
made the color of the cloth soldeve.

6. Because the windshield was frozen,
things looked soldeve.


The whole experiment is composed 
of 12 series of sentences, and the mean- 
ings of the words range from concrete 
to abstract. The authors warn against 
an interpretation of the levels of con- 
ceptualization as static; they are, on the 
contrary, dynamic and fluctuating. A 
level of abstract conceptualization is 
attained only insofar as the underlying 
intellectual processes themselves have 
acquired a stability that resists the 
natural tendency to regress toward 
levels of concrete symbolization when- 
ever the task becomes more difficult. 

Remarks of the same order could be 
made about the much-discussed tests of 
syllogisms (Burt, 1919-1920, 1923) and 
absurd sentences (Binet and Simon, 
1908). The following is an example 
from Burt: 


Where the climate is hot, aloes and
rubber will grow; heather and grass will
grow only where it is cold. Heather and
rubber require plenty of moisture; grass
and aloes will grow only in fairly dry
regions. Near the river Amazon it is
very hot and very damp. Which of the
above grows there?


These tests, although destined to ex-
plore the capacity of children and
adolescents to reason from hypotheses,
are used in their classical form, without
sufficient analysis, and they run the risk
of measuring not so much the level of
conceptualization as the degree of fa-
miliarity that a child has with a certain
kind of new problem.

Word associations have almost never 
been used as a means of investigating 
the mechanisms of children’s thinking, 
except for a few studies done some 
time ago by authors who accepted 
without question the pure associationist 
theory. 

Certain authors undertook an empiri- 
cal study of children’s associations with 
the idea of creating an instrument of 
intellectual diagnosis. 

Kent and Rosanoff’s (1910) impor- 
tant statistical document served as the 
point of departure for a series of re- 
searches into variations in the fre- 
quency of certain types of association 
or of certain responses (for given 
stimuli) in children and for compari- 
sons of these frequency distributions 
between children and adults (Wood- 
row and Lowell, 1916). 

However, for the study of the devel- 
opment of thinking, the value of these 
findings is very limited. The classifica- 
tions on which they are based are al- 
ways the reflection of an implicit hy-
pothesis as to the role that associations
play in thinking, unless they are derived
from external distinctions such as gram- 
matical word categories; moreover, the 
statistics concerning variations in fre- 
quency (common or original responses) 
are strongly influenced by the choice 
of stimuli. 

One of the very first experimental 
studies of word associations in children 
is perhaps the most interesting from the 
point of view of the theory of think- 
ing. Ziehen (1899/1900) limited his 
study to the notions brought into play 
by word associations, particularly the 
ones evoked by the stimuli; he distin-
guishes two types of notion: (a) those
that reflect individual, concrete con- 
tents and (b) those that reflect judg- 
ments. He observed that there is a clear 
predominance of the first type in chil- 
dren, whereas the second appears only 
toward adulthood. The author considers 
that all the peculiarities of children’s as- 
sociations are due to their responses in 
terms of individual contents. 

In a recent study Morf (1954) tries 
to relate adult associations to reactions 
found in very young children (4 to 5 
years old), the latter being character- 
ized by the fact that their explicit, and 
virtually their only, function is the in- 
terpretation of the stimulus. In the first 
place, he shows that the meanings at- 
tributed to the stimulus, and involved 
in the association itself, develop accord- 
ing to the developmental level toward a 
progressive conceptualization, and the 
numerous adult associations, which are 
to all appearances “associations of ideas” 
in the classical sense, are in reality con-
cealed interpretative reactions. In the
second place, he analyzes the nature of 
the associative connections, relating
them to the level of interpretation of
the stimuli, and concludes that, in gen- 
eral, these connections are experimental 
artifacts and not the product of pre- 
existing associations, as claimed by the 
experimenters of the associationist 
school and by those who used associa- 
tions, such as Jung (1915), in psycho- 
analytic investigation. 

We have seen the diversity of meth- 
ods, based on an analysis of the child’s 
language, that are applied to the study 
of concept formation. But if these 
methods throw light on one particular 
aspect of concept formation, one must 
nevertheless recognize that concept for- 
mation goes beyond the progressive ac- 
quisition of the mother tongue or of the 
correct use of an already formed lan- 
guage. The formation of concepts in- 
volves in addition the relating of names 
to things and, more particularly, a logi- 
cal activity that appears in the guise of 
the process of classification. 


LANGUAGE IN ASSOCIATION 
WITH CONCRETE CONTENTS 


One group of experimental studies on 
concept formation has its basis in the 
method, made famous by Hull (1920), 
of associating verbal stimuli with con-
crete situations. From the behaviorist
point of view, a concept can be defined 
as the common response to a class of 
phenomena whose members have cer- 
tain common characteristics. Although 
there are many differences of opinion 
as to the nature of a “common charac- 
teristic” (for Hull they are identical 
elements, for Smoke (1932), dynamic 
wholes), and, although these differences 
of opinion are reflected in the choice 
of experimental material, the methods
recognize, nevertheless, one and the
same principle: that concepts are based
on objective resemblances. Thus the
technique involves varying the objec-
tive data so that the resemblance be-
tween the various elements becomes
less and less marked.

In contrast to research in the field of
classification (which we shall consider
later), where the child discovers by
himself the various possible conceptual
criteria, in this case he learns ready-
made concepts by association. Names,
which, in adult speech, are attributed 
to specific objects or classes of objects, 
are replaced by nonsense syllables. Each 
experiment starts with the learning of 
associative relations, either one-one or 
many-one, between verbal signs and 
objects or pictures presented to the 
child. Heidbreder (1949) contributed 
a great deal to the development of such 
tests of concept formation in the adult, 
but Welch and Long (1940a,b) are 
mainly responsible for having studied 
this learning mechanism in children of 
different ages. For illustration, we shall 
single out their studies on what they 
term vertical and horizontal generaliza-
tions in a conceptual system.

One experiment, dealing with a verti- 
cal hierarchy of concepts, is composed 
of four main items, and variations of 
them: 


1. Prelearning, associating nonsense 
syllables with blocks of different forms. 

2. A first-degree hierarchy: the child 
learns to designate two different triangles 
by MEF and TOV and their union by 
VIC; similarly, two rectangles by YOP 
and ZIL and their union by DAX. 

3. A second-degree hierarchy: he learns
to designate the union of triangles and
rectangles by XIP.

4. By means of instructions like, “Put
[...] on DAX,” or, “Take back the
VIC’s,” it is possible to test in this way
how successfully a hierarchy is learned.
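
The structure the child is asked to learn in items 2 and 3 can be written out as a small tree. The sketch below is only an illustration: the dictionary encoding and the two helper functions are assumptions made here and do not come from Welch and Long. It shows which blocks each nonsense syllable designates and at what degree of the hierarchy it stands.

```python
# Hypothetical encoding of the nonsense-syllable hierarchy described above.
hierarchy = {
    "VIC": ["MEF", "TOV"],  # first degree: the two triangles
    "DAX": ["YOP", "ZIL"],  # first degree: the two rectangles
    "XIP": ["VIC", "DAX"],  # second degree: triangles and rectangles together
}


def blocks_named_by(label):
    """Return the set of individual blocks that a label designates."""
    if label not in hierarchy:  # a label for a single block
        return {label}
    members = set()
    for child in hierarchy[label]:
        members |= blocks_named_by(child)
    return members


def degree(label):
    """0 for a single block, 1 for a first-degree class, 2 for a second-degree class."""
    if label not in hierarchy:
        return 0
    return 1 + max(degree(child) for child in hierarchy[label])


for label in ["MEF", "VIC", "DAX", "XIP"]:
    print(label, degree(label), sorted(blocks_named_by(label)))
```

An instruction such as "Take back the VIC's" is carried out correctly only if the child resolves VIC to both triangles, which is what `blocks_named_by("VIC")` returns in this sketch.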


One hundred and three subjects, 42 to
83 months old (3.6 to 7 years), were
thus examined. It was found that it is
easier to learn to identify the objects
than to learn first-degree hierarchies.
Instructions referring to the second-
degree hierarchy seem to present the
greatest difficulties.

A second experiment deals with hori-
zontal generalizations, in which a given
object may have different names as it
is encountered in different situations.
Two stimuli, a cube and a cylinder, are
presented in three different contexts: in
a bowling alley situation they become
missiles, in a drum situation, the tips of
drumsticks, and in a scale situation,
weights.

The objects and the contexts are 
once again associated with nonsense 
syllables, and the child can manipulate 
the object each time the corresponding 
verbal signal is pronounced. 

With 54 children, 67 to 89 months 
old, it was found that “learning a dif- 
ferent name for the same object in 
three different situations (a horizontal 
generalization) is confusing but not as 
difficult as learning the position of an 
object in a second hierarchic struc- 
ture (a vertical relation).” 

Such a method is highly interesting 
if we make the assumption that for 
a child the objects around it have al- 
ways been recognized as separate, stable 
entities. But evidently the method loses 
some of its significance when we con- 
sider that the formation of concepts 
depends on active operations, i.e., on a
structuring on the part of the subject
leading to the construction of objects 
in the outer world, and not only on the 
properties of objects already given. 

The approach of the Soviet school, 
which attributes a fundamental impor- 
tance to language as a second signal 
system, is comparable to this method 
and, at the same time, different from 
it. It is comparable in that concepts are 
assumed to be acquired by conditioned- 
reflex mechanisms and different in that 
they study the demands of socialization, 
which give birth to language, and the 
influence of language on the progress 
of thought itself. A typical example is 
provided by Luria and Yudovich 
(1956/1959), Luria being known for his 
work in the field of aphasia. They 
studied a pair of identical twins whose 
language development was still con- 
siderably retarded at 5 years of age, 
a phenomenon frequently observed 
among twins. This language retardation 
was accompanied by more general diffi- 
culties in processes of abstraction, in 
play situations, in carrying out precise 
instructions, and above all in classify- 
ing. Once separated, the twins felt the 
need to communicate with others in a 
more elaborate language, and a genuine 
linguistic learning took place, which 
was accompanied by improvement in 
their cognitive activity. 

These methods are capable of reveal- 
ing the role of certain activities of the 
subject, in this particular case the lin-
guistic activity in the acquisition of
concepts. But it is probable that logical 
operations have a more fundamental 
basis than in language, in that they are 
based on the coordinations of actions. 
This hypothesis gives rise to two sorts
of methods: one group, influenced by
the Gestalt theory, seeks to reveal the 
general types of structuring that are 
common to perception, action in gen- 
eral, and classification in particular; the 
other looks for the source of classifica-
tions in reversible* operations, qualita-
tively distinct from perceptual struc-
tures.


THE MECHANISMS 
OF CLASSIFICATION 


No concept is formed in isolation; 
each one is bound up with a whole 
system of classificatory operations. It
is possible to distinguish two points of 
departure in research devoted to the 
study of elementary classificatory be- 
havior. One starts from the diagnosis 
of neuropsychiatric syndromes in 
adults and then brings its findings to 
bear on the special difficulties that chil- 
dren have in conceptualizing; the other 
is oriented toward the study of the 
formation of logical classes and attempts 
to analyze the mechanisms of concep- 
tualization in their development. 

Typical of the first approach are 
the sorting tests originated by Gold- 
stein and Scheerer (1941), with the 
collaboration of Gelb and Weigl, for 
the diagnosis of symptoms of mental 
deterioration caused by cerebral in- 
jury. In these tests, which inspired a 
number of investigations on children’s 
thought at the preconceptual level, two 
types of material are used: on one hand, 
everyday objects, both of natural size 
and in miniature and, on the other, 
geometrical material comprising tri-
angles, squares, and circles in four dif-
ferent colors. There are essentially three
aspects to the investigation. The first
deals with the formation of a class; the
experimenter, designating a single ob-
ject, asks, “Which of all the remaining
objects belong with it?” The second is
concerned with discovering the crite-
rion that determined a class that already
exists, “Why do all these belong to-
gether?” The third deals with the pos-
sibility of shifting from one criterion
to another.

*Reversible refers to the possibility of can-
celing one thought operation by its inverse
(or negative) operation.

It is important to note the different 
forms of classificatory behavior and 
their characteristics, for example, rigid- 
ity or flexibility, narrowness or over- 
generalization, concreteness or abstract- 
ness. Reichard, Schneider, and Rapaport 
(1944), applying such sorting tests to 
children, found that they remained at- 
tached to the concrete and functional 
aspects of the concepts. Classification, 
they found, precedes the discovery of 
the principle or definition of the class, 
but this active classification never be- 
comes so elaborate as the definition, 
which has a more conventional char- 
acter. 

It is well known how Goldstein’s 
ideas inspired Wallon’s (1945) notion 
of a “precategorial” level in the thought 
of a child. In this connection, one of 
his collaborators (Ascoli, 1950) under- 
took research on children’s classifica- 
tion, asking children to classify geo- 
metrical shapes according to two or 
three criteria, which they had to dis- 
cover. Oleron (1951) and Vincent 
(1957), too, took up a special aspect 
of the problem, that of the relations 
between classification and language in 
deaf-mutes. Finally, the problem has 
been studied by means of a technique
particularly well adapted for young
children. Slama-Cazacu (1957) has de-
veloped a “cupboard game” in which
the child can find a hidden toy only if
he opens a compartment marked by
pictures of different species of one
genus (for example, a dog, a horse, a
cat, a giraffe, for the genus “animal”).
The author observes that in preschool
children the notions of genus are un-
stable as far as their denotation and
content are concerned, but she also
notes a process of generalization, based
on abstraction.

Typical of the second approach are
the investigations of Vigotsky (1931) 
who, by using a classification test de- 
vised by Ach, showed that the evolution 
of concept formation in children fol- 
lows differentiated stages, beginning 
with pseudoconceptual and so-called 
complexual forms. (Instead of grouping 
objects according to criteria defining 
a class, children merely put them to- 
gether in a complex heap.) Hanfmann 
and Kasanin (1937) started from these 
genetic studies when they developed 
their tests for adults, which make it 
possible, in Rapaport’s (1946) phrase, 
“to diagnose the encroachment of mal- 
adjustment upon conscious thinking.” 
The particular problem taken up by 
Inhelder and Piaget (1959) was that of 
understanding why the organization of 
classificatory behavior takes the forms 
it does and why these successive forms 
tend toward logical structures. The fol- 
lowing three methods have been com- 
bined in this study, but any one by itself 
is not sufficient: (1) spontaneous classi- 
fications of natural and geometrical ob- 
jects; (2) the shifting from one criterion 
to another and the anticipation of visual
classifications (for instance, the child is 
asked what would be the minimum 
number of envelopes needed to class 
geometrical figures, differing in form, 
size, and color, and presented to him 
in a random order); (3) the formation 
of the operational mechanisms them- 
selves. Techniques were designed to 
bear on the problem of how the struc- 
ture of inclusion is formed. This struc- 
ture is certainly not present from the 
beginning, either hereditarily or as a 
Gestalt, and its development is much too 
laborious to permit of an explanation 
simply in terms of verbal learning. 

One experiment already carried out 
(Piaget and Szeminska, 1941/1952), 
and standardized later but yet to be 
published, revealed the difficulty ex- 
perienced by children of less than 6.6 
years of age in understanding that there 
are more elements in the whole class B 
than in the sub-class A included in 
it (if B = A + A’). In this technique the 
child is presented with an open box 
containing a dozen beads, which he can 
easily see are made of wood. Most of 
the beads (ten) are brown, and two 
are white. He is asked, “Are there more 
wooden beads or more brown beads in 
this box? Could we make a longer neck-
lace with all the wooden beads or with
all the brown beads?” (He is made to 
understand that after making the neck- 
lace of wooden beads he can take it 
apart again to make the one of the 
brown beads.) Finally he is asked, “If I
take all the wooden beads out of the 
box, will there be any beads left? If I 
take out all the brown beads, will there 
be any beads left?” Because of the 
immobility of his thought, the child 
younger than 6 years of age cannot
conceive of the same beads as forming 
part of two collections. 

He answers that there are more 
brown than wooden beads, “because 
there are only two wooden ones.” Once 
the brown ones are included in one 
class, he cannot at the same time in- 
clude them in a second class. He can 
compare subclass A (the brown beads) 
only to subclass A’ (the white beads) 
and not to the total class B (all the 
beads). 
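
The inclusion relation B = A + A’ underlying this experiment can be
written out with ordinary sets. The short Python sketch below is only an
illustration under our own assumptions: the variable names and bead
labels are hypothetical, and the two comparisons merely restate the
operational answer and the young child's answer as described above.

```python
# Illustrative sketch of class inclusion, B = A + A'.
# Bead labels are hypothetical; the counts (10 brown, 2 white) follow the text.
brown = {f"brown_{i}" for i in range(10)}   # subclass A
white = {f"white_{i}" for i in range(2)}    # complement A'
wooden = brown | white                      # whole class B = A + A'

# Operational comparison: the subclass A is set against the whole class B,
# so the same brown beads are counted in both collections.
more_wooden_than_brown = len(wooden) > len(brown)

# Comparison attributed to the young child: once the brown beads are set
# apart as A, only the remainder A' = B - A is left to compare them with.
remainder = wooden - brown
more_brown_than_remainder = len(brown) > len(remainder)

print(more_wooden_than_brown)      # whole versus part
print(more_brown_than_remainder)   # part versus part
```

Both comparisons come out true; what separates the two levels is which
pair of collections the child is able to compare.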

It was discovered that the problem 
of inclusion, even when it was solved 
with material consisting of a collec- 
tion of countable objects, continued to 
offer difficulties when applied to a hier- 
archy of concepts such as the system 
of classes of animals. Without entering 
into the details of the technique, let us 
remark that it involved organizing pic- 
tures of familiar animals into a hier- 
archy of three classes and that the con- 
cept of the hierarchy was concretized 
by means of transparent boxes that 
could fit one inside the other. In this 
particular case, the correct solution 
consisted in putting the ducks into the 
smallest box, A; the class of birds, com- 
posed of A and its complement A’ 
(birds that are not ducks) into the 
second box, B; and the animals, A + A’
+ B’ (B’ being animals that are not
birds) into the largest box, C. Once this 
hierarchy is concretely established by 
the child, he is asked questions of the 
same kind as those regarding the beads. 
Here, as in all the research on clas- 
sification, it was seen that the logical 
mechanisms do not operate as com- 
pletely developed structures, which are 
applied to any content, but that they are
gradually developed in relation to the
different contents to be conceptualized.
The divergence among authors as to
the age at which the first conceptual
systems appear would seem, then, to be
partly the result of observations that too
often have been sporadic or of experi-
mentation limited to a single content.


Another question deserving of analy-
sis in the field of genetic psychology and
necessitating specialized methods is that
of the relations between perceptual con-
figurations and operational mechanisms
and their variations with age. This ques-
tion arose with some urgency in connec-
tion with the matrix tests adapted for
children (Raven, 1938), in which the
subject is to find among several pictures 
the one that fulfills two or three criteria 
at once. These criteria have to be ab- 
stracted from other pictures arranged 
in matrix form. The test is supposed to 
reveal the mechanism of logical multi- 
plication, but the spatial arrangement 
of the elements can facilitate certain 
of the solutions to such a degree that 
it is difficult to distinguish the respective 
parts played by perceptual and opera- 
tional mechanisms. In order to distin- 
guish them, we must analyze the pro- 
cedures used by children of different 
ages by systematically varying the fac- 
tors involved. Not only the number 
of criteria but also the number of 
possible choices must be varied. [It 
was found that with perceptual con- 
figurations certain three-criterion mat- 
rices are more easily solved by very 
young children than by older ones, but
only as long as there are very few 
choices (Inhelder and Piaget, 1959).]
In addition, we must do more than
record the first choice; we must also
offer other possibilities, since the degree
of certainty or uncertainty is reveal-
ing. Finally, we must try to get the
child to give the reasons for his choices,
since the agreement or lack of agree-
ment between actions and motivations
is another significant index of his level
of conceptualization. This method (used
by Inhelder and Piaget, 1959), which
is subtler than the usual test procedure
but standardized and compatible with
statistical evaluation of the results,
shows, in fact, a progressive decline of
configurational factors in favor of oper-
ational factors.
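
The “logical multiplication” that a matrix item is supposed to call for
amounts to crossing two criteria. As a rough sketch (the shapes, colors,
and function name are our own hypothetical choices, not the material of
the published tests), the missing cell of a two-criterion matrix is
fixed jointly by its row and its column:

```python
from itertools import product

# Hypothetical two-criterion material: each row shares a shape,
# each column shares a color.
shapes = ["circle", "square", "triangle"]
colors = ["red", "blue", "green"]

# Logical multiplication: every shape is crossed with every color.
cells = list(product(shapes, colors))
matrix = [[(s, c) for c in colors] for s in shapes]

def missing_cell(matrix, row, col):
    """Infer a hidden cell from one neighbor in its row and one in its column."""
    n_rows, n_cols = len(matrix), len(matrix[0])
    shape = matrix[row][(col + 1) % n_cols][0]   # criterion shared along the row
    color = matrix[(row + 1) % n_rows][col][1]   # criterion shared down the column
    return (shape, color)

answer = missing_cell(matrix, 2, 2)
print(answer)
```

A sketch of this kind represents only the operational solution; it says
nothing of the perceptual route (completing a visual pattern) that, as
noted above, may lead to the same choice.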


GENETIC ANALYSIS 
OF INTELLECTUAL OPERATIONS 


Among the concepts that are formed 
during childhood, those that serve as a 
point of departure for scientific thought 
deserve special interest on the part of 
psychologists. Piaget and his colleagues 
have studied the formation of the 
notions of number, space, speed-time, 
and probabilistic causality from an 
original point of view. Having first 
analyzed the operations that are funda- 
mental to these concepts once they are 
acquired, they asked whether the con- 
cepts resulted only from socially 
transmitted learning or whether the 
individual had some active contribution 
to make in their formation. The fol-
lowing experiments were designed es- 
sentially to examine this intervention of 
the child’s activity. 

We can follow step by step this de- 
velopment in the child, from its origin 
in his sensori-motor activities, through 
the period when it is enriched by men-
tal representations, and finally to its 
accomplishment in a genuine system of 
operations. 

We can outline here only a few of 
the techniques used: 


1. The principle of numerical in- 
variance (Piaget and Szeminska, 1941/
1952). This technique was designed to 
study the manner in which children 
from 4 to 6 come to establish two 
equivalent collections of objects and to 
discover that this equivalence is pre- 
served no matter how the elements may 
be arranged. In one form of the tech- 
nique the children were first asked to 
take from a small basket as many eggs 
as there were eggcups in a row before 
them. The experimenter noted, in 
particular, the way in which the chil- 
dren established the correspondence 
between the two collections. (The 
youngest children, not yet having dis- 
covered one-to-one correspondence, 
tended to construct two rows of equal 
length, regardless of the number of 
objects.) Once the one-to-one corre- 
spondence was established, the experi- 
menter increased the spacing of one of 
the rows. Then, to find out if the child 
still accepted the equivalence of the 
collections, he put a series of questions 
in the following vein: “Are there the 
same number of eggs as eggcups?” “Are 
there more or less eggs than eggcups?” 
“Can we put each egg in an eggcup 
without there being any empty egg- 
cups or any eggs left over?” In all 
the experiments of this kind the experimenters took
care that neither the intonation of the 
voice nor the order of the questions was 
suggestive. The analysis of the children’s 
behavior and replies revealed that under
the conditions described here, and in 
this particular cultural environment, nu- 
merical conservation is absent in most 
4-year-olds and that it is acquired grad- 
ually between the ages of 4 and 6, until 
most 6-year-olds assert it with a feeling 
of logical evidence. This process of ac- 
quisition, which can, of course, be ac- 
celerated by training, corresponds to a 
general progress toward an “opera- 
tional” quality in the thought of the 
child. The qualitative analysis of the 
reasoning seems to show that little by 
little the transformations imposed on 
collections of objects come to be con- 
ceived operationally; that is, each trans- 
formation can be mentally canceled by 
its inverse. This can be formulated by 
the child in the following way: “I 
only have to put the eggs back opposite 
the eggcups and I know there is always
the same number.” 
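
The reversibility invoked here can be illustrated with a minimal sketch.
The positions, the `spread` function, and the spacing factors are
hypothetical; the point is only that respacing a row changes its length
but not its number, and that the transformation is canceled by its
inverse.

```python
def spread(row, factor):
    """Change the spacing of a row of object positions by `factor`."""
    return [x * factor for x in row]

def row_length(row):
    """Distance between the first and last object in the row."""
    return max(row) - min(row)

# Six eggcups and six eggs, initially placed opposite one another.
eggcups = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0]
eggs = list(eggcups)

# The experimenter increases the spacing of one of the rows.
spaced_eggs = spread(eggs, 2.0)

# The inverse operation puts the eggs back opposite the eggcups.
restored_eggs = spread(spaced_eggs, 0.5)

print(len(spaced_eggs) == len(eggcups))                # number is conserved
print(row_length(spaced_eggs) > row_length(eggcups))   # length is not
print(restored_eggs == eggs)                           # inverse cancels the change
```

The youngest children, as described above, judge by the second
comparison (length); the operational child relies on the first and the
third.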

2. The principle of conservation of 
quantity. The formation of invariant 
concepts has been followed system- 
atically in different domains of the 
child’s thought. Once attained, each 
of the concepts is based on a nucleus 
of operations comparable to mathemati- 
cal group structures (but more limited 
than they are), which Piaget has named 
a semigroup or “grouping” of concrete 
thought. 

To study the physical invariants of 
matter, weight, and volume, as the ideas 
develop between the ages of 4 and 11, 
one method involved two balls of plasti- 
cene, identical in shape, size, and weight
(Piaget and Inhelder, 1941). One of the
balls was stretched into the shape of a 
sausage, broken into little bits, etc., and 
questions of the following sort were
asked:

Is there the same amount of plasticene
in the sausage as in the ball?

Is there more in one than in the other?

How do you know that there is al-
ways the same amount? How do you
know that there is more? (Or how do
you know there is less?)

Will the ball and the sausage weigh
the same on the scale?

A similar experiment concerns the
conservation of the weight of sugar
when it is dissolved in water.

3. Spontaneous measurement (Piaget, 
Inhelder, and Szeminska, 1948). Of a 
number of experiments in the field
of spatial concepts, we shall single
out the one dealing with spontaneous
measurement. How do 4-year-olds dis-
cover on their own the conditions nec- 
essary for measurement, such as the use 
of a common measure involving tran- 
sitivity, the constancy of the measuring 
instrument when its position is changed, 
and the construction or breaking down 
of the instruments into units? The
authors started with a situation familiar 
to the child: building a tower with 
wooden blocks. They asked the child to 
build the tower just as high as a model 
tower, made of a single piece of wood 
and standing on a table at a different 
height from the one on which the child 
was to build. Every measurement in- 
volves a transportation of the measuring 
instrument, and in this experiment it 
was possible to classify the different 
types which tend to follow one another 
as the child develops. The earliest are
direct visual or manual comparisons of
height from the floor. They do not
make use of a common measure. Next
come various attempts to use part of the
body or a gesture as the common mea-
sure, neglecting the conservation of the
“transported” dimensions or distances.
Finally, the child discovers the need for
a common measure with stable dimen-
sions. At first, this has to be the same
size as the model; but it is replaced by
measures that are first bigger, then
smaller, and finally by repeatable units
of any size. After this general experi-
ment, a series of more specific studies
was designed, with a view to examining
one by one the conditions necessary for
measurement in Euclidean space, and
the application of measure to areas and
volumes.

4. Speed (Piaget, 1946). The study 
of the complex of speed-time-space con- 
cepts has shown that the notion of 
“passing,” as a change in order, is both 
fundamental and relatively primitive. 
However, its development follows cer- 
tain stages, which one can discern with 
the help of the following material: (a) 
two cars on two straight, parallel tracks, 
moving for equal periods of time at 
different speeds; (b) two cars, again 
moving for equal periods of time at 
different speeds but on two circular, 
concentric tracks. Although the starts
and finishes are simultaneous, the dis- 
tances traveled are unequal. The phe- 
nomena included in the experiment are 
passing, catching up, gaining ground, 
and crossing. Each of the children is 
asked to say, after witnessing the phe- 
nomena, if the two cars moved at the 
same speed. The factors that the chil- 
dren take into account in comparing 
the speeds vary with their ages. Between 
the ages of 5 and 9, the following re-
actions are observed: first, they take
account only of the passing of one car
by another, then only of relative posi-
tions at the finish; gradually, they take
account of the starting points, as well, 
and finally of the set of rectilinear and 
circular paths traversed. At the end of 
this development, they discover the 
elementary operations of the concept 
of speed and the establishment of cor- 
respondences between displacements of 
objects in a given time. 

5. Chance and probability (Piaget 
and Inhelder, 1951). The formation of 
these concepts has been studied with 
the help of a number of games of chance 
of the roulette or progressive mixing 
type. The accidental character of 
chance as an interference between inde- 
pendent causal series is discovered by 
the child only through a system of logi- 
cal or mathematical operations. For ex- 
ample, in the case of the progressive 
mixture of balls, the child first comes to 
understand the fortuitous character of 
the mixture as the interference of inde- 
pendent movements. Later, after he has 
dissociated in this way the certain from 
the possible, he is gradually able to 
quantify the latter and to make prob- 
ability judgments. In terms of our ex- 
ample, he is now able to refer to a 
theoretical system of permutations to 
conceive of the quantitative aspect of 
the progressive mixing. 


In contrast with the studies of Piaget 
and Inhelder, which are limited to the 
concept of objective probabilities in 
children 5 to 14 years of age, Cohen and 
Hansel (1955) have explored the realm 
of subjective probability and its use in
predictions with preadolescents and ad- 
olescents; for instance, for the study of 
the prediction of a binary event on the 
basis of previous outcomes they utilized 
a display board with 16 pairs of lights. 
One light of each of the first ten pairs 
was successively illuminated, nine on 
one side and one on the other. The 
position of the single lights varied (in 
random order) from position 1 to posi- 
tion 10 in different trials. After seeing 
the lights, one to ten, illuminated in 
rapid succession, the subject had to 
guess on which side the eleventh light 
would appear. 


THE EXPLORATIVE APPROACH 
TO THE STUDY OF 
INTELLECTUAL OPERATIONS 


The essential characteristics of the 
methods adopted by Piaget and his col- 
leagues seem to be the following: 

The experiments are designed to 
study one specific but fundamental 
aspect of the formation of the concepts, 
viz., the construction of the underlying 
intellectual operations. They enable us 
to establish the total inventory of opera- 
tions that are active in children’s think- 
ing as well as the obstacles that hinder 
their formation. The different experi- 
mental situations reveal their real sig- 
nificance only when their results are 
compared, since they offer a system of 
mutual cross-checks and points of com-
parison.

In all the experiments the children 
are confronted with changes in the 
physical world. In contrast to the tech- 
niques inspired by the associationists 
or Gestalt psychologists, who studied 
reactions to isolated elements or to
configurations, the so-called “opera-
tional” techniques present the children
with processes of transformation and
study the way in which they are pro- 
gressively interpreted in terms of actual 
mental operations. 

The approach is essentially develop-
mental. For each particular problem
situation, they study the changes that
come with age in children’s reasoning
and active behavior. In this way, they
attempt to distinguish the stages of
intellectual elaboration and its implicit
hierarchy in order to grasp the forma-
tive processes underlying this elabora-
tion.

The method aims at an analysis of the
different qualitative forms of this intel-
lectual elaboration. It attempts to list
them and to evaluate their internal con-
sistency. It is somewhat analogous to
the experimental method used in studies
of problem solving in that it uses ap- 
paratus and to the interview method in 
that it involves discussion with the sub- 
ject. This aspect, the discussion, plays 
an important role in that it enables the 
experimenter to grasp the type of rea- 
soning that underlies the child’s con- 
duct. 

Among the objections made to this 
kind of investigation, let us mention the 
two main ones, one referring to the 
flexible nature of the questioning pro- 
cedure and the other to the lack of a 
statistical analysis of the results. For 
the sake of comparative studies, research 
is now being carried out in Geneva that 
will lead to a standardization of proce- 
dure and a statistical evaluation of the 
frequencies of different types of solu- 
tion. But let us insist upon the fact that 
the flexible, exploratory investigation
is the only way of revealing all the
frequently unsuspected peculiarities of
concept formation in the child. The
standardization, which necessarily in-
volves a certain schematization, is fruit-
ful only after the whole range of pos-
sible qualitative solutions has been estab-
lished. One further difficulty involved
in this method is the amount of training
required of the experimenter: he must
not only record but also provoke the
problem-solving behavior; at the same
time he must neither project his own
interpretive scheme onto the child nor lose
sight of the information he is seeking.


Use of Theoretical Models 


LOGICAL MODELS 


When the presentation of problems 
is essentially verbal, it is generally 
tempting to analyze the different stages 
of the solution in terms of formal logic 
and to describe the observed behavior 
in terms of logical operations. This has 
the advantage of reducing the qualita- 
tively different performances found at 
different stages to a common language, 
thus enabling one to identify the opera- 
tions of which children are capable at 
various ages. 

Logical errors have been studied fre- 
quently enough in the adult. The 
numerous studies on the syllogism, par- 
ticularly on “atmosphere effects,” emo- 
tional factors, etc., are well known 
(Woodworth and Sells, 1935). In each
of these cases the assumption is that the 
subject (a normal adult) possesses the 
necessary logical instruments, and the 
aim of the research is to study the fac-
tors that prevent these instruments from 
functioning and which thus result in 
false reasoning. 

In the child the logical analysis of 
behavior can take on a more profound 
significance: here we are actually try- 
ing to find out what logical tools a given 
subject possesses. The disturbing emo- 
tional variables, interesting as they are in 
the case of adults, must then be reduced 
as far as possible so that the child can 
give his maximum. 

This is the kind of analysis that Piaget 
has undertaken. On the one hand he has
tried to establish a formalized model 
of natural thought and its development 
(1949) and on the other to study this 
development concretely, using the logi- 
cal model to describe it (Inhelder and
Piaget, 1955/1958). Their subjects
(aged 5 to 16) are presented with ap- 
paratuses that exemplify simple physical 
laws (for instance, the equilibrium of a 
scale balance, the inclined plane, non- 
elastic reflection, and communicating 
jars), which they must discover by ex- 
perimenting freely with the material. 
The level of organization of the experi- 
mentation, the conclusions that the sub- 
ject draws from each new item of in- 
formation, and the generalizations made 
are analyzed in terms of the classical 
logic of propositions. In their latest 
work (1959) in which they analyze 
classification and seriation they make 
use of the logic of classes and relations 
for their description. The essential point 
of interest is that in both these cases 
the logical analysis treats the actions and 
not only the verbal behavior of the 
subject. 

Apart from the work of Piaget and 
his school, no other attempts seem to
have been made to examine the thought 
of the child from the point of view of 
mathematical logic. 

Nevertheless, we may ask which type 
of logic can accomplish this study most 
satisfactorily. Piaget uses classical logic, 
with the qualification that his classes are 
serially inclusive, since this is the type 
of classification that seems to him espe- 
cially adapted to natural thought. But 
perhaps it would be possible to take a 
weaker model, such as intuitionist logic, 
which rejects the principle of the ex- 
cluded middle. Certain relationships that 
are erratic from the point of view of 
classical logic can be described in this 
form, and it would be interesting to see 
if thought could be considered con- 
sistent in terms of this weaker frame- 
work. 

Instead of using a logical model to 
describe behavior and to compare it to a 
norm, several authors have constructed 
problems in which different logical con- 
stants come into play. With subjects 
7 to 12 years of age, Morf (1956) pre- 
sented verbal problems that require 
disjunction, conjunction, implication, 
etc. This approach was also suggested in 
a project by Moore and Anderson 
(1954), although they had in mind its 
application to adult thought only. In 
similar fashion, John and Miller (1957) 
constructed their problems on the basis 
of Boolean algebra. The application of 
this type of problem to children seems 
to present certain difficulties, since most 
of them necessitate hypothetico-deduc- 
tive reasoning, too difficult for children 
before 11-13. This is no doubt why 
applications to children are almost 
nonexistent. 


STATISTICAL MODELS


Unlike logical models, statistical 
models are not well suited to describing 
the processes involved during the solu- 
tion of a problem. 

It is for this reason that statistical
analysis of problem solving has been
limited for so long to the study of
performances, quantified more or less
arbitrarily with a view to comparing
different distributions (in different con-
ditions or at different ages) or to cal-
culating the correlations of these per-
formances with other variables.
Peterson (1932), for example, in the
research already mentioned on the
comprehension of the equilibrium of a
scale balance, correlates the perform-
ance of each subject with his intelli-
gence quotient.

This is the technique used in work 
based on a factorial conception of in- 
telligence. Quite aside from the fact 
that this procedure provides no in- 
formation whatever about the process 
that led the subject to his solution, we 
must note a serious insufficiency: the 
difficulty of finding criteria of 
quantification. Since each subject must 
receive a “score” for his performance, it
is first necessary to find a scale of meas- 
urement on the basis of which a specific 
“score” can be attributed to each distin- 
guishable type of behavior. In particu- 
lar, we must avoid the situation in
which the same score is attributed to 
two subjects simply because they have 
made the same number of mistakes, al- 
though the items on which the mistakes 
were made may be entirely different. 
This approach may be perfectly justi- 
fied in applied psychology, in which an 
external validation is sufficient, but it is
of little value in a study of the thought 
mechanisms involved. 

One fruitful approach to these prob-
lems, which partially escapes these
criticisms, consists of the application in
the realm of thought, particularly from
the developmental point of view, of the
techniques used by social psychologists
in the study of attitudes (see Stouffer
et al., 1950). A series of items on an
opinion questionnaire is said to be scal-
able if a positive reply to item (n) im-
plies that a positive reply was given to
item (n − 1) and thus to the (n − 1)
previous items. That is possible if the
questionnaire has been constructed so
that the questions imply one another
and if the subject is consistent. Al-
though these conditions are difficult to 
obtain for attitudes, they are probably 
fundamental in cognitive development. 
In fact, it is probably not too strong a 
hypothesis to suppose that certain con- 
ducts that appear in the course of devel- 
opment are scalable in this sense. 

We would thus have a series of prob- 
lems of increasing complexity, such that 
any subject who solved the nth prob- 
lem would also have solved the (n − 1)th
problem. Specifically, we could apply 
this technique in describing the de- 
velopment of integrated stages, that is, 
of a series of stages, each of which in- 
tegrates the achievements of the preced- 
ing ones. 
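
The scalability condition can be made concrete with a small sketch. It
rests on assumptions of our own: problems are ordered from easiest to
hardest, each response is coded 1 (solved) or 0 (not solved), the four
response patterns are invented for illustration, and errors are counted
by comparing each pattern with the ideal pattern having the same total
score, which is only one of several conventions in use.

```python
def ideal_pattern(score, n_items):
    """Perfect scale type for a total score: successes on the easiest items only."""
    return [1] * score + [0] * (n_items - score)

def is_scale_type(responses):
    """True if solving problem n always goes with solving problems 1..n-1."""
    return list(responses) == ideal_pattern(sum(responses), len(responses))

def reproducibility(data):
    """1 - (deviations from the ideal patterns) / (total number of responses)."""
    n_items = len(data[0])
    errors = sum(
        sum(r != i for r, i in zip(row, ideal_pattern(sum(row), n_items)))
        for row in data
    )
    return 1 - errors / (len(data) * n_items)

# Invented data: four subjects, four problems ordered by increasing difficulty.
data = [
    [1, 1, 1, 0],   # scalable
    [1, 1, 0, 0],   # scalable
    [1, 0, 1, 0],   # solves problem 3 but not problem 2: not scalable
    [0, 0, 0, 0],   # scalable
]

scale_types = [is_scale_type(row) for row in data]
rep = reproducibility(data)
print(scale_types, rep)
```

A set of problems that yields only scale types is strictly scalable in
the sense just defined; a coefficient somewhat below 1 corresponds to
the looser case in which only a few response patterns depart from the
ideal order.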

It is naturally difficult to find prob- 
lems that are strictly scalable, since 
other factors, notably the content, can 
prevent a subject from solving a prob- 
lem that is in reality easier than others 
he had solved. We are more likely to be 
able to find “semiscales.” Vinh Bang
(1959) and Nassefat (1960), among
others, have undertaken to describe 
cognitive development in this way. 
Starting from Piaget’s investigations 
employing the “clinical” method, they 
standardized the procedure by substi- 
tuting for free interrogation a series of
predetermined questions and problems. 
The subjects were still required, how- 
ever, to justify the reply they gave. The 
replies given were classified according 
to the degree of correctness of the an- 
swer and the justifications given by the 
subject. They verified as stringently 
as possible the hypotheses that (1) the 
responses to a given problem showed 
a progressive and constant change with 
age and (2) a group of these
problems formed a Guttman scale.

Of course, this successive integration 
is not the only aspect of intellectual 
development. It is also possible simply 
to order the various reactions of sub- 
jects of different ages to a given prob- 
lem in terms of a criterion such as 
“correctness” without introducing “in- 
tegration.” 

At the present time it would appear 
that only a longitudinal study could 
reveal whether a succession found in 
this way is constant or if the order can 
vary from one individual to another.
Nonetheless, it is to a certain extent 
possible to verify it by finding the cor- 
relation between ages and the hierarchy 
of conducts arranged according to a 
given criterion (a logical criterion above 
all). Nassefat does just that in the work 
mentioned above. He arranges the ob- 
served responses according to the num- 
ber of elements that the subjects take 
into account and then establishes the 
significance of the correlation between 
this order and age. Theoretically, it
would be possible to avoid such an a 
priori order simply by taking the one 
that is best correlated with age develop- 
ment. Unfortunately, this procedure re- 
quires endless calculations, and it 
scarcely seems possible without im- 
proved computers. 
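
The procedure just mentioned, and the reason for its cost, can be
sketched as follows. This is a hedged illustration only: the ages and
response categories are invented, the function names are ours, and a
plain product-moment correlation between age and the rank assigned to
each category stands in for whatever coefficient an investigator would
actually prefer.

```python
from itertools import permutations
from math import factorial, sqrt

def pearson(xs, ys):
    """Product-moment correlation between two equally long sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)

def best_order(ages, responses):
    """Try every ordering of the response categories; keep the one whose
    ranks correlate most strongly (and positively) with age."""
    categories = sorted(set(responses))
    best = None
    for order in permutations(categories):
        rank = {category: i for i, category in enumerate(order)}
        r = pearson(ages, [rank[c] for c in responses])
        if best is None or r > best[0]:
            best = (r, order)
    return best

# Invented data: one response category ("a", "b", or "c") per subject.
ages = [5, 5, 6, 7, 8, 9]
responses = ["a", "a", "b", "b", "c", "c"]

r, order = best_order(ages, responses)
n_orders_for_ten = factorial(10)   # orderings to examine with ten categories
print(order, round(r, 3), n_orders_for_ten)
```

With three categories only six orderings have to be examined, but the
number grows factorially: ten categories already give 3,628,800
orderings, which is the difficulty the text alludes to.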


Conclusions 


A first factor brought to light in the 
preceding analysis is the considerable 
diversity of methods for each of the 
points that we have examined. This 
diversity might be disquieting if the 
psychological problems were reducible 
to one single type. But the problems 
themselves remain extremely varied, in 
view of the fundamental fact that the 
psychology of thinking seems to be just 
at the junction of biology (behavior), 
sociology (language, education), and 
logic and mathematics (since the 
thought of the child is itself the source 
of logicomathematical structures). This 
variety of problems explains, and at the 
same time justifies, the variety of meth- 
ods used. 

Thus our second conclusion is that 
one method is no better or worse than 
another: their values are essentially rel- 
ative to the particular problem to be 
solved. This assertion is perhaps not so 
trite as it seems. It sometimes happens 
that a method is chosen and used with- 
out sufficient awareness of the problems. 
The first condition of an adequate 
method is a sufficiently thorough
preliminary analysis of the problem that
one is called upon to solve. 

What are the major methodological and
interpretative problems encountered by 
the research worker who uses tests with 
children? What types of instruments are 
available to him, and what considera- 
tions should guide his choice of specific 
tests? In this chapter, questions such as 
these are considered with special ref- 
erence to the use of standardized tests 
for the measurement of cognitive traits 
or abilities, as distinguished from the 
assessment of emotional, social, motiva- 
tional, and attitudinal characteristics 
commonly designated as “personality” 
and treated elsewhere in this book. It 
should be noted, however, that the 
distinction between ability and per- 
sonality tests—though customary—is 
neither sharp nor basic. By altering in- 
structions to subjects and other con- 
comitant circumstances, the same type 
of test content can be adapted to the 
measurement of both cognitive and per- 
sonality traits. Such a situation is under-
standable when we realize that in
taking any test, as in all his behavior,
the individual is influenced by his
abilities and by his emotional, motiva-
tional, interest, and other nonintellectual
traits. A differentiating feature of ability
tests is that they undertake to discover 
the individual’s maximum performance 
in a given area (Cronbach, 1960, pp. 
29-34). In administering such tests 
every effort is made to establish optimal 
conditions and to ensure that the in- 
dividual is motivated to “do his best” 
on the test. 

Although the principal types of 
available ability tests for children are 
summarized and illustrated in this
chapter, no attempt will be made to
survey or evaluate specific tests. For
this kind of information, the investiga-
tor is directed to a number of published
sources, the principal one being the
series of Mental Measurements Year- 
books edited by Buros (1959). These 
yearbooks cover nearly all commer-
cially available psychological, educa-
tional, and vocational tests published
in English-speaking countries; they are 
especially complete for paper-and- 
pencil tests. Each yearbook includes 
tests published during a specified period, 
thus supplementing rather than sup- 
planting the preceding yearbooks. The 
Fifth Mental Measurements Yearbook, 
published in 1959, covers the period 
from 1952 to 1958. In addition to such 
routine data as publisher, price, forms, 
age levels, and the like, each yearbook 
provides critical reviews of most of the 
tests, prepared by one or more test 
experts. A nearly complete list of names 
and addresses of test publishers is also 
given.

Other sources that can readily be con-
sulted for information on recently
published tests include the Psychologi-
cal Abstracts, which contains a special
section on new tests, and such journals
as Educational and Psychological Meas-
urement and the Journal of Consulting
Psychology, which list and review new
tests. Descriptions of the principal tests
in each area, as well as fuller expositions
of some of the testing principles dis-
cussed in this chapter, can be found in
any recent textbook on psychological 
or educational testing (e.g., Anastasi, 
1954; Cronbach, 1960; Lindquist, 1951; 
Nunnally, 1959; Thorndike and Hagen, 
1955). A standard guide for the evalua- 
tion of psychological tests is the 
Technical Recommendations for Psy- 
chological Tests and Diagnostic Tech- 
niques, published by the American 
Psychological Association (1954). A 
similar guide has been prepared with 
special reference to achievement tests
(1955).


TYPES OF ABILITY TESTS 


The investigator who plans to utilize 
standardized ability tests in research 
with children has a wide choice of 
instruments for measuring his subjects 
from cradle to college. Tests applicable 
prior to school entrance are generally 
subdivided into infant tests, designed for 
the first 18 months of life, and preschool 
tests, covering the ages of 18 to 60
months (Anastasi, 1954, Ch. 11). With 
regard to test administration, the infant 
must be tested while lying down or 
supported on a person’s lap. Speech is of 
little or no use in giving test instruc- 
tions. Most of the tests at this level are 
actually controlled observations of sen-
sori-motor development, as illustrated
by the infant’s ability to lift his head, 
turn over, reach for and grasp objects, 
and follow a moving object with his 
eyes. At the preschool level, on the 
other hand, the subject can walk, sit at 
a table, use his hands in manipulating 
test objects, and communicate by 
language. At these ages the child is also 
much more responsive to the examiner 
as a person, whereas for the infant the 
examiner serves chiefly as a means of 
providing stimulus objects. Preschool 
testing is a more highly interpersonal 
process, a feature that increases both
the difficulties and the opportunities 
presented by the test situation. 

Some available scales, such as the Yale 
Developmental Examination (Gesell and
Amatruda, 1947), span both the infant 
and preschool periods. Others are re- 
stricted to one or the other level, as in 
the case of the California First-Year 
Mental Scale and the Minnesota Pre- 
school Scale. Of special interest for 
longitudinal studies is the Cattell Infant 
Intelligence Scale, which was developed 
as a downward extension of the Stan- 
ford-Binet, thus providing a uniform 
score scale from the age of 2 months 
to the adult level. 

In the development of some infant 
scales, such as the Northwestern In- 
telligence Tests (ages 13 to 36 weeks), 
a special effort was made to utilize items 
measuring the child’s adaptation to the 
physical and social environment rather 
than physical growth or maturation. 
Examples of such items from the North- 
western scale include responsiveness to 
the sudden disappearance of the ex- 
aminer’s face, retrieving a toy hidden 
under a towel, pulling a string to obtain 
a toy, and looking for a toy that has
been removed. In the previously men- 
tioned Cattell scale there are also items 
involving rudimentary instrumentation, 
such as pulling a string or using a stick 
to attain a toy that is out of reach. 
Vocalization and linguistic responses are 
likewise found in several parts of the 
scale. There are indications, moreover, 
that infant and preschool tests of the 
future will make increasing use of items 
involving learning and linguistic devel- 
opment and may as a result prove to be 
somewhat better predictors of the 
child’s subsequent intellectual progress.

Following school entrance, there is 
a rapid increase in the variety of test 
content, as well as testing techniques, 
which may be utilized. An important 
distinction is that between individual 
and group tests. Individual tests must 
be administered singly to each subject 
and usually require a highly trained 
examiner. Although more time-consum- 
ing, such tests frequently permit better 
control of testing conditions and 
provide additional opportunities for the 
observation of work methods, social and 
emotional reactions, and other qualita- 
tive aspects of the child’s behavior. 
Group tests, on the other hand, are par- 
ticularly designed for mass testing. 
They not only enable one examiner to 
test a large group but are also relatively 
easy to administer and score. 

Infant and preschool scales are 
typically individual tests, although some 
group tests constructed for the primary 
grades can be administered to kinder- 
garten children in small groups. At the 
elementary- and high-school ages, in- 
dividual tests such as the Stanford-Binet 
and the Wechsler Intelligence Scale for 
Children (WISC) are used principally
as clinical instruments for intensive
study of special cases, whereas group
tests serve for most other testing pur-
poses. The latter are very numerous
and can be illustrated by the various
levels of the Pintner General Ability
Tests: Verbal Series and of the Lorge-
Thorndike Intelligence Tests (Anastasi,
1954, Ch. 9; Cronbach, 1960, Ch. 8).


Tests may also be classified with re-
gard to testing medium and content.
The commonest type employed from
the fourth grade through the adult level
is the paper-and-pencil verbal test. To
take such a test, the subject must have
an adequate understanding of English
and must be able to read and write. Al-
though the content is predominantly
verbal, arithmetic problems and other 
numerical items are often included. 
Group tests for the primary grades have 
sometimes been described as “non- 
verbal,” since the items usually con- 
sist of printed pictures and diagrams, 
which the child marks in accordance 
with oral instructions by the examiner. 
In Fig. 11-1 are four illustrative items 
from the Pintner-Cunningham Primary 
Test, with the corresponding oral in- 
structions. Such tests obviously call for 
an understanding of spoken language.
Some of their subtests, moreover, are 
designed specifically to test linguistic 
comprehension. 

Test 1. Mark the things that we need when we go out in the rain.

Test 7. Look at each picture; see how it is drawn; make another one
like it in the dots.

Fig. 11-1. Illustrative items from the Pintner-Cunningham Primary Test. (Copyright by
World Book Company.)

Nonlanguage tests, on the other hand,
require neither spoken nor written
language, having been specially planned
for use with foreign-speaking, illiterate,
or deaf groups. Familiar examples in-
clude the Army Beta and the Pintner
Non-Language Test (Anastasi, 1954,
Ch. 10). Sample items from the six
subtests of the latter are reproduced in
Fig. 11-2. Performance tests, such as 
the Arthur Performance Scale and the 
performance section of the WISC, 
utilize such materials as form boards, 
picture puzzles, blocks, and the like, 
instead of paper and pencil. Some per- 
formance tests can be given both with 
and without oral instructions. The latter
procedure utilizes pantomime, gesture,
and demonstration. Examples of per- 
formance tests can be seen in Figs. 11-3, 
11-4, and 11-5. It should be noted 
parenthetically that a number of non- 
language and performance tests have 
been specially developed or adapted for 
use with physically handicapped per-
sons, such as the blind, deaf, and ortho-
pedically handicapped (Anastasi, 1954,
pp. 268-272).

1. Figure Drawing: Choose the line or lines which show how the figure at the left
can be cut up to look like the pieces at the right.

2. Reverse Drawings: The two drawings at the left are alike, but the second has
been turned over and one line is missing from it. Find the missing line
among those given.

3. Pattern Synthesis: If the two drawings at the left are superimposed, which of
the four given drawings will the result look like?

4. Movement Sequence: The figure at the left has been turning in the direction
indicated by the three successive positions. Find the correct fourth position
from those given.

6. Paper Folding: The drawings at the left show a piece of paper which has been
folded twice and a piece has been cut out of it. Find the drawing which shows
how the paper would look if it were unfolded.

Fig. 11-2. Sample items from the Pintner Non-Language Test. (Copyright by World Book
Company.)

Fig. 11-3. Seguin Form Board, from Arthur Point Scale of Performance Tests, Revised Form
II. The task is to insert the blocks in the appropriate recesses as quickly as possible. (Courtesy
The Psychological Corporation.)

Still another basis for differentiating 
among tests is the extent to which 
power and speed influence scores. In a 
pure power test items are steeply graded 
in difficulty (i.e., of progressively
greater difficulty) and time limits are 
long enough to allow all subjects to try 
every item. Pure speed tests, on the 
other hand, present relatively easy items 
of uniform difficulty and employ short 
time limits. Individual differences in
score on the latter tests thus reflect only 
variability in speed of work. Most tests
fall between these two extremes, involv-
ing varying combinations of power and
speed. It should be borne in mind, 
however, that the relative contribution 
of power and speed to a given test 
may vary with the age, ability level, 
and other characteristics of the subjects. 

Fig. 11-4. Arthur Stencil Design Test I, from Arthur Point Scale of Performance Tests,
Revised Form II. The sample design in the center is to be reproduced by superimposing
cut-out stencils in different colors upon a solid card. (Courtesy The Psychological Corporation.)

Finally, ability tests have been tra-
ditionally classified into those designed
to measure intelligence, special apti-
tudes, and achievement, respectively.
Intelligence tests characteristically pro-
vide a global score, such as an IQ,
purported to gauge the individual’s
“general intellectual level.” A wide
variety of tasks is included on the as-
sumption that through “the sinking of
shafts at critical points” (Terman and 
Merrill, 1937, p. 4) all important in- 
tellectual functions will be sampled. In 
actual practice, intelligence tests are 
overweighted with certain functions, 
usually verbal ability, and may com- 
pletely omit others. Moreover, since 
they have usually been developed 
within an academic framework, intel- 
ligence tests are primarily measures of 
scholastic aptitude. At the same time, 
different intelligence tests, such as the 
previously mentioned verbal and per-
formance types, may cover a different
combination of functions. An IQ is thus
of little meaning unless referred to the
particular test from which it was de-
rived.

Shortly after the first intelligence 
tests came into general use, their limited 
coverage became apparent and efforts 
were made to construct special aptitude 
tests to fill the gaps. Since intelligence 
tests concentrate chiefly upon relatively 
“abstract” verbal and numerical func- 
tions, a particular need was felt for 
tests to cover the more “concrete” and 
“practical” areas. Accordingly, me-
chanical aptitude was among the first
special areas for which tests were de-
vised. Tests of musical and artistic apti-
tudes followed, as well as some having
a more directly vocational orientation,
such as clerical aptitude tests (Anastasi,
1954, Chs. 15 and 16; Cronbach, 1960,
Chs. 10 and 11).

Fig. 11-5. Object Assembly Test of the Wechsler Intelligence Scale for Children (WISC).
When properly assembled, the cut-out pieces form an animal. (Courtesy The Psychological
Corporation.)

A large number of standardized tests 
have likewise been constructed for 
measuring various aspects of vision,
hearing, and motor dexterity (Anastasi,
1954, pp. 388-408). Certain rapid
screening tests of visual and auditory
functions are now routinely admin-
istered to school children in order to
identify cases of mild sensory deficien-
cies. More precise instruments, which
yield a multiplicity of scores, are avail- 
able for intensive individual measure- 
ment (Anastasi, 1954, pp. 390-402). 
Certain vision tests, for example, 
provide measures of near and far acuity, 
depth perception, lateral and vertical 
phorias (muscular balance), and color 
discrimination. Of special interest in
connection with research on children 
are the Oseretsky Tests of Motor 
Proficiency (Doll, 1946), which yield 
a “motor age” analogous to the mental 
age found on intelligence tests. More 
detailed discussion of sensory and motor 
tests may be found in Chapters 7 and 8,
respectively. 

A word of explanation should be 
added regarding the concept of “special 
aptitudes.” The term originated at a 
time when the major emphasis in test- 
ing was placed on “general intelligence.” 
Mechanical, musical, and other special 
aptitudes were thus regarded as sup- 
plementary to the IQ in the description 
of the individual. Following the applica- 
tion of refined statistical procedures and 
the accumulation of research data on 
trait organization, however, the picture 
has changed considerably. It is now 
recognized that “intelligence” itself 
comprises a number of relatively in- 
dependent “special aptitudes,” such as 
verbal comprehension, numerical rea- 
soning, numerical computation, spatial 
visualization, associative memory, and 
the like (Anastasi, 1958a, Chs. 10 and 
11). In the more recently developed 
multiple-aptitude batteries the various 
aptitudes covered by conventional in- 
telligence tests (e.g., verbal comprehen- 
sion, numerical reasoning, computation) 
are combined with several of those tra- 
ditionally measured by special aptitude 
tests (e.g., perceptual speed and ac- 
curacy, mechanical comprehension, 
spatial visualization). Such batteries are 
discussed in a later section of the present 
chapter concerned with the differentia- 
tion of abilities. 

The principal object of achievement 
tests is to appraise the effects of a course 
of instruction or training (Anastasi, 
1954, Chs. 17 and 18; Cronbach, 1960, 
Ch. 13; Nunnally, 1959, Ch. 12). Al- 
though not restricted to school work, 
such tests have found their principal 
application in education. At the ele-
mentary-school level there are available
a number of comprehensive batteries, 
such as the Sequential Tests of Educa- 
tional Progress (STEP) and the Stan- 
ford Achievement Test. Similar bat- 
teries have been developed for the 
high-school student, although with in- 
creasing curricular specialization at the 
upper levels more use is made of 
achievement tests in special areas. Of 
particular interest in psychological re- 
search with children are the many tests 
designed to gauge readiness for, or to 
identify special disabilities in, reading 
and arithmetic. 

Achievement tests have generally 
been contrasted with aptitude tests, the 
latter including both intelligence and 
special aptitude tests. From one angle, 
the difference between achievement and 
aptitude testing stems from the degree 
of uniformity of relevant antecedent 
learning. Achievement tests measure the 
effects of relatively standardized sets of 
experiences, such as a course in fourth 
grade arithmetic or American history. 
By contrast, aptitude-test performance 
reflects the cumulative influence of a
multiplicity of experiences of daily life. 
Thus achievement tests may be said to 
measure the effects of learning under 
partially known and controlled condi- 
tions, whereas aptitude tests measure the 
effects of learning under relatively un- 
controlled and unknown conditions. In 
this connection we should guard against 
the erroneous assumption that achieve- 
ment tests are concerned with the 
effects of learning, whereas aptitude 
tests tap “innate capacities” inde- 
pendently of learning. Although this 
misconception was prevalent in the 
early days of testing, it is now rec-
ognized that all tests measure current
developed abilities, which inevitably re-
flect the influence of prior learning.

A second distinction between apti- 
tude and achievement tests pertains to 
their use. Although aptitude tests serve
to predict subsequent performance,
achievement tests generally provide a
terminal evaluation upon the comple-
tion of training. Even this distinction,
however, is one of degree. Any achieve-
ment test, for example, could be used
as a predictor of future learning and 
thus serve the purpose of an aptitude 
test. Thus the progress that a pupil has 
made in arithmetic, as revealed by his 
present achievement-test score, may be 
employed to predict his subsequent 
success in algebra. The fact that a test 
score has a “past” does not preclude 
its having a “future.” It should also be 
added that a number of recent achieve- 
ment tests cover relatively broad and 
unstandardized educational experiences 
and thus fall midway between tradi- 
tional aptitude and achievement tests. 
As examples may be cited tests of 
creative thinking, critical ability, rea-
soning, and scientific aptitude (Anastasi,
1954, pp. 489-492, 514-516). Such tests 
stress the application of previously ac- 
quired knowledge to novel situations, 
as illustrated by the evaluation of con- 
clusions from given facts or the der- 
ivation of scientific principles from 
imaginary data. 


PROBLEMS OF TEST ADMINISTRATION

Each type of test and each age level 
present their own characteristic prob- 
lems of test administration. Individual 
tests, such as the Stanford-Binet, as
well as infant and preschool tests, re-
quire a special course of training before
they can be properly applied. Practical 
suggestions regarding general testing 
procedure at the infant and preschool 
level can be found in Goodenough 
(1949, Ch. 20) and Watson (1951, Ch. 
12). Terman and Merrill (1960, pp. 46- 
58) discuss problems in the individual 
testing of older children. Helpful hints 
regarding group testing are given by 
Thorndike (1949, pp. 261-265) and 
Ligon (1942). A more detailed analysis 
of desirable group testing practices, 
with supporting data, is provided by 
Traxler (in Lindquist, 1951, Ch. 10). 
Some of the major recommendations re- 
garding test administration will be con- 
sidered in the paragraphs which follow. 
But the reader interested in testing at a 
particular level is urged to consult the 
original sources cited for fuller discus- 
sions. 

A fundamental requirement of all 
good test administration is advance 
preparation. In testing there can be no 
emergencies. Special efforts must there- 
fore be made to foresee and forestall 
emergencies. The type of necessary 
preparation may range from the 
memorization of oral instructions, 
through the checking of materials and 
the choice of an adequate testing room, 
to conferences and “dry runs” with 
proctors and assistants. 

Test scores can have little meaning 
unless they are obtained under uniform 
testing conditions. Failure on the part of 
inadequately trained examiners to realize 
the susceptibility of test scores to even 
slight variations in conditions is one of 
the chief causes of inaccurate test re- 
sults. The need for uniformity applies 
not only to such obvious factors as time
limits and wording of directions but 
also to more subtle conditions. In cer- 
tain tests, for example, performance 
may be appreciably affected by the 
rate at which the examiner speaks, 
where he places emphasis and when 
he pauses in his presentation, his facial 
expression while pronouncing key 
words that might give away the correct 
answer, and the position of materials to 
be employed by the subject. There is 
evidence that the use of desks rather 
than chairs with desk arms may sig- 
nificantly improve scores on tests given 
with a separate answer sheet (Lindquist, 
1951, p. 360). Any unusual condition 
of the subject, such as illness, fatigue, 
or excessive worry, is, of course, likely 
to affect test scores adversely. Even the 
nature of the subject’s activities im- 
mediately preceding a test must be 
taken into account. Thus two studies 
employing the Goodenough Draw-a- 
Man Test with school children showed 
that mean scores tended to be higher 
after an emotionally gratifying ex- 
perience than after a neutral or de- 
pressing activity (McCarthy, 1944; 
Reichenberg-Hackett, 1953). 

Underlying the use of all ability tests
is the assumption that the subject is 
“doing his best.” Consequently, if con- 
ditions are to be kept uniform in this 
regard, every subject should be moti- 
vated to put forth his maximum efforts 
on the test. Among the motivational 
and emotional conditions that have 
been found to affect test scores may be 
mentioned praise, reproof, ridicule, 
knowledge of results, presence of ob- 
servers, competition and rivalry, and 
various conditions evoking feelings of 
frustration, failure, and discouragement
(Anastasi, 1954, pp. 47-50; Cronbach,
1960, Ch. 3). When a group of children 
were examined with the Stanford-Binet 
upon entering kindergarten and re-
tested with a parallel form two months
later, a significant mean rise in IQ was 
found (McHugh, 1943). This gain was
attributed largely to the effect of the
kindergarten experience in reducing
shyness, fear of strangers, and other
attitudes inhibiting oral speech. Support
for such a hypothesis was found in the
fact that the mean test-retest improve-
ment was only 4.7 per cent in manipu-
latory tasks, as contrasted to 11.2 per
cent in the oral items. In another study
(Hutt, 1947), the Stanford-Binet was
administered by a modified procedure in
which each task failed was followed by 
an easier item. In the standard adminis- 
tration of this scale failures are often 
followed by more difficult items, the 
test continuing until all items within a
single year level are failed. The hypoth- 
esis under investigation was that such a 
procedure may produce a mounting 
awareness of failure, which prevents the 
child from doing as well as he might 
on the later tasks. The results showed 
that poorly adjusted children tended to 
score higher when tested by the adapted 
than by the standard procedure, 
whereas well-adjusted children did 
equally well by both methods. 

It is likely that motivational factors in
general play a more prominent role in 
the test performance of certain types 
of subjects. Most middle-class Ameri- 
can school children today are not only 
fairly test-wise, but they are also as a
rule motivated to succeed in test situa- 
tions. Special motivational problems are 
encountered, however, in testing such
groups as emotionally maladjusted in- 
dividuals, preschool children, ethnic 
minorities, and members of low socio-
economic classes. Juvenile delinquents
and others tested in institutional settings
are likely to manifest unfavorable atti-
tudes, such as suspiciousness, insecurity,
fear, or cynical indifference. Specific
factors in the past experiences of such 
individuals are also likely to influence 
their test performance adversely (Sears, 
1943). Such children may, for example, 
have developed feelings of hostility and 
inferiority toward any academic ma- 
terial as a result of early failures and 
frustrations experienced in school work. 

Experimental findings regarding the
effects of motivation upon test scores 
have a number of important implications 
for testing procedure. First, such results 
highlight the necessity for adhering to 
the prescribed motivating conditions in 
test administration as long as the in- 
dividual’s performance is to be evaluated 
in terms of standard norms. A second 
implication is that in the interpretation 
of scores any unusual motivating condi- 
tions must be taken into account. This 
is especially true in the case of subjects 
whose experiential background is un-
like that of the standardization sample.
Finally, it is apparent that the establish- 
ment of rapport prior to the adminis- 
tration of a test is an important part of 
the testing procedure. Insofar as the 
situation permits, the examiner should 
make certain that the subject is ready 
to do his best before the test begins. 

The specific techniques for establish-
ing rapport vary with the age level and
other characteristics of the subjects. In 
testing preschool children, special fac-
tors to be considered include shyness
with strangers, distractibility, and 
negativism. A friendly, cheerful, and 
relaxed manner on the part of the ex- 
aminer helps to reassure the child. The 
shy, timid child may need more pre- 
liminary time to become familiar with 
his surroundings. For this reason, it is 
better for the examiner not to be too 
demonstrative at the outset but rather 
to wait until the child is ready to make 
the first contact. Test periods should 
be brief, and the tasks should be varied 
and intrinsically interesting to the child. 
The testing should be presented to the 
child as a game, and his curiosity should 
be aroused before each new task is in- 
troduced. A certain flexibility of pro- 
cedure is necessary at this age level 
because of possible refusals, loss of 
interest, and other instances of nega- 
tivism. 

Children in the first two or three 
grades of elementary school present 
many of the same problems encountered 
in testing the preschool child. The 
“game” appeal is still the most effective 
way of arousing their interest in the 
test. The older school child, on the 
other hand, can usually be motivated 
by an appeal to his competitive spirit 
and his desire to do well on a test. It 
should be borne in mind, however, that 
every test presents an implied threat 
to the individual’s prestige. Some reas- 
surance should therefore be given at the 
outset. It is helpful to explain, for ex- 
ample, that no one is expected to finish 
or to get all items correct. It is also 
desirable to eliminate the element of 
surprise from the test situation insofar 
as possible. For this purpose, many 
group tests provide a preliminary ex-
planatory statement, which is read to
the group, or even a practice booklet 
administered in advance of the testing 
session. 

That test scores may be appreciably 
raised by special practice and coaching 
has been repeatedly demonstrated (Ana- 
stasi, 1958a, pp. 190-194). The effects 
vary markedly, however, with the 
nature of the test and with the age, 
ability, and previous experience of the 
subjects. In the case of certain tests
repetition of the identical form within 
a few weeks may make the retest score 
meaningless because of recall of re- 
sponses, change in work methods, and 
other disturbing factors. Even retesting 
with alternate forms generally leads to 
some improvement when time intervals 
are short. In such cases some adjustment 
in score should be made to allow for 
practice effect. General test sophistica- 
tion must also be considered in inter- 
preting test performance. In general, the 
child who has had considerable test- 
taking experience tends to score higher 
than the one who is taking his first 
test. 
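
The text does not prescribe how the adjustment for practice effect should be made. One simple possibility, offered here only as a sketch under stated assumptions, is to estimate the average retest gain in a comparison group that was retested over the same interval with no intervening treatment, and to subtract that gain from the retest scores of the subjects under study. The function names and the comparison-group design are assumptions of this illustration.

```python
from statistics import mean
from typing import List, Sequence


def mean_practice_gain(first: Sequence[float], retest: Sequence[float]) -> float:
    """Average gain from first test to retest in a comparison group.

    Assumption: the comparison group received no treatment between the
    two testings, so its mean gain estimates the practice effect alone.
    """
    return mean([r - f for f, r in zip(first, retest)])


def adjust_retest_scores(retest_scores: Sequence[float],
                         practice_gain: float) -> List[float]:
    """Subtract the estimated practice effect from each retest score."""
    return [score - practice_gain for score in retest_scores]
```

Because practice effects vary with the test and with the age, ability, and previous experience of the subjects, a single constant of this kind is at best a rough correction, and it applies only when the comparison group resembles the subjects in these respects.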

In evaluating the effect of practice 
and coaching upon test scores, a funda- 
mental question to consider is the 
breadth of such influences (Anastasi, 
1950a). Is the improvement limited to 
the specific test items, or does it extend 
to the broader area of behavior that 
the test is designed to predict? The 
answer to this question represents the 
difference between coaching and educa- 
tion. Rote memorization of the correct 
responses to a list of vocabulary test 
items will markedly raise a child’s score 
on that particular test without
appreciably improving his over-all
appreciation of word meanings. This would be
coaching in the narrow sense and would 
reduce the predictive value of the test 
for that child. On the other hand, a 
year of carefully selected reading, ac- 
companied by discussion of word mean- 
ings, would improve both the child’s 
vocabulary test score and his general 
verbal comprehension, thereby leaving 
unaltered the test’s validity. 

Test scores may also be invalidated 
through deliberate cheating on the part 
of subjects. Cheating on ability tests 
may take various forms: copying from a
neighbor in group examinations; illicit 
use of outside sources such as books or 
notes; “jumping the gun,” or premature 
starting before the signal to begin, as 
well as continuing after the instruction 
to stop; and obtaining prior information 
regarding specific test items. Improper 
and unscrupulous coaching would also 
fall under the last-named category. 
General precautionary measures include 
safeguarding the security of test mate- 
rials and providing adequate proctoring 
and proper seating arrangements in 
group testing. The use of tests with 
several alternate forms also helps to
minimize cheating. Premature starting 
and failure to stop on signal can be 
partially controlled by the design of 
test booklets in such a way that a page 
must be turned at the beginning of each 
separately timed part. In scheduling 
test sessions, attention should also be 
given to reducing the opportunity for 
communication among subjects tested
at different times. If copying or
collusion with another examinee is
suspected, certain statistical checks based
on number of identical errors may be 
applied to detect cheating (Anikeeff, ...;
Dickenson, 1945). In the use of tests
with older children for research
purposes only, preserving the subject’s
anonymity is often an effective way to
reduce the motivation to cheat.
Reassurance that the tests are to be utilized
for research only usually has somewhat
the same effect. It should be noted,
however, that these motivating
conditions are not comparable to those
prevailing during the operational use of
tests, and results may vary from one
situation to the other.
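
As a rough illustration of a check based on identical errors, the sketch below counts the items on which two examinees gave the same wrong answer. The function name, the answer key, and the response lists are hypothetical, and this simple count is not the specific index proposed in the sources cited above. Interpreting it would require comparison with the counts typically found among pairs of examinees who could not have collaborated.

```python
def identical_errors(answers_a, answers_b, key):
    """Count items both examinees missed with the same wrong response."""
    count = 0
    for a, b, correct in zip(answers_a, answers_b, key):
        # An identical error: both responses agree and both are wrong.
        if a == b and a != correct:
            count += 1
    return count


# Hypothetical multiple-choice responses to eight items.
key = ["A", "C", "B", "D", "E", "A", "B", "C"]
examinee_1 = ["A", "B", "B", "A", "E", "C", "B", "D"]
examinee_2 = ["A", "B", "B", "A", "E", "D", "B", "D"]

print(identical_errors(examinee_1, examinee_2, key))  # 3 (items 2, 4, and 8)
```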


NORMS AND THE INTERPRETATION
OF TEST SCORES


A major step in test standardization 
is the establishment of norms. The 
“raw score,” such as the number of 
items correct or the time required to 
complete a task, is meaningless until 
compared with the norm or average 
performance. Psychological tests have
no pre-established standards of passing 
or failing. In the process of developing 
a new test, it must be administered to 
a large, representative sample of the 
population for which it is designed. 
The scores obtained by this group pro- 
vide the test norms. Test manuals 
should always include such norms, to- 
gether with information on the size 
and nature of the normative sample. 
The test user will thus be in a position 
to evaluate the adequacy of the pub- 
lished norms as well as the compara- 
bility of the normative sample to the 
type of subjects he plans to test. Owing 
to differences in the characteristics of 
samples upon which different tests have 
been standardized, the same individual 
may appear to score much higher on
one test than on another. This is one
of the reasons why test scores should 
always carry the name of the test with 
which they were obtained. 

An individual’s position in relation to 
the norms may be expressed in a num- 
ber of specific ways. Basically, how- 
ever, all such derived scores in current 
use can be subsumed under three types 
(Anastasi, 1954, Ch. 4; Cronbach, 1960, 
Ch. 4). The first is the traditional ratio
IQ, found by dividing mental age by
chronological age and multiplying the
quotient by 100. Theoretically, mental
age represents the age of normal
children whose test performance the 
subject equals. Thus if a child does as 
well on the test as the average 10-year- 
old, his mental age is 10, regardless of 
his chronological age. A second major 
type of score is the percentile rank, 
which indicates the percentage of cases 
in the normative sample falling at or 
below the subject's score. Percentiles 
are not to be confused with the ordinary 
percentage score representing the per- 
centage of items correctly completed. 
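
The two derived scores just described can be sketched in a few lines. The function names and the normative sample below are hypothetical. The ratio IQ is multiplied by 100, following the conventional definition, and the percentile rank follows the "at or below" definition given above.

```python
def ratio_iq(mental_age, chronological_age):
    """Ratio IQ: mental age divided by chronological age, times 100."""
    return 100.0 * mental_age / chronological_age


def percentile_rank(score, normative_scores):
    """Percentage of normative cases falling at or below the given score."""
    at_or_below = sum(1 for s in normative_scores if s <= score)
    return 100.0 * at_or_below / len(normative_scores)


# Hypothetical normative sample of raw scores.
norms = [12, 15, 15, 18, 20, 21, 21, 23, 25, 30]

# An 8-year-old who performs like the average 10-year-old.
print(ratio_iq(10, 8))             # 125.0
# Seven of the ten normative cases fall at or below a raw score of 21.
print(percentile_rank(21, norms))  # 70.0
```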

Despite their wide popular appeal, 
both ratio IQ’s and percentiles have 
serious limitations and are not in gen- 
eral suitable for research purposes. 
Ratio IQ’s can be properly applied only 
to age scales that have been specifically 
designed to yield comparable IQ’s at all
ages. When indiscriminately employed, 
ratio IQ’s may vary widely in meaning 
at different age levels. On the Merrill- 
Palmer Scale, for example, an IQ of 
114 at one age level indicates the same 
degree of superiority as an IQ of 141 
at another age (Stutsman, 1931). As for 
percentile scores, their chief weakness 
stems from the marked variation in size 
of unit. Owing to the greater clustering
of cases at the center of the range and
the decrease in frequency as the ex- 
tremes are approached, percentile units 
near the center (i.e., 50) cover a much 
smaller distance than those near the 
ends of the scale (i.e., 0 and 100).

The most precise measure is provided 
by standard scores and their various 
derivatives. In all such scores the indi- 
vidual’s distance from the mean is ex- 
pressed in standard deviation units 
(SD). Thus, if the normative sample 
has a mean of 38 and an SD of 4, a 
raw score of 34 would correspond to 
a standard score of -1.00. Such simple
standard scores, based directly on the 
SD units, are often called z scores. Sev- 
eral available variants of standard scores 
provide more convenient scales, which 
avoid the use of negative values and 
decimals. An example is the T score, 
representing standard scores in a nor- 
mal distribution with a mean of 50 and 
an SD of 10. A z score of -1.00 would
thus correspond to a T score of 40. 
Another type of standard score is the 
stanine, which utilizes a single-digit 
scale ranging from 1 to 9 and having 
a mean of 5 and an SD of 2. 
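
The conversions described in this paragraph can be written out as a brief sketch that reproduces the worked example (mean 38, SD 4, raw score 34). The function names are hypothetical. The stanine conversion shown is the simple linear one implied by a mean of 5 and an SD of 2, rounded and held within 1 to 9; published tests may instead assign stanines from fixed percentage bands of the normative distribution, which is assumed here to give closely similar results.

```python
import math


def z_score(raw, mean, sd):
    """Distance of a raw score from the normative mean, in SD units."""
    return (raw - mean) / sd


def t_score(z):
    """Standard score on a scale with mean 50 and SD 10."""
    return 50 + 10 * z


def stanine(z):
    """Single-digit standard score (mean 5, SD 2), limited to 1 through 9."""
    rounded = math.floor(5 + 2 * z + 0.5)  # round half upward
    return max(1, min(9, rounded))


z = z_score(34, mean=38, sd=4)
print(z)           # -1.0
print(t_score(z))  # 40.0
print(stanine(z))  # 3
```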

Of particular interest is the deviation 
IQ, currently employed in the WISC 
and Stanford-Binet, as well as in several 
recently developed group tests. WISC 
deviation IQ’s are actually standard 
scores with a mean of 100 and an SD 
of 15. Thus a child with a deviation 
IQ of 100 falls exactly at the mean 
of his age group. Deviation IQ's of 115 
and 85 correspond to one SD above and 
one SD below the mean, respectively; 
a deviation IQ of 130 represents two 
SD’s above the mean, and so forth. In 
a similar fashion, the 1960 Stanford-
Binet uses a deviation IQ with a mean
of 100 and an SD of 16. The relation
of the various forms of standard scores
to one another and to percentiles is
given in Fig. 11-6.
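
A deviation IQ is simply a standard score re-expressed on a scale with a mean of 100. The sketch below, with hypothetical function names, converts a z score to a deviation IQ for an SD of 15 or 16 and, on the added assumption that scores are normally distributed, finds the corresponding percentile rank. The normality assumption is the sketch's own and need not hold for any particular test.

```python
from statistics import NormalDist


def deviation_iq(z, sd=15):
    """Standard score on a scale with mean 100 and the given SD."""
    return 100 + sd * z


def percentile_from_iq(iq, sd=15):
    """Percentile rank of a deviation IQ, assuming a normal distribution."""
    return 100 * NormalDist(mu=100, sigma=sd).cdf(iq)


print(deviation_iq(1.0))                   # 115.0 on a scale with SD 15
print(deviation_iq(1.0, sd=16))            # 116.0 on a scale with SD 16
print(round(percentile_from_iq(115), 1))   # 84.1
print(round(percentile_from_iq(130), 1))   # 97.7
```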


TEST RELIABILITY 


One of the questions that needs to 
be asked in evaluating any psychological
test concerns its reliability. In general,
test reliability refers to the
consistency of scores obtained by the same
individuals on different occasions. Ran- 
dom fluctuations in test scores may re- 
sult from a number of chance errors 
affecting test administration, scoring, 
condition of subject, or selection of 
items constituting a particular form of 
a test. Different methods of computing 
test reliability are influenced by one or 
more of these various types of chance 
errors of measurement. Consequently, 
not all reliability coefficients have the 
same meaning. To interpret a reliability 
coefficient, it is necessary to know by 
what procedure it was found.

The principal types of test reliability
in current use include retest,
equivalent-form, split-half, and Kuder-Richardson
reliability (Anastasi, 1954, Ch. 5;
Cronbach, 1960, Ch. 6). All express
reliability in the form of a correlation
coefficient between independently
secured measures. Retest reliability is the
correlation between scores on the
identical test administered at different
times. The interval may vary from a
few days to several months. Obviously
such a correlation indicates temporal
stability from one test session to
another. When parallel forms of the test
are administered on the two occasions,
the correlation between them
represents equivalent-form reliability. Such
a measure is influenced both by
temporal fluctuations and by the extent of
individual variability from one
comparable set of test items to another.
The latter characteristic alone is
measured by split-half reliability. The
commonest way of computing
split-half reliability is to find each person’s
score on the odd items and on the even
items. From these two scores the
reliability of the whole test can be
determined in several ways (Anastasi,
1954, pp. 108-109; Cronbach, 1951;
Thorndike, 1949, Ch. 4). Like split-half
reliability, Kuder-Richardson reliability is
derived from a single test session. Being
ultimately based on interitem
consistency, however, it is also influenced by
homogeneity of test content. For
example, in a test consisting of 50 verbal
analogies and 50 number-series
completions, the Kuder-Richardson reliability
would be lower than in one composed
of 100 items of either type alone. Any
discrepancy between subjects’
performance on the verbal and on the
numerical items would be reflected in a
lowered Kuder-Richardson reliability in
the former test.

Fig. 11-6. Types of test norms. [Vertical axis: number of cases.]
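
The two single-session coefficients can be sketched as follows. The passage does not say which of the "several ways" is used to pass from the half-test correlation to whole-test reliability; the sketch assumes the Spearman-Brown formula, one common choice. The Kuder-Richardson coefficient is computed by formula 20, using the variance of total scores with the number of cases as divisor. The function names and the small matrix of item scores (1 = pass, 0 = fail) are hypothetical.

```python
from math import sqrt


def pearson_r(x, y):
    """Product-moment correlation between two lists of scores."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def split_half_reliability(item_scores):
    """Odd-even correlation, stepped up to full length by Spearman-Brown."""
    odd = [sum(row[0::2]) for row in item_scores]   # items 1, 3, 5, ...
    even = [sum(row[1::2]) for row in item_scores]  # items 2, 4, 6, ...
    r_half = pearson_r(odd, even)
    return 2 * r_half / (1 + r_half)


def kr20(item_scores):
    """Kuder-Richardson formula 20 for items scored 0 or 1."""
    n = len(item_scores)
    k = len(item_scores[0])
    totals = [sum(row) for row in item_scores]
    mean_total = sum(totals) / n
    var_total = sum((t - mean_total) ** 2 for t in totals) / n
    sum_pq = 0.0
    for j in range(k):
        p = sum(row[j] for row in item_scores) / n  # proportion passing item j
        sum_pq += p * (1 - p)
    return (k / (k - 1)) * (1 - sum_pq / var_total)


# Hypothetical data: five subjects (rows) by four items (columns).
data = [
    [1, 1, 1, 1],
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 0, 0],
]

print(round(split_half_reliability(data), 2))  # 0.88
print(round(kr20(data), 2))                    # 0.8
```

With real tests the two coefficients will generally differ, since the Kuder-Richardson value reflects the consistency of all items with one another rather than the agreement of one particular pair of halves.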

Mention should also be made of ex- 
aminer and scorer reliability. Most tests 
provide such highly standardized pro- 
cedures for administration and scoring 
that variations attributable to the ex- 
aminer are negligible. This is particu- 
larly true of group tests designed for 
mass examining and machine scoring. 
In most infant and preschool tests, how- 
ever, as well as in other individual tests, 
such as the Stanford-Binet, testing pro- 
cedure leaves more room for the skill 
and judgment of the examiner to op- 
erate. Under such conditions, it has 
been found that even properly qualified 
examiners may obtain appreciably dif- 
ferent results from the same subjects. 
Various techniques, including correla- 
tion and analysis of variance, may be 
employed to check on examiner relia- 
bility. Certain types of tests, such as 
the Goodenough Draw-a-Man Test, 
present special problems of scorer reli- 
ability. In one investigation of this test 
interscorer discrepancies amounting to 
as much as a year of mental age were 
found for about 25 per cent of the 
drawings (McCarthy, 1944). In any re- 
search project utilizing tests whose 
scoring has qualitative features at least 
a sample of papers should be independ- 
ently rescored by another examiner and 
the degree of agreement determined. 
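
One simple way of expressing the degree of agreement between two scorers is sketched below: the mean absolute discrepancy and the proportion of papers on which the two scorers differ by at least a stated amount. The function name, the threshold, and the mental-age values (in months) are hypothetical; a correlation or an analysis of variance, as mentioned above, could equally well be applied to the same paired scores.

```python
def scorer_agreement(scores_a, scores_b, tolerance):
    """Return the mean absolute discrepancy and the proportion of papers
    on which the two scorers differ by at least `tolerance`."""
    diffs = [abs(a - b) for a, b in zip(scores_a, scores_b)]
    mean_diff = sum(diffs) / len(diffs)
    share_large = sum(1 for d in diffs if d >= tolerance) / len(diffs)
    return mean_diff, share_large


# Hypothetical mental ages, in months, assigned to the same eight drawings.
scorer_1 = [60, 72, 66, 84, 90, 54, 78, 96]
scorer_2 = [63, 72, 78, 81, 90, 60, 75, 84]

# A tolerance of 12 months flags discrepancies of a year or more.
mean_diff, share_large = scorer_agreement(scorer_1, scorer_2, tolerance=12)
print(mean_diff)    # 4.875
print(share_large)  # 0.25
```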

In interpreting the reliability coeffi- 
cients reported in test manuals, certain 
cautions should be observed. First, 
single-trial methods, such as the split- 
half and Kuder-Richardson techniques, 
are inapplicable to speed tests. In a pure
speed test, for example, subjects will
make no errors (or possibly a negligible
number of errors through carelessness).
Thus an individual with a total score of
20 would have odd and even scores of
10 and 10, respectively; one scoring 28
would obtain odd and even scores of
14 and 14, and so on. Such a situation
would obviously result in perfect or
nearly perfect reliability. But this
reliability would have no reference to
speed of work, which is the essential
determinant of scores on such a test.
Insofar as speed plays an appreciable
part in determining test scores,
reliability coefficients computed by these
methods are correspondingly inflated.
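
The point can be made concrete with a small sketch, assuming a pure speed test in which every attempted item is answered correctly. The function name is hypothetical. Because the two half-scores are fixed entirely by the number of items reached, they can never differ by more than one point, and their correlation across subjects is necessarily perfect or nearly so.

```python
def odd_even_scores(items_completed):
    """Odd and even half-scores when every attempted item is correct."""
    odd = (items_completed + 1) // 2   # items 1, 3, 5, ... among those reached
    even = items_completed // 2        # items 2, 4, 6, ... among those reached
    return odd, even


for total in (20, 28, 35):
    print(total, odd_even_scores(total))
# 20 (10, 10)
# 28 (14, 14)
# 35 (18, 17)
```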

A second caution concerns the
subjects upon whom reliability was
established. All correlation coefficients are
affected by the range of scores within
the group. The greater the variability,
the higher will be the correlation. Since 
tests are usually employed to discrimi- 
nate among individuals within single age 
or grade levels, reliability coefficients 
should be reported separately for these 
levels. Such a procedure also makes it 
possible to determine whether the test 
is equally reliable for all ages. A single 
reliability coefficient found on a highly 
heterogeneous standardization sample, 
on the other hand, may be misleadingly 
high. Finally, it should be noted that 
reliability is directly related to test 
length. Thus an abbreviated form of a 
test can be expected to show some loss 
in reliability. Similarly, subtests or parts 
of a test will have lower reliability than 
the complete test. If subtest scores are 
to be separately analyzed, as in an apti- 
tude profile, reliability coefficients 
should be determined for each subtest. 
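
The dependence of reliability on test length and on group variability can be illustrated with two standard formulas that the passage does not itself name. The first is the Spearman-Brown formula for a test lengthened or shortened by a given factor. The second estimates reliability in a group with a different SD, on the assumption that the error variance of measurement is the same in both groups. Function names and numerical values are hypothetical.

```python
def spearman_brown(reliability, length_factor):
    """Estimated reliability when test length is multiplied by length_factor."""
    return length_factor * reliability / (1 + (length_factor - 1) * reliability)


def reliability_for_new_sd(reliability, sd_original, sd_new):
    """Estimated reliability in a group whose SD differs from the original,
    assuming the error variance is equal in the two groups."""
    error_variance = sd_original ** 2 * (1 - reliability)
    return 1 - error_variance / sd_new ** 2


# Halving a test whose full-length reliability is .90.
print(round(spearman_brown(0.90, 0.5), 2))               # 0.82

# A coefficient of .90 found in a heterogeneous sample (SD 20),
# re-estimated for a single age group with an SD of 10.
print(round(reliability_for_new_sd(0.90, 20, 10), 2))    # 0.6
```

The second result shows how a coefficient that looks satisfactory for a wide-range sample may shrink considerably within the narrower groups in which the test is actually used.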

With school-age children, most
well-constructed ability tests yield reliability
coefficients clustering around .90. In
the case of infant and preschool tests,
however, reliabilities tend to run lower
(Anastasi, 1954, Ch. 11). Among the
conditions contributing to such low
reliability may be mentioned
distractibility, shyness, negativism, and other
factors interfering with rapport and 
test administration. Reliability may be 
further lowered by scoring irregulari- 
ties, since many test responses of the 
young child consist of fleeting move- 
ments that leave no permanent record. 
Despite these difficulties, some of the 
more carefully developed infant tests, 
such as the Cattell Infant Intelligence 
Scale and the California First-Year 
Mental Scale, have yielded promising 
results, their split-half reliabilities
falling between .70 and .95 after the first
3 months of age (Anastasi, 1954, p.
288). Even during the first three
months, there is evidence that when a 
larger number of items is employed and 
purely motor tasks are excluded, as in 
the Northwestern Infant Intelligence 
Tests, split-half reliabilities in the .80’s 
and .90’s are obtained (Anastasi, 1954, 
p. 288). 


TEST VALIDITY 


The most important property of a 
test is undoubtedly its validity. The 
concept of validity concerns the “ex- 
ternal relations” of a test to other data 
about the individual. Such relations en- 
able us to state what the test measures 
and how well it does so. It is
meaningless to say that a test is “valid” or has
“high validity,” without indicating the
purpose or criterion for which it
possesses the specified validity. For the
selection of appropriate tests, as well as
for the proper interpretation of test 
scores, full information is required re- 
garding the procedures followed in esti- 
mating test validity. 

In accordance with current usage, 
the various kinds of validity may be 
classified into four categories, namely, 
predictive, concurrent, content, and 
construct validity (American Psycho- 
logical Association, 1954, pp. 13-28; 
Anastasi, 1954, Ch. 6; Cronbach, 1960, 
Ch. 5). To measure predictive validity, 
test scores are checked against per- 
formance in a specified area at some 
future time. For example, scores on a 
reading-readiness test administered upon 
school entrance may be correlated with 
actual reading achievement at the end 
of the first grade. Concurrent validity 
utilizes essentially the same procedure, 
except that no time lag is involved. The 
administration of an intelligence test to 
school children and to children of the 
same ages in an institution for mental 
defectives would illustrate this type of 
validation. If the mental defectives 
score significantly poorer than the 
normal children, the test discriminates 
in the expected direction and can in 
that sense be said to be measuring “in- 
telligence.” 
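
A contrasted-groups comparison of this kind might be carried out as in the sketch below, which computes the difference between the two group means and a t statistic that does not assume equal variances (Welch's form). The choice of this particular statistic, the function name, and the scores are all assumptions made for illustration; the statistic would still have to be referred to the t distribution to judge significance.

```python
from math import sqrt
from statistics import mean, variance


def welch_t(group_1, group_2):
    """Mean difference and Welch's t statistic for two independent groups."""
    m1, m2 = mean(group_1), mean(group_2)
    v1, v2 = variance(group_1), variance(group_2)  # sample variances (n - 1)
    standard_error = sqrt(v1 / len(group_1) + v2 / len(group_2))
    return m1 - m2, (m1 - m2) / standard_error


# Hypothetical test scores for two groups of children of the same ages.
school_children = [100, 110, 90, 105, 95]
institutional_children = [60, 70, 50, 65, 55]

difference, t = welch_t(school_children, institutional_children)
print(difference)  # 40
print(t)           # 8.0
```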

Content validity is most often used 
in the case of achievement tests. Also
designated as “intrinsic” and “logical” 
validity, this type of validity concerns 
primarily the adequacy with which the 
test items sample the content area to be 
measured. The test may thus be checked 
against relevant course syllabi,
textbooks, and the judgment of
subject-matter experts. Coverage of essential
topics, in the correct proportions, is 
the major consideration. It is also de- 
sirable to secure, if possible, objective 
data on the extent to which irrelevant 
abilities, such as reading comprehension 
in a mathematics test, may affect scores. 
Gulliksen (1950) has suggested a num- 
ber of objective procedures for improv- 
ing content validation. 

Construct validity is a very broad 
concept, covering a variety of valida- 
tion procedures. It is concerned pri- 
marily with an experimental verification 
of hypotheses regarding the psycho- 
logical traits or “theoretical constructs” 
that account for performance on the 
test (Cronbach and Meehl, 1955). The 
commonest application of this valida- 
tion procedure to ability tests is found 
in factorial validity. Thus, if a factor 
of spatial visualization has been estab- 
lished by factor analysis, the correlation 
of a test with this factor is its factorial 
validity. Essentially, such a correlation 
indicates the extent to which the test 
measures an ability common to a group 
of similar tests which sample a rela- 
tively unified area of performance. 
Almost any experimental evidence re- 
garding the behavior of test scores 
would contribute to construct validity. 
If, for instance, it is hypothesized that 
a given ability is not susceptible to 
practice effect, a comparison of test 
scores obtained before and after prac- 
tice would provide information relevant 
to construct validation. Another
example is provided by a study of the role
of reading comprehension in tests
designed to measure such traits as
numerical or spatial aptitudes. As a part of the
validation of such tests, it should be
established that low scores do not com- 
monly result from inability to under- 
stand complicated wording of questions 
or instructions. 


Most aptitude and intelligence tests 
have been validated in terms of pre- 
dictive or concurrent validity. These 
two types of validity, which have
sometimes been jointly designated as
“empirical validity” (Anastasi, 1954, Ch. 6;
Guilford, 1946), consist basically in the
correlation or other statistical
comparison of test scores with a criterion. The
criterion is an independent index of
those behavioral characteristics that the
test is designed to measure. Special
aptitude tests and multiple-factor batteries
have frequently been validated against
performance in different types of 
courses, such as English, mathematics, 
shop work, mechanical drawing, art, 
and music. Intelligence tests have uti- 
lized many criteria, although the com- 
monest center around general academic 
achievement. Preschool and infant tests 
have generally been validated in terms 
of age differentiation. In other words, 
individual items, as well as the test as 
a whole, are checked against chrono- 
logical age to determine whether per- 
formance shows a progressive improve- 
ment with advancing age. 
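
A check of age differentiation for a single item can be sketched very simply, assuming that the percentage of children passing the item is available for each age level. The function name and the percentages are hypothetical, and a strict requirement that the percentage never decrease is only one possible criterion; with small samples, minor reversals may reflect sampling error.

```python
def shows_age_progression(percent_passing_by_age):
    """True if the percentage passing never decreases with advancing age."""
    ages = sorted(percent_passing_by_age)
    values = [percent_passing_by_age[age] for age in ages]
    return all(earlier <= later for earlier, later in zip(values, values[1:]))


# Hypothetical percentages passing two items at ages 2 through 5.
item_a = {2: 10, 3: 35, 4: 60, 5: 85}
item_b = {2: 40, 3: 38, 4: 45, 5: 41}

print(shows_age_progression(item_a))  # True
print(shows_age_progression(item_b))  # False
```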

Ratings of intelligence by teachers, 
nursery-school workers, and other ob- 
servers have provided criterion data for 
validating certain intelligence tests. 
Contrasted groups have likewise been 
employed, as when intelligence test 
scores of children in institutions for 
mental defectives are compared with 
those of unselected public school pupils
of the same ages. Actually, such a
criterion is based upon the cumulative and
uncontrolled selective influences
operating in daily life that determine the
individual’s membership in one or the
other of the contrasted groups. Internal
consistency is sometimes reported as
evidence of validity. In this case either
test items or subtests are correlated with
total test score as a criterion. Although
indicative of test homogeneity and
unidimensionality, which are themselves
desirable properties, this type of
measure should not be regarded as a
substitute for validity. It might be added
that some psychologists would classify
age differentiation, contrasted groups,
and internal consistency under
construct validity (Cronbach and Meehl,
1955). The distinction cannot be very
sharply drawn in terms of current
usage.


Partly because of the multiplicity of 
possible criteria, different intelligence 
tests may vary widely in the nature of 
abilities measured. This is especially 
true of tests designed for different age 
levels or types of subjects. In fact, even 
a single test, such as the Stanford-Binet, 
may tap a different combination of 
functions at different ages. A consid- 
eration of such variations in coverage 
among “intelligence” tests is particu- 
larly important in planning and inter- 
preting longitudinal studies (Anastasi, 
1958a, pp. 225-226). For example, a 
factorial analysis of intelligence-test 
scores obtained over an eighteen-year 
period by the subjects of the Berkeley 
Growth Study suggested that the func- 
tions measured during the first 2 
years could be characterized largely as
“sensori-motor alertness”; those covered
between 2 and 4 years, as “per- 
sistence”; and those tested after the age 
of 4, as “manipulation of symbols” 
(Bayley, 1955; Hofstaetter, 1954). 


LONG-RANGE PREDICTION 


In the evaluation of infant and pre- 
school tests the results of long-range 
follow-up studies have often been cited 
in connection with both reliability and 
validity. When the time interval is of 
several years’ duration, however, it does 
not seem appropriate to regard a retest 
correlation as a reliability coefficient. 
Changes occurring over such periods 
are likely to be cumulative and pro- 
gressive for each individual, rather than 
random. Moreover, they usually charac- 
terize a relatively broad area of be- 
havior rather than being restricted to 
the test items. Nor can such retest cor- 
relations be properly considered as 
validity coefficients. Whether the same 
or a different test is used as the “cri- 
terion,” it is likely that somewhat dif- 
ferent functions are sampled when test- 
ing is repeated at widely separated age 
levels. Furthermore, intervening
experiences may substantially alter the
child’s intellectual status, producing 
changes that should be detected by a 
sensitive test. Although affected by 
both the reliability and validity of the 
measuring instrument, long-range cor- 
relations reflect primarily the regularity 
of behavior development and the inter- 
correlations among functions measured 
on the different occasions. It should be 
added that at the infant and preschool 
levels development occurs so rapidly
that retests over as short a period as
three months may be influenced by 
broad behavioral changes and should 
therefore be classified with long-range 
follow-ups. 

The traditional problem of the “con- 
stancy of the IQ” is concerned with 
long-range predictability of intellectual 
functioning and should be differenti- 
ated from considerations of test reli- 
ability and validity (Anastasi, 1954, pp. 
290-291, 302-304; Anastasi, 1958a, pp. 
231-238). On the basis of the Berkeley 
Growth Study data, Bayley (1955) con- 
cluded that tests given at 4 years of age 
may permit grade-school predictions 
within wide classifications; tests given 
between 2 and 4 years of age will pre- 
dict 8- and 9-year-old performance with 
some success; but scores obtained before 
18 months are completely useless in the 
prediction of abilities during school 
ages, the correlations being zero or 
negative. Similar results have been ob- 
tained by other investigators (Bradway, 
1944; Goodenough and Maurer, 1942; 
Honzik, Macfarlane, and Allen, 1948). 
Shorter follow-ups within the preschool 
period yield higher correlations. Thus 
correlations of .56, .67, and .71 were 
found between Stanford-Binet IQ’s at 
the age of 3 years and the Cattell Infant
Intelligence Scale administered at the
ages of 12, 18, and 24 months, respec- 
tively (Cattell, 1947). During the 
school years retest correlations with 
both individual and group intelligence 
tests are sufficiently high to make pre- 
dictions over five to ten years practi- 
cable. 

The magnitude of retest correlation 
depends upon a number of factors. 
Length of interval over which
predictions are made is obviously a major
condition. Among school-age children,
for example, correlations drop from
.90 for immediate retests to about .70
after a five-year lapse (Thorndike,
1933, 1940). Nature of intervening
experiences influences the amount and
direction of change in individual
children. It has been repeatedly
demonstrated that even in later childhood
large shifts in IQ may occur. Such
changes are often associated with the
cultural milieu and emotional climate
of the child’s environment (Bayley,
1955; Bradway, 1945; Honzik,
Macfarlane, and Allen, 1948; Sontag, Baker,
and Nelson, 1958). In general, children
reared in underprivileged environments
tend to lose with age, and those reared
in superior environments to gain, in
relation to test norms. Large changes in
IQ have also been found to be
associated with certain personality
characteristics of the child, such as emotional
independence and achievement motiva- 
tion (Sontag, Baker, and Nelson, 
1958). 

The comparison of results obtained 
with preschool and elementary-school 
children has already suggested that age 
at the time of the initial test is an
important determiner of retest
correlations. When length of interval is
constant, predictions from infant and
preschool tests are much less certain than
those made from school-age tests. There 
are several reasons for this increas- 
ing “constancy of the IQ” with age. 
The generally lower reliability and the 
less satisfactory procedures of test con- 
struction, administration, and scoring, 
which characterize available preschool 
and infant tests, are undoubtedly
contributing factors. Nature of test content
may likewise account for part of the
difference. An item analysis of the
Minnesota Preschool Scale suggested that
tests of sensori-motor coordination and
those calling for specific items of
information have less predictive value
than those emphasizing perception of
spatial relations, controlled attention,
memory, and logical relations (Maurer,
1947). A third possible factor concerns
school entrance itself. Unlike the school
child, the infant and preschool child
have not been exposed to the relatively
standardized series of experiences
represented by the school curriculum.
Hence in the latter cases the test
constructor lacks the large pool of
common experiential material from which
test items can be drawn. Under these
conditions, the development of effective
testing instruments is much more
difficult.

A final explanation is provided by 
Anderson’s “overlap hypothesis” (An- 
derson, 1940). Anderson writes, “Since 
the growing individual does not lose 
what he already has, the constancy of 
the IQ is in large measure a matter of 
the part-whole or overlap relation” 
(Anderson, 1940, p. 394). Even if an- 
nual increments in intellectual devel- 
opment bore no relation to one another, 
the intellectual level would exhibit 
growing constancy with age because 
earlier acquisitions constitute an ever- 
increasing proportion of the child’s to- 
tal intellectual fund. Anderson demon- 
strated that test-retest correlations ob- 
tained in published longitudinal studies 
agree closely with the correlations the- 
oretically expected on the basis of this 
overlap hypothesis. 
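
The reasoning behind the overlap hypothesis can be followed in a simplified model, which is an illustration and not Anderson's own computation. Suppose that intellectual level at any age is the sum of annual increments that are uncorrelated with one another and equal in variance. The level at the earlier age is then wholly contained in the level at the later age, and the correlation between the two equals the square root of the ratio of the earlier age to the later age. The function name is hypothetical.

```python
from math import sqrt


def overlap_correlation(earlier_age, later_age):
    """Correlation between cumulative levels at two ages when annual
    increments are uncorrelated and equal in variance (simplified model)."""
    return sqrt(earlier_age / later_age)


# Predicting level at age 10 from level at ages 1, 4, and 9.
for age in (1, 4, 9):
    print(age, round(overlap_correlation(age, 10), 2))
# 1 0.32
# 4 0.63
# 9 0.95
```

Even with wholly unrelated increments, then, the correlation with a fixed later age rises steadily as the age at initial testing increases.
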

One of the major distinguishing
features of current psychological testing is
its emphasis upon the differential meas- 
urement of ability. More and more, 
aptitude profiles, which provide a set 
of scores in different abilities, are being 
substituted for such global measures as 
the IQ. An example of a profile is given 
in Fig. 11-7. From many angles, psy- 
chologists have become increasingly 
aware of intra-individual differences in 
intellectual development. The
application of factor analysis to the study of
trait organization has provided the
theoretical basis for such a differential
approach.

Elementary introductions to the logic 
and mathematical techniques of factor 
analysis can be found in Adcock 
(1954), Anastasi (1958a, Chs. 10 and 
11), Fruchter (1954), and Guilford 
(1954, Ch. 16). For the present pur- 
pose, it suffices to note that factor 
analysis identifies traits by an investi- 
gation of the intercorrelations among 
a wide variety of tests. In his pioneer 
work on the nature of intelligence 
Spearman (1927) reported a single 
common factor through all intellectual 
functions, which he called a general 
factor (g). Subsequent research has 
revealed a number of group factors, 
each common to a more limited area 
of activity (Anastasi, 1958a, Ch. 10). 
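
What a single common factor implies for the intercorrelations can be shown in a brief sketch. If each test correlates with the others only through one general factor, the correlation between any two tests equals the product of their loadings on that factor, and all "tetrad differences" among four tests vanish. The loadings and function name below are hypothetical, and the sketch illustrates only this one criterion, not a full factor analysis.

```python
def tetrad_differences(r, i, j, k, l):
    """The three tetrad differences among tests i, j, k, l
    computed from the correlation matrix r."""
    return (
        r[i][j] * r[k][l] - r[i][k] * r[j][l],
        r[i][j] * r[k][l] - r[i][l] * r[j][k],
        r[i][k] * r[j][l] - r[i][l] * r[j][k],
    )


# Hypothetical loadings of four tests on a single general factor.
loadings = [0.9, 0.8, 0.7, 0.6]

# Correlation matrix implied by one common factor: r_ij = g_i * g_j.
r = [
    [1.0 if a == b else loadings[a] * loadings[b] for b in range(4)]
    for a in range(4)
]

# All three differences are zero, apart from rounding error.
print(tetrad_differences(r, 0, 1, 2, 3))
```

Appreciable departures from zero in an observed table of correlations would suggest that something beyond a single common factor, such as a group factor shared by only some of the tests, is at work.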

Fig. 11-7. Aptitude profile of an eighth-grade boy on the Differential Aptitude Tests. The
heights of the vertical bars show how far above or below the norm the subject scores in
each aptitude. (Copyright by The Psychological Corporation.)

The principal effect of factorial
research upon test construction has been
the development of multiple-aptitude
batteries, which yield a set of scores in
relatively independent abilities.
Following their investigations on college
students and school children, the
Thurstones (1938, 1941) prepared the tests
of Primary Mental Abilities. In their
later abbreviated versions these tests
are available for three age levels. At the
youngest level, suitable for ages 5 to 7, 
separate scores are provided for verbal- 
meaning, quantitative, space, percep- 
tual-speed, and motor factors. An illus- 
trative item from each of the tests used 
to measure these factors is shown in 
Fig. 11-8. The 7-to-11-year level in- 
cludes verbal-meaning, number, reason- 
ing, space, and perceptual-speed factors. 
The number and reasoning factors rep- 
resent further differentiation of the ru- 
dimentary quantitative factor appearing 
at the younger level. The highest level, 


designed for ages 11 to 17, comprises 
verbal-meaning, word-fluency, number, 
reasoning, and space factors. ‘Thus there 
are two verbal factors represented at 
this stage, viz., verbal-meaning and 
word-fluency. An associative memory 
factor, included in the earlier, longer 
version of these tests, has been omitted 
from the later forms. Several other
multiple-aptitude batteries are now
available, but they are directed pri-
marily to the high-school level (An-
astasi, 1954, Ch. 14, 1958a, Ch. 11; Cron-
bach, 1960, Ch. 10).

Factorial analyses conducted at dif- 
ferent ages suggest that intelligence 
may be relatively undifferentiated in
early childhood, becoming increasingly
specialized with age. Spearman’s g fac- 
tor would thus be more prominent at 
the younger ages, whereas group fac- 
tors would predominate at the older 
ages. Garrett (1946) formulated such a 
developmental theory on the basis of 
factorial results obtained with different 
age groups. Corroborative data were
likewise provided by both cross-sec- 
tional and longitudinal investigations 
specifically designed to test the age dif- 
ferentiation hypothesis. Since a number 
of subsequent studies have yielded incon-
sistent data, no conclusive statement
regarding age changes in trait organi- 
zation can be made at this time. Nev- 
ertheless, Burt (1954), who proposed 
a similar age-differentiation hypothesis 
as early as 1919, has recently reaffirmed 
this position following an examination 
of later published work. The contra- 
dictory findings of other studies he at- 
tributes to methodological limitations. 
Although Burt offers a maturational in- 
terpretation of the differentiation of 
abilities, there is some evidence to in- 
dicate that educational experiences may 
lead to the development of group fac- 
tors (Anastasi, 1958a, pp. 360-361). 


CULTURAL FACTORS 
IN TEST SCORES 


The literature on cultural differences 
in test performance has attained vast 
proportions (Anastasi, 1960). Even 
within a single nation, large significant 
mean differences in test scores are 
found among socioeconomic classes, 
urban and rural populations, and vari- 
ous ethnic minority groups, such as 
American Negroes and American In-
dians. On tests like the Stanford-Binet
and the WISC, for example, the IQ’s 
of children whose fathers are in the 
professions average about 20 points 
higher than those of children of un- 
skilled laborers (McNemar, 1942; Sea- 
shore, Wesman, and Doppelt, 1950). 
Any group comparison in “intelli- 
gence,” however, must be qualified by 
the fact that the relative performance 
of groups varies with the nature of the 
test. For instance, children from higher 
socioeconomic levels tend to excel more 
in verbal than in spatial and mechanical 
aptitude tests (Eells et al., 1951; Havig- 
hurst and Breese, 1947; Havighurst and 
Janke, 1944; Janke and Havighurst, 
1945). Similarly, the fund of common 
knowledge from which many intelli- 
gence-test items are drawn differs 
among cultures or subcultures. In a 
pioneer study on this question Shim- 
berg (1929) constructed two forms of 
a general information test, one on urban 
and one on rural children. When both 
forms were administered to new sam- 
ples of city and country children, the 
former excelled on the urban version, 
the latter on the rural. Suggestive re- 
sults along the same lines have likewise 
been obtained with tests developed in 
other cultures, such as a Draw-a-Horse 
Test standardized on American Indian 
boys (DuBois, 1929) and a footprint- 
matching test constructed by Porteus 
(1931, p. 401) for Australian aborigines. 

Fig. 11-8. Illustrative items from SRA Primary Mental Abilities Tests for ages 5 to 7.
(Copyright by Science Research Associates.)

VERBAL-MEANING is measured by Vocabulary, Sentence Comprehension,
Sentence Completion, Paragraph Comprehension, and Auditory
Discrimination items. In each type of item, the subject marks the
picture which best fits the examiner's oral statement.
  Vocabulary: Mark the dog.
  Sentence Comprehension: Mark the picture which answers this
  question. In which one does mother carry her money when she goes
  shopping?
  Sentence Completion: If you want to keep something cold you keep
  it in the .......... Mark it.
  Paragraph Comprehension: Mark the picture that goes with this
  story. Betty is hanging the doll's dress on the clothesline.
PERCEPTUAL-SPEED. Mark the kite in the little box. Then mark the
other kite that is just like the one in the little box.
QUANTITATIVE. Three policemen were standing on a corner. One went
home. How many policemen were there then? Mark them.
Mark the largest doll and the smallest doll.
MOTOR. Draw lines connecting top and bottom dots as quickly as
possible. Four samples have already been filled in.
SPACE. Mark the two parts which can be fitted together to make a
square.

A growing trend in the psychological
study of group differences is the shift
from a description of the status quo
to an analysis of change. Several large-
scale surveys of the intelligence test
performance of comparable populations
have revealed significant rises in mean
scores that paralleled improvements in 
socioeconomic and educational condi- 
tions. Such findings are illustrated by 
the Scottish surveys of nearly com- 
plete samples of 11-year-old children 
tested in 1932 and in 1947 (Scottish 
Council, 1949), by Finch’s (1946) study 
of American high-school students over 
a twenty-year period, and by Wheeler's 
(1942) 10-year retests in a rural Ten- 
nessee community that had made un- 
usual progress in social and educational 
conditions during the interval. There is 
also evidence of a progressive diminu- 
tion of the gap between urban and 
rural groups (Anastasi, 1958a, Ch. 15). 

One of the major sources of variation 
among cultural groups is to be found 
in the amount and kind of their formal 
schooling. Educational differences are 
especially conspicuous in the case of 
social classes and ethnic minority 
groups. Until recently, urban-rural dif- 
ferences in this regard also were large. 
Such group differences stem in part 
from economic factors and outside 
pressures on the child’s time and in part 
from more subtle motivational and at- 
titudinal conditions. That the extent 
and nature of schooling that the child 
receives will in turn affect his intelli- 
gence-test performance is now gener- 
ally recognized by psychologists. Evi- 
dence in support of this conclusion may 
be found in the previously cited cross- 
sectional investigations of comparable 
populations as well as in longitudinal 
studies conducted in America (Lorge, 
1945; Owens, 1953) and in Sweden 
(Husén, 1951). 

The testing of children reared in dif- 
ferent cultures or subcultures presents 
many technical difficulties. In the case
of groups speaking different languages 
most available intelligence tests are in- 
applicable. Linguistic deficiency may 
also seriously handicap the perform-
ance of bilingual children on verbal-
type tests. Although under certain con-
ditions the acquisition of a second
language will enrich verbal comprehen-
sion and stimulate intellectual develop-
ment, bilingualism as it occurs among
children of immigrants or other mi-
nority groups often serves to reduce
proficiency in both languages and re-
tard school progress (Anastasi, 1958a,
Ch. 16; Arsenian, 1945; Darcy, 1953). 
On verbal intelligence tests such chil- 
dren average consistently lower than 
monoglots from the same groups. When 
nonlanguage or performance tests are 
substituted, their scores reach or excel 
those of monoglots. It cannot be as- 
sumed, however, that all such intelli- 
gence tests measure the same abilities. 
Most nonlanguage and performance 
tests draw heavily upon spatial visuali- 
zation and perceptual functions, in con- 
trast to the predominantly verbal and 
numerical aptitudes covered by the 
commoner intelligence tests. 
Moreover, persons from certain cul- 
tures or subcultures will be handi- 
capped even on nonlanguage tests be- 
cause of the specific information, skills, 
and work habits required. Each culture 
fosters the development of a different 
pattern of abilities. Tests constructed 
within a particular culture reflect such 
an ability pattern and thus tend to favor 
individuals reared in that culture. To 
meet these additional difficulties, a num- 
ber of so-called culture-free tests have 
been prepared (Anastasi, 1954, pp. 255 ff.).
Current examples include the Leiter
International Performance Scale, the
Cattell Culture-Free Test of Intel-
ligence, the Raven Progressive Matrices,
and the Goodenough Draw-a-Man Test
(for references, cf. Anastasi, 1954).
Sample items from two of these tests
are shown in Figs. 11-9 and 11-10.
A test specifically designed to
eliminate “social class bias” from test
items is the Davis-Eells Games (An-
astasi, 1954, pp. 266-268).

Although no test can be truly free
from cultural influences, it is theoreti-
cally possible to construct a test that
presupposes only the experiences that
are common to different cultures. This
is what available culture-free tests have
tried to accomplish. In actual practice,
however, such tests fall short of their
objective. No existing test is entirely
unrestricted in its cultural reference,
the difference between “culture-free”
and other tests being one of degree. The
mere use of paper and pencil or the
presentation of abstract tasks that have
no immediate practical significance will
favor some cultural groups. Other rele-
vant factors that differ among cultures
or subcultures include extent of fa-
miliarity with pictorial or diagram-
matic representation, rapport with the
examiner (especially if he belongs to a
different race from the testees), test-
taking motivation, drive to excel, and
previously developed problem-solving
attitudes. In this connection it is inter-
esting to note that in a review of over
twenty years of research with the
Draw-a-Man Test, Goodenough and
Harris (1950, p. 399) concluded that
“the search for a culture-free test,
whether of intelligence, artistic ability,
personal-social characteristics, or any
other measurable trait is illusory.”

Fig. 11-9. Typical materials for use in the Leiter International Performance Scale. The
test illustrated is the Analogies Progression Test from the six-year level. Each block must be
inserted in the proper recess. (Courtesy R. G. Leiter.)

Fig. 11-10. Two items from the Progressive Matrices. The subject must choose the missing
part of each design from the six or eight alternatives provided. (Courtesy J. C. Raven.)

Finally, it should be recognized that 
the reduction of cultural differentials 
or the elimination of cultural bias in a 
test may reduce its validity for the pre- 
diction of many important criteria 
(Anastasi, 1950a). Cultural factors in- 
fluence behavior development at many 
levels (Anastasi, 1958b). Not only test 
performance but also broader aspects 
of the child’s intellectual development 
may be fundamentally affected by the 
cultural conditions under which he is 
reared. In other words, the criteria that 
the tests are designed to assess may 
themselves be culturally loaded. Test 


items that exhibit the smallest differ- 
ences between cultural groups are likely 
to measure the more trivial and socially 
less important intellectual functions. It 
should be obvious that the contribution 
of cultural factors to individual de- 
velopment cannot be ruled out by 
eliminating cultural differentials from 
test scores. 

During the last thirty or forty years
there has been an increasing awareness 
among educators, clinicians, psycholo- 
gists, speech therapists, and people in- 
terested in child development of the 
existence in the population of a con- 
siderable number of individuals with 
various degrees of speech handicaps. 
Surveys on the incidence of speech dis- 
orders in the general population, how- 
ever, are not in agreement. Milisen 
(1957) attempted a summary statement 
that gives only approximate estimates. 


From kindergarten through fourth 
grade, roughly 12 to 15 percent of the 
children have seriously defective speech. 
In the next four grades between 4 and 5 
percent are seriously defective. General 
estimates above the eighth grade are based 
on highly selected samples and therefore 
the best guess as to the incidence of speech 
disorders in persons over 14 years of age 
would be about the same as for the upper 
elementary grades—4 to 5 percent (p.
250).

Milisen’s estimates are for the inci-
dence of serious speech handicaps
among school children. His figures do 
not include those cases in which speech 
difficulties are less serious. Moreover, 
the incidence of speech problems 
among handicapped persons, such as the 
mentally retarded, the cerebral palsied, 
and the aphasics, is known to be very 
much higher than among the general 
run of school children. 

The recognition of the striking inci- 
dence of language disturbances in the 
population has forced professional at- 
tention to these problems. Accordingly, 
strenuous efforts are being made to im- 
prove the training of speech therapists 
as well as of research workers. The de-
velopment of enriched therapy pro-
grams and of methods for the evalua-
tion of abnormalities of articulation,
voice, rhythm, and symbolization
is seriously needed.

All this points to the importance of 
the area of language and communica- 
tion as a discipline in its own right. The 
materials presented in this chapter aim 
to provide a summary of the more com- 
monly used research methods that have 
been applied to children in this field of 
investigation. 

The problem of selecting a compre- 
hensive set of categories for classifying 
the numerous methods of research on 
children’s speech and language is not a 
simple one. A number of classifications 
may be feasible. The method of straight 
observation may be compared with one 
in which calibrated instruments are em- 
ployed. A second dimension might be 
in terms of the longitudinal-cross-sec- 
tional approach. A third organization 
might involve control groups as con- 
trasted with a method that does not 
avail itself of this device. Another
might be the single variable versus the 
multiple variable type of study. Obvi- 
ously, any one of these may serve as 
the main category, the others being sub- 
sumed under it. There is no logical rea- 
son for such an arrangement, except 
perhaps personal preference. No matter 
what approach is used, there would 
still be several research techniques that 
could not be subsumed under the 
scheme. Among them would be the 
manner in which the data are recorded, 
the problem of establishing the reli- 
ability of the observer, and the verbal 
test method. 

In the absence of any comprehensive 
principle for organizing the method- 
ology in the speech and language area, 
a simple listing of methods found in the 
literature will be followed. The manner 
of presenting the materials is to explain 
various methods and to describe several 
of the important researches that illus- 
trate each method and to supplement 
them with briefer mention of other 
studies. This treatment is applied in 
turn to the several areas of language 
development, such as infant vocaliza- 
tion, vocabulary measurement, the mas- 
tery of sentence structure, thought unit 
analysis, speech errors, functions of lan- 
guage, and diagnosis and assessment of 
deviant speech. 

Much of what is known about infant 
speech actually is based on research 
with adults. For example, our knowl- 
edge of the placement of the tongue in 
articulation and a great deal of our 
knowledge of speech errors and diffi- 
culties that comes from work with 
adults has been applied to children. The 
approach here is concerned with recent 
methods that have been used in re-
searches on the speech of children.

In the literature of infant vocalization 
there are many scattered reports by 
parents and relatives on the speech of 
single cases. Instances are those by Lö-
bisch (1851), Taine (1877), Hum-
phreys (1880), Schultze (1880), Stern
and Stern (1907), Nice (1915), Hoyer 
and Hoyer (1924). Other examples are 
by Darwin (1877) and Preyer (1889).
Perhaps the most famous report of this
type is that by Shinn (1900). Most of
this work was done in a very casual
and unsystematic manner. It is spotty,
meager, unscientific, and almost entirely 
unconcerned with standardized record- 
ing techniques. Its value is largely pre- 
scientific. Occasionally it throws light 
on where significant problems of early 
speech lie and shows flashes of insight, 
but it reveals little in the way of care- 
ful methodology. Exceptions to these 
strictures must be made to the work of 
Leopold (1939) and of Lewis (1951). 


THE INTERNATIONAL 
PHONETIC ALPHABET 


One of the important requisites for 
the accurate recording of speech sound 
data is the use of a scientific alphabet.
The International Phonetic Alphabet 
was invented for this purpose. The or- 
dinary alphabet is too limited in the 
number of its symbols. It has too many 
discrepancies between its symbols and 
pronunciation that make it unreliable 
for precision work, The international 
alphabet is a set of symbols, each of 
which represents one and only one ele- 
mental speech sound. In a broad tran- 
scription there are about 40 symbols; in 
a narrow transcription others are added
to represent finer shadings of sound. 
Fairbanks’ (1940) arrangement of a 
broad transcription with a few modi- 
fications has proved to be a useful 
instrument for work with American 
infants and children and may be em- 
ployed in analyzing the speech of chil- 
dren of other nationalities. 

Phonemes are elemental speech 
sounds that are classified as vowels and 
consonants. A vowel is a sound that is 
uttered with little or no stoppage of 
the emitted breath stream. A consonant 
is a sound uttered when the breath 
stream is interrupted in some manner. 
Vowels are classified as front, middle, 
and back. The basis for this classifica- 
tion in the main is the position of the 
arch or hump of the tongue. In the 
process of uttering vowels the tongue 
arches or humps up. In the case of 
front vowels the hump is in the front 
part of the tongue. The vowel [i], as 
in feet, is articulated by the front part 
of the tongue and mouth. Back vowels 
are uttered when the hump is located 
in the posterior part of the tongue. The 
sound [u], as in food, is uttered with 
a raised hump in the back of the tongue. 
Likewise, a middle vowel such as [ɝ],
as in bird, is articulated with the hump 
in the middle of the tongue. Diphthongs 
are the rapid blending of two vowel 
sounds, the first stressed and the second 
unstressed, so that together they seem 
to be a different single sound. There 
are in broad transcription two classes 
of diphthongs, [u] and [i]. The basis 
of classification usually is the second 
unstressed vowel of the pair. An ex- 
ample of the [u] diphthong occurs in 
the word toe. The two-vowel sounds 
in the word sail illustrate the [i] diph-
thong.

Consonants are categorized in several 
ways. One classification is in terms of 
voiced and voiceless sounds. A voiced 
sound is caused by the air stream setting 
the vocal cords into vibration. In a 
voiceless sound the cords are not vi- 
brated. For instance [b], as in ebb, is 
voiced, whereas [p], as in step, is voice- 
less. Another set of consonantal cate- 
gories depends on the place or the parts 
of the mouth involved in the articula- 
tion. The categories are labials (the two 
lips articulating together), labio-dentals 
(upper teeth on lower lip), lingua- 
dentals (tongue tip touching gum 
ridge), post dentals (tongue tip point- 
ing toward but not touching the gums 
and teeth), velars (back of tongue 
touching the soft palate), and glottals 
(sudden opening of the glottis). A third 
classification is based on the manner of 
articulation, depending on how the 
breath stream is acted upon. The cate- 
gories are nasals (resonance in nasal 
passages), plosives (sudden release of 
air pressure in the air tract), semivowels 
(a sound partaking of the nature of 
both vowel and consonant quality), 
fricatives (air stream forced through 
a narrow opening), and glides (transi- 
tion from one sound to another). There 
is a large array of consonant blends. 
They are classed as double consonant 
blends, such as [pl] in place, and as 
triple consonant blends, such as [spl] 
in splash. 

Both the vowels and the consonants 
in their various subdivisions appear in 
the table which follows. A word should 
be said for inclusion of the voiceless
stop-plosive [ʔ]. It is a glottal stop. In
the German language this sound is
added in the initial position of open- 
vowel words. It is rarely used in adult 
English speech, but it is almost univer- 
sally found among the sounds of infants. 
It is for this reason that it is included 
here. Presented is a broad transcription 
of the International Phonetic Alphabet 
(IPA), with key words adapted from 
Fairbanks (1940). 


INTERNATIONAL PHONETIC ALPHABET


Vowel Symbols

Front          Middle         Back
[i] seat       [a] ask        [ɑ] calm
[ɪ] sit        [ʌ] hut        [ɒ] hot
[e] bait       [ə] above      [ɔ] tall
[ɛ] bet        [ɝ] bird       [o] coke
[æ] bat                       [ʊ] full
                              [u] tool

Diphthongs

U Diphthongs   I Diphthongs
[oʊ] oak       [eɪ] hail
[aʊ] cow       [aɪ] eye
               [ɔɪ] soil


Consonant Articulation

               Nasal    Stop-plosive        Semivowel   Fricative           Glide
               Voiced   Voiceless  Voiced   Voiced      Voiceless  Voiced   Voiceless  Voiced
Labial         [m]      [p]        [b]                                      [ʍ]        [w]
Labio-dental                                            [f]        [v]
Lingua-dental                                           [θ]        [ð]
Post-dental    [n]      [t]        [d]      [r]         [s]        [z]                 [j]
                                            [l]         [ʃ]        [ʒ]
Velar          [ŋ]      [k]        [g]                  [x]        [ç]
Glottal                 [ʔ]                             [h]


Among the workers who have em-
ployed the International Alphabet to
transcribe infant speech are Wellman,
Case, Mengert, and Bradbury (1931),
Gregoire (1937), Krehbiel (1941),
Irwin and Curry (1941), Irwin (1941),
Chen (1942), Lewis (1951), Fisichelli
(1950), Spiker (1951), and Catalano
and McCarthy (1954). Lynip (1951) is
critical of the International Phonetic
Alphabet on the grounds that it is too
dependent upon the vagaries of the hu-
man ear. He holds that the use of this
alphabet by an observer is too subjec-
tive a procedure and that it is difficult
to obtain reliable data with it.


OBSERVER RELIABILITY


The problem of the reliability of the
individual who observes and records
the speech of infants is of paramount
importance because an investigation
stands or falls on the ability of the ob-
server to transcribe sounds accurately.
Establishing the reliability of the hu-
man recorder is analogous to the cali-
bration of an instrument. Frequently
the transcription must be recorded live,
that is, with paper and pencil, depend-
ing on where the subjects are found
and on their ages. The presence of a
tape recorder or any mechanical ap-
paratus frequently inhibits speech pro-
duction in young children. At other
times a record can be made on tape
with an electrical recorder. In either
case the reliability is measured by the
percentage of agreement between two
or more trained listeners, each of whom
records the child’s vocalizations simul-
taneously and independently. A similar
procedure is followed for establishing
observer reliability when tape recorded
sounds are listened to and transcribed.

A number of studies on observer re- 
liability in which live transcriptions 
were used have been reported by Irwin 
and his associates. Among them are 
Irwin and Curry (1941), Irwin and 
Chen (1941), Krehbiel (1941), Irwin 
(1945), Spiker (1951), McCurry and 
Irwin (1953). Among studies establish- 
ing reliability in hearing sounds from 
a tape record are those by Fisichelli 
(1950), by Flores and Irwin (1955), by 
Irwin (1959). In general, the percent- 
ages of agreement between two ob- 
servers in these studies vary from about 
70 to 100 and average about 90. This
means that when this method is used 
the error amounts to about 10 per cent. 
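
The percentage-of-agreement index can be illustrated with a minimal sketch. It assumes that the two transcriptions have already been aligned sound by sound, which the studies cited do not describe in detail; the function name and the sample records are hypothetical and serve only as illustration.

```python
def percent_agreement(record_a, record_b):
    """Percentage of aligned positions on which two transcriptions agree.

    Assumes both observers recorded the same number of sounds and that
    the sounds are already aligned position by position.
    """
    if len(record_a) != len(record_b):
        raise ValueError("records must cover the same number of sounds")
    if not record_a:
        raise ValueError("records must not be empty")
    matches = sum(1 for a, b in zip(record_a, record_b) if a == b)
    return 100.0 * matches / len(record_a)


# Hypothetical records: the observers differ only on the fourth sound.
observer_1 = ["a", "m", "ʌ", "b", "i", "ʔ", "ɛ", "d", "u", "h"]
observer_2 = ["a", "m", "ʌ", "p", "i", "ʔ", "ɛ", "d", "u", "h"]

agreement = percent_agreement(observer_1, observer_2)
print(f"agreement {agreement:.0f} per cent, disagreement {100 - agreement:.0f} per cent")
```

In this invented record nine of the ten sounds agree, so the index is 90 per cent and the disagreement 10 per cent, the same order of magnitude as the averages reported above.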

The employment of electric record- 
ers assures a permanent record and en- 
ables the observer to read and check 
the tape record repeatedly and thus to 
safeguard the observations. In the field 
of infant work the leaders in its use 
were Fisichelli (1950) and Catalano and 
McCarthy (1954). Irwin (1956a,b,c, 
1957b,c) has made extensive use of tape 
recording to establish observer relia- 
bility. 


CROSS-SECTIONAL 
AND LONGITUDINAL METHODS 


Essentially, the cross-sectional and 
longitudinal methods consist of an 
analysis of age trends. The distinctive 
feature of the cross-sectional method
is that a different sample of subjects is 
used at each of several age levels. In 
contrast, the longitudinal approach uses 
the same sample of subjects at several 
age levels. 

The advantage of the cross-sectional 
method is the relative economy in se- 
curing data. For instance, the researcher 
who wishes to study the speech errors 
of children in grades one through six 
can test the available children in each 
of the grades. The administration of 
tests can be completed in shorter time 
than if the subjects in grade one were 
followed year by year until they had 
reached the sixth grade. On the other 
hand, something may be lost because 
of the lack of continuity of the sub- 
jects. Extraneous factors, such as varia- 
tion from classroom to classroom, dif- 
ferent teaching methods, and different 
school neighborhoods, may affect the 
results and might be better controlled 
by following through with the same 
sample of subjects and teachers. How- 
ever, individual trends cannot be fol- 
lowed with the cross-sectional methods. 
This, on the other hand, is the great 
advantage of the longitudinal approach, 
but the time factor is a practical diffi- 
culty in its use. The time needed to 
complete a longitudinal study is ordi- 
narily almost prohibitive. 

A careful research that illustrates the 
cross-sectional method of studying the
phonetic development of infants is
Fisichelli’s (1950). The subjects were 
100 orphanage infants, 20 in each of 
five age groups, 6, 9, 12, and 15 months. 
A microphone was suspended over the 
child’s crib or play pen and a warm-up 
period of twenty minutes was given to 
allow the subject to become accus-
tomed to the situation. The recorder
was then turned on for a half hour. A 
minimum of 30 phonemes during the 
half hour was recorded on the tape. 
Other nonverbal behavior was noted in 
half-minute intervals during the period. 
The data were transcribed from the 
tape by two phoneticians in the Inter- 
national Phonetic Alphabet. Observer 
reliability was determined, the values 
ranging from 86 to 95 per cent agree-
ment, most of which exceeded 90 per
cent. The reliability of the data was
also determined by correlating the odd
versus the even sounds in the record.
These correlations varied from .82 to
.99 for the five age groups. There was
a regular progression of scores with in-
creasing age.
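
The odd-even check can be sketched as follows. The sketch assumes that each infant's record is scored by counting consonants, first among the odd-numbered sounds and then among the even-numbered sounds, and that the two sets of scores are correlated across infants. The scoring rule, the vowel set, and the records are hypothetical; the scoring actually used in the study is not described here.

```python
from math import sqrt

# Hypothetical vowel inventory, used only to tell consonants from vowels.
VOWELS = {"a", "ʌ", "ɛ", "i", "u", "ə"}


def odd_even_scores(sounds):
    """Count consonants among odd-numbered and even-numbered sounds."""
    odd = sum(1 for s in sounds[0::2] if s not in VOWELS)   # 1st, 3rd, 5th, ...
    even = sum(1 for s in sounds[1::2] if s not in VOWELS)  # 2nd, 4th, 6th, ...
    return odd, even


def pearson_r(xs, ys):
    """Product-moment correlation between two equally long score lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


# Invented records, one list of transcribed sounds per infant.
records = [
    ["a", "ʌ", "ɛ", "a", "h", "ʌ"],
    ["m", "a", "ʌ", "b", "a", "ɛ"],
    ["b", "d", "a", "m", "g", "ɛ"],
    ["d", "g", "m", "b", "h", "k"],
]

pairs = [odd_even_scores(r) for r in records]
odd_scores = [p[0] for p in pairs]
even_scores = [p[1] for p in pairs]
r = pearson_r(odd_scores, even_scores)
print(f"odd-even correlation: {r:.2f}")
```

A high coefficient indicates that the two halves of the record rank the infants in much the same order.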

The cross-sectional method is also il- 
lustrated in the classical work of Smith 
(1926), a study of the vocabulary and 
sentence structure of children 8 months 
to 6 years old. Her main interest was 
in the growth of these two language 
skills with increasing age. She did not 
use the International Phonetic Alphabet 
but devised a shorthand of her own. 
The cross-sectional method was also 
employed by Wellman, Case, Mengert, 
and Bradbury (1931) in a study of 
phonetic development of children 2 to 
6 years old. These workers transcribed 
the data live in the International Pho- 
netic Alphabet. The children’s vocaliza- 
tions were elicited by the presentation 
of toys and pictures and by questions. 
The sounds, 133 in all, 66 consonants, 
48 consonant blends, 15 vowels, and 4 
diphthongs were tested. The subjects 
were children in the preschool labora-
tories of the Iowa Child Welfare Re-
search Station, and the average IQ was
115.9 ± 15.4. Test reliability was deter-
mined by the split-half method on 123
sounds by 57 children. The correlation
was .96 ± .01. The range of item diffi-
culty for consonants calculated by the
number of children passing the items
was 40.3 to 98.5 per cent. For the
blends, it was 27.4 to 82.7, for vowels,
78.2 to 95.9, and for diphthongs it was
80.8 to 84.4 per cent. The validity of the
test was not reported.
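
Item difficulty expressed as the percentage of children passing each item can be sketched in a few lines. The items and pass-fail records below are invented for illustration and are not data from the study.

```python
def item_difficulty(responses):
    """Percentage of children passing each item.

    `responses` maps an item label to a list of booleans, one per child,
    where True means the child produced the sound correctly.
    """
    return {
        item: 100.0 * sum(passed) / len(passed)
        for item, passed in responses.items()
    }


# Hypothetical results for four children on three items.
responses = {
    "[m]": [True, True, True, True],
    "[s]": [True, False, True, False],
    "[spl]": [False, False, True, False],
}

difficulty = item_difficulty(responses)
for item, pct in difficulty.items():
    print(f"{item}: {pct:.1f} per cent passing")
```

A lower percentage marks a more difficult item, so the range of percentages over a class of sounds, such as the blends, summarizes how widely the items in that class differ in difficulty.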

Williams and McFarland (1937) and 
his students revised the Wellman test 
by reducing the number of items from 
133 to 98. He selected the most fre-
quently used sounds in the International
Kindergarten Union Test and the first
2500 words of the Thorndike list. His
test was then given in two parts, Form
I and Form II. The responses were ob-
tained by means of pictures and by
having the subjects repeat the words
after the experimenter. Using 65 chil-
dren of the university and hospital staff
at the University of Michigan, Poole
(1934) studied their phonetic responses 
to pictures, objects, and questions. Her 
purpose was to determine the order of 
difficulty of consonants. She reported 
that the first sounds were labials, a re- 
sult that later work has not substanti- 
ated (Winitz and Irwin, 1958). 

An example of the use of the longi- 
tudinal approach to speech develop- 
ment is a study by Chen (1946). Its 
purpose was to determine the course 
of mastery of the elemental speech 
sounds during the first year of life. The 
measures used were the number of 
phonemes, designated phoneme types, 
and the number of repeated sounds, 
called phoneme frequency. The sub-
jects were 47 boys and 48 girls, chil-
dren of laborers, skilled workers, clerks,
businessmen, lawyers, physicians, and
college teachers. About half of them
were classified in the category of pro-
fessional families, according to the Min-
nesota Occupational Scale. The infants
were visited in their homes twice a
month, at which time a sample of spon-
taneously produced sounds was re-
corded live. The broad transcription of
the International Phonetic Alphabet ac-
cording to Fairbanks (1940) was used.

The behavioral unit method was used
by Chen to observe the sounds. This 
was an innovation by Irwin and Curry 
(1941). With this device, the unit of 
observation is the single respiration of 
the infant, instead of a time unit. 
Sounds uttered on the exhalation phase 
of a breath are noted and recorded by 
the observer. The advantage of the 
method is that the number of sounds 
vocalized by the infant on a single 
breath lies within the ordinary attention 
span of the observer. 

The sounds carried on 30 breaths 
constituted the sample at each visit. 
Observer reliability varied from 72 to 
91 per cent. The reliability of the data
was determined by correlating the odd 
versus the even sounds in the record. 
For the period of one to twelve months, 
the correlation was .97 for vowels and 
.90 for consonants. A further check was
made by comparing the sounds carried 
on the first 30 respirations with those 
on the second 30. Fifty-seven per cent 
of the correlations exceeded .70. A spe- 
cial problem for the observer arose 
when the infant vocalized rapidly on 
consecutive breaths. The method of 
solving this problem was as follows.
Two observers recorded live the sounds
of a group of infants on the consecu- 
tive, alternate, and fourth exhalations 
when the subjects were producing 
sounds rapidly. The distribution of
phoneme frequencies was similar for 
the three samples. Therefore, if the ob- 
server occasionally failed to record 
sounds on the consecutive breath and 
took the sounds on a subsequent breath, 
the difference would not be great 
enough to distort the result. Irwin and 
Chen (1946) and Irwin (1947b,c), us- 
ing the same procedures, have extended 
the investigation to cover the age range 
from the first to the thirtieth month 
and have reported the findings in terms 
of phoneme type and phoneme fre- 
quency. 
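The odd-even check on the data described above can be sketched as follows. This is a minimal illustration in Python, not Chen's actual computation: it assumes each infant's record is an ordered list of transcribed sounds, splits it into odd- and even-numbered sounds, counts the phoneme types in each half, and correlates the two counts across infants. The function names and the sample records are hypothetical.

```python
from math import sqrt


def split_odd_even(record):
    """Split an ordered list of sounds into odd- and even-numbered halves."""
    return record[0::2], record[1::2]


def phoneme_types(sounds):
    """Number of different phonemes (phoneme types) in a list of sounds."""
    return len(set(sounds))


def pearson_r(x, y):
    """Product-moment correlation between two equal-length lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


# Hypothetical records for four infants (not data from the study).
records = [
    ["a", "a", "e", "a"],
    ["a", "e", "a", "e", "h", "a"],
    ["a", "h", "e", "g", "u", "e", "a", "k"],
    ["e", "a", "e", "a"],
]

odd_counts, even_counts = [], []
for record in records:
    odd, even = split_odd_even(record)
    odd_counts.append(phoneme_types(odd))
    even_counts.append(phoneme_types(even))

# Correlation between the odd-half and even-half phoneme-type counts.
print(round(pearson_r(odd_counts, even_counts), 2))
```

The same functions would serve for the comparison of the first 30 respirations with the second 30, with the record split by breath rather than by alternate sounds.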

Other studies that utilize the longi- 
tudinal method are by Krehbiel (1941) 
and Anderson (1942). Shirley (1933) 
jotted down the vocalizations of 25 
babies during the first two years of life.
A phonetic alphabet was not used to 
record the sounds, the earliest of which 
were described as coos and grunts. She 
also recorded comprehensible words in 
terms of length, that is, one-, two-, and 
three-syllable words. 


THE CONTROL GROUP 
OR COMPARATIVE METHOD 


A technique for studying a variety 
of developmental processes is the com-
parative method. The control may be a
specially selected group, or it may be 
constituted by data from another re-
search. The matching may be by
groups, or it may be by individuals on 
the basis of external criteria, such as 
age, mental status, economic back-
ground, or education. The method may
be used to provide a basis of compari- 
son of the speech of the sexes, of age 
groups, of normal and of various types 
of speech-deviate children. Carrow
(1957) has studied the linguistic func- 
tioning of bilingual and monolingual 
children. The two groups were similar 
in regard to age, grade, socioeconomic 
status, and intelligence. Several tests of 
language skills were administered, 
namely, silent reading (vocabulary and 
comprehension), oral reading (accu- 
racy, comprehension, and rate), hearing 
vocabulary, spelling, and arithmetic rea- 
soning. Comparison between the two 
groups was made with regard to articu- 
lation, number of words used, length 
of clause, degree of subordination, com- 
plexity of sentence structure, and the 
number and type of grammatical errors. 

A careful and valuable study, com- 
paring four psychological scaling meth- 
ods of articulation defectiveness, was 
done by Sherman and Moodie (1957). 
The scaling methods were equal-appear- 
ing intervals, successive intervals, paired 
comparisons, and the method of con- 
stant sums. The materials used for the 
four methods of scaling articulation de- 
fectiveness consisted of five-second ex- 
cerpts from the continuous speech of 
children 5 to 10 years old. The excerpts 
ranged from normal to severely defec-
tive speech. Records of the samples
were played back to 350 listeners who 
judged the quality of articulation in 
terms of the four scaling methods. The 
method of equal-appearing intervals 
seems to be the most useful for scaling 
defective articulation. 

An interesting and suggestive variant 
of the comparative method is Templin’s
(1957) terminal status technique. She
presents terminal status measures for
each of four major language areas: ar- 
ticulation of speech sounds, speech- 
sound discrimination, sentence struc- 
ture, and vocabulary. Separate scores 
are presented by age levels for boys 
and girls and for children in the upper 
and lower status groups. For each of 
the four language areas, the mean score 
of the 8-year-old group was taken as 
the terminal status measure. The per- 
centage of the value of this mean score 
for each younger age level was calcu- 
lated. Thus, with the mean score of 
the 8-year-old subjects as the terminal 
status measure, comparisons were made 
of the development of articulation, sen- 
tence structure, sound discrimination, 
and vocabulary. Chen’s (1946) differ- 
ential percentage index is a similar de- 
vice, which uses adult speech status as 
a basis of comparison for the speech 
of infants. 
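The terminal status computation amounts to expressing each younger age level's mean score as a percentage of the 8-year-old mean. A minimal sketch in Python follows; the function name and the mean scores are hypothetical and are not Templin's data.

```python
def percent_of_terminal_status(mean_scores, terminal_age=8):
    """Express each age level's mean score as a percentage of the
    mean score at the terminal age (here assumed to be 8 years)."""
    terminal = mean_scores[terminal_age]
    return {
        age: 100.0 * score / terminal
        for age, score in sorted(mean_scores.items())
    }


# Hypothetical mean articulation scores by age in years.
articulation_means = {3: 20.0, 5: 35.0, 8: 50.0}

for age, pct in percent_of_terminal_status(articulation_means).items():
    print(age, pct)
```

Because every language area is rescaled to the same 0-to-100 range, the percentages for articulation, sentence structure, sound discrimination, and vocabulary can be set side by side even though the raw scores are in different units.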

A careful investigation was con- 
ducted by Fisichelli (1950) who com- 
pared the speech-sound status of 100 
orphanage babies with that of a group 
of Chen’s (1946) subjects as controls. 
These orphanage children showed a 
paucity of speech-sound mastery in
comparison to children living in their 
own homes. Brodbeck and Irwin 
(1946), in a similar job with 95 orphan- 
age babies under 6 months of age, com- 
pared their phonetic output with that 
of infants of the same age range living 
in their own homes. Curves of speech 
sounds of the orphanage children were 
significantly lower than those of the 
control group. Carr (1953) published 
an investigation of the speech-sound 
status of fifty 5-year-old deaf-born
children, with that of hearing infants
as controls. The former were greatly
inferior to the controls. 

An instance of the combination of 
the comparative and longitudinal meth- 
ods is a study by Irwin (1958a,b,c). In 
order to increase the amount of speech- 
sound stimulation in the homes of 
working-class people, the mothers of a 
group of children 13 to 30 months of 
age were instructed to read to them 
daily for a period of fifteen minutes 
from illustrated baby books. The moth- 
ers pointed out objects in the pictures, 
named them, made up stories about
them, and talked frequently to the chil-
dren. A second group of infants of
comparable ages in working-class homes 
in which no systematic stimulation oc- 
curred served as the control. The dif- 
ferences in amount of phoneme fre- 
quency between the two groups after 
the eighteenth month were significantly 
in favor of the experimental group. The 
study is an illustration of the combina- 
tion of the control or comparative 
method with the longitudinal approach. 
It also illustrates two limitations of the 
longitudinal method. Five years were 
required to collect the data, and during 
that time a number of subjects dropped 
out because their families moved to 
other communities. 

The use of the comparative approach 
to speech behavior in children has been 
productive of a large body of new in- 
formation about the speech of both 
normals and deviants. With proper in- 
strumentation and appropriate statisti- 
cal controls, the comparative method 
undoubtedly is the most reliable and 
precise of all the methods discussed in 
this chapter. Instances of the compara-
tive method with the use of instruments
are those by Travis (1937) employing 
the brain-wave technique to compare 
stutterers and nonstutterers. Perlstein, 
Gibbs, and Gibbs (1953) have done an 
exhaustive research on the electroen- 
cephalogram in 1500 cases of infantile 
cerebral palsy. Williams (1953) used
electromyography to study the action 
currents in the masseter muscles of 
stutterers and nonstutterers. 

An especially effective comparative 
method uses the same subjects as con- 
trols. A test measuring some aspect of 
speech defectiveness is administered to 
a group of subjects before a course of 
treatment is applied, a set of instruc- 
tions given, or a remedial program is 
undertaken. Following the treatment, 
the test is again administered. Oxtoby
(Branscom et al., 1956) measured the 
effect of instructions to stutterers to 
prevent, modify, or replace their cus- 
tomary patterns of reaction to expect- 
ancy of stuttering. With stutterers a 
method of measurement is to count the 
number of stuttered words in a selected 
passage read by the subject. In this 
study the subject was required to read 
a 200-word passage aloud five times in 
succession. Jones (1956) investigated 
the effect of clinical instruction for pe- 
riods varying from two weeks to two 
years. He used five 250-word factual 
reading passages. Another test was ad- 
ministered after an interval during 
which spontaneous recovery of stutter-
ing had occurred.

In the area of verbal learning in chil- 
dren a comparative method now being 
used with the same subjects is based on 
the principle of mediated generalization. 
It is applicable to the phenomenon of
associative transfer in paired-associate
learning. In the paired-associate method 
the child is presented with a series of 
pairs of words, the first word of each 
pair being the stimulus and the second, 
the response. The child is instructed to 
associate each response with its stimulus 
so that when he sees the first word 
he can say the second. For instance, if 
the pair is “cake” and “boat,” he would 
say boat when he sees the word cake. 
Usually, in an experiment, a list of pairs,
perhaps 6 or as many as 15, is pre-
sented. On the first run through the
list the child observes the pairs and tries 
to associate each response with its stimu- 
lus. On the second trial the stimulus is 
presented first, and the response word 
is presented following an interval of 
several seconds. This is called the 
anticipation interval. The subject is re- 
quired to call out the response during 
this period. If he succeeds, it is recorded 
as correct, but if the response word is 
presented before he is able to say it 
an error or omission is recorded. The 
pairs are presented in a different order 
on each trial. It is necessary to use two
or more lists of word pairs to study 
transfer effect. 
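The scoring of an anticipation trial can be sketched in Python as below. This is an illustration under stated assumptions, not a description of any particular apparatus: the 2-second anticipation interval, the word pairs other than "cake" and "boat," and the split of failures into "error" (wrong word in time) and "omission" (nothing said in time) are all hypothetical choices.

```python
import random


def presentation_order(pairs, trial_number, seed=0):
    """Reshuffle the pairs for a trial. Seeded so a session can be
    repeated; reshuffling does not guarantee a new order on very
    short lists."""
    rng = random.Random(seed + trial_number)
    order = list(pairs)
    rng.shuffle(order)
    return order


def score_item(correct_response, said, latency_s, interval_s):
    """Score one anticipation: the child must say the response word
    before the anticipation interval runs out."""
    if said is None or latency_s is None or latency_s > interval_s:
        return "omission"
    return "correct" if said == correct_response else "error"


def score_trial(pairs, responses, interval_s=2.0):
    """responses maps each stimulus to (word said or None, latency in s)."""
    return {
        stimulus: score_item(correct, *responses.get(stimulus, (None, None)),
                             interval_s)
        for stimulus, correct in pairs
    }


pairs = [("cake", "boat"), ("tree", "shoe"), ("milk", "door")]
responses = {
    "cake": ("boat", 1.2),   # right word, in time
    "tree": ("door", 0.9),   # wrong word, in time
    "milk": ("door", 3.5),   # right word, but too late
}
print(score_trial(presentation_order(pairs, trial_number=2), responses))
```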

In this connection, it is necessary 
also to define the concept of generaliza- 
tion. When a stimulus A comes to evoke 
response B, stimuli similar to A, such 
as A1, A2, likewise tend to elicit B.
This is stimulus generalization. When
stimulus A is able to evoke response B,
it will likewise elicit similar responses,
B1, B2, etc. This is response generaliza-
tion. A third form of generalization is
known as the phenomenon of associa-
tive transfer and is used in verbal paired-
associate learning with children. The
principle involved is that a response to
a stimulus in turn becomes a stimulus to
another response. The design calls for
two preliminary lists of paired words
and a third test list administered in an
A-B, B-C, A-C sequence. That is, B is
the response to A, C is the response to
B, and C in the test list is the response
to A. It will be noted that B is in the
intermediate position and serves a
mediating function in the transfer from
A to C. The degree of associative trans-
fer is tested by administering the A-C
list. A further variant of the design may
be made by giving a D-C list of words
to serve as a control. The D list is a
new list.
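The structure of the lists in this design can be made concrete with a short Python sketch. The function name and all the words other than "cake" and "boat" are hypothetical; the sketch shows only how the four lists are related, not how the words should be chosen.

```python
def build_transfer_lists(a_words, b_words, c_words, d_words):
    """Build the paired-associate lists for the A-B, B-C, A-C design,
    plus a D-C control list whose stimuli are new words."""
    if not (len(a_words) == len(b_words) == len(c_words) == len(d_words)):
        raise ValueError("all word lists must have the same length")
    return {
        "A-B": list(zip(a_words, b_words)),  # first preliminary list
        "B-C": list(zip(b_words, c_words)),  # second preliminary list
        "A-C": list(zip(a_words, c_words)),  # test list, mediated by B
        "D-C": list(zip(d_words, c_words)),  # control list, no mediator
    }


lists = build_transfer_lists(
    a_words=["cake", "tree"],
    b_words=["boat", "shoe"],
    c_words=["lamp", "door"],
    d_words=["fork", "wall"],
)
for name, word_pairs in lists.items():
    print(name, word_pairs)
```

In the A-C list each stimulus has already been tied to its response through the intermediate B word, whereas in the D-C list no such chain exists; comparing learning on the two lists is what allows the mediating effect to be separated from general practice.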

The lists are presented visually by
means of a card master, which consists
of two parts. One part is a metal box
containing a bin in which a stack of
cards with paired words is placed and
a conveyer system that delivers the 
cards one at a time to an aperture. Here 
the card is exposed for a given period, 
the anticipation interval. The second 
part of the apparatus is a system of 
electric timers that controls the rate of 
presenting the cards at the aperture as 
well as the exposure time. A more 
detailed description of the apparatus and 
the results of experiments with children 
may be found in the chapter on Re- 
search Methods on Children’s Learning 
by Dr. Spiker (see Chapter 9). Here 
only the general principles of paired- 
associate learning have been explained. 
Two carefully designed studies in this 
field, with children as subjects, are those 
of Norcross and Spiker (1958) and 
Spiker (1958). 

The particular advantage of using the 
same subjects in a test-retest situation
is that the method not only provides
information about the progress in re-
habilitation made by the group as a
whole, but it reveals specific informa-
tion about each individual composing
the group. In statistical language, it
provides a within-group analysis. Ac-
cordingly, the method is of value to
the speech therapist working with a
single child. It assumes, of course, the
availability of a reliable and valid
measuring instrument.


INSTRUMENTAL METHODS 


A number of mechanical and electri-
cal devices has been used to analyze
various aspects of speech, although not
all have found extensive use with chil- 
dren. An attempt will be made to 
describe them as nontechnically as 
possible. For the technical details of 
their construction and operation, the 
reader is referred to a comprehensive 
treatment with numerous diagrams and 
pictures in Handbook of Speech 
Pathology edited by Travis (1957). 

A widely used device for recording 
speech sounds is the magnetic tape re- 
corder. The entire unit consists of a 
microphone, a coil, an amplifier, and a 
loud-speaker. A tape treated with 
magnetic oxide of iron upon which a 
pattern is impressed is passed through 
the coil. The passage through the coil 
produces a series of electrical impulses, 
which, upon amplification, are con- 
verted by a loud-speaker into sounds. 
Thus, by means of the recorder, sound 
is translated into electricity and then 
converted back into sound. Its ad- 
vantages are ease and fidelity of record-
ing and permanence of the record. The
unit is somewhat heavy and cumber-
some for transportation, although light-
weight recorders are being manu-
factured.

For the purpose of detecting struc- 
tural abnormalities of the lips, teeth, 
palate, pharynx, larynx, and nasal pas- 
sages, several instruments are available. 
These include tongue blades, mirrors, or 
laryngoscopes, which combine a mirror 
and a light source, and the nasal 
speculum, a device for distending the 
nostrils. There are several devices for 
measuring speech-sound intensity. The 
deflections of the pointer on an oscil- 
lograph may be photographed on film 
on which decibel lines are traced. The 
fundamental frequency of the human 
voice has been investigated with the 
phonophonelescope. This consists of a 
motor-driven turntable film-drum, a 
phonograph recording and pick-up 
heads, an amplifying unit, and an optical 
system. Fairbanks (1940) investigated 
pitch of children’s voices with the
phonophonelescope.

The Bell Telephone Laboratories 
have developed the sound spectrograph 
for use in recording and analyzing 
speech sounds. It has had considerable 
use with adult subjects, but work with 
infants and children is in an early stage. 
It provides a highly promising tech- 
nique for the acoustic analysis of the 
phonetic equipment of young subjects. 
The instrument is a device for con- 
verting auditory into visible records. A 
technical description of recent refine- 
ments of the sound spectrograph is 
given by Prestigiacomo (1957). The
measurements are in terms of the fre-
quency, intensity, and duration charac-
teristics of sound. It is important, how-
ever, to recognize that the instrument
yields acoustic rather than phonetic or 
linguistic measures. A necessary pro- 
cedure for the interpretation of these 
measures is the identification by trained 
listeners of the sounds uttered by the 
subjects. 

Essentially, the apparatus consists of 
a means of recording magnetically a 
sample of speech sounds to be analyzed. 
The analysis of sound samples is made 
by a heterodyne wave analyzer. The 
resulting data are plotted with fre-
quency on the vertical axis and time on
the abscissa. The intensity of the sound 
is indicated by the darkness of the 
marking. Another plot, often referred 
to as a cross-sectional picture, has ampli- 
tude in decibels on the ordinate and 
frequency on the abscissa. This plot 
appears as a series of harmonics and 
inharmonics, and the peaks or points 
of energy concentration in the spectrum 
are known as formants. They are usually 
numbered according to their order in 
a frequency scale, from low to high as 
formant one, formant two, etc. 
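The numbering of formants from a cross-sectional plot can be illustrated with a small Python sketch. It assumes, as a simplification, that the section has already been smoothed into an envelope sampled as (frequency, amplitude) points, so that each local maximum is a point of energy concentration rather than a single harmonic. The function name and the sample values are hypothetical.

```python
def number_formants(envelope):
    """envelope: list of (frequency_hz, amplitude_db) points sorted by
    frequency. Returns the local amplitude maxima, numbered from low to
    high frequency as F1, F2, and so on."""
    peaks = []
    for i in range(1, len(envelope) - 1):
        freq, amp = envelope[i]
        if amp > envelope[i - 1][1] and amp > envelope[i + 1][1]:
            peaks.append(freq)
    return {f"F{n}": freq for n, freq in enumerate(peaks, start=1)}


# Hypothetical smoothed section of a vowel (not measured data).
section = [
    (250, 20), (500, 30), (750, 38), (1000, 31),
    (1250, 25), (1500, 29), (1750, 22),
]
print(number_formants(section))
```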

It has been suggested that it is im- 
portant to distinguish auditory and 
acoustic phenomena and that the final 
interpretation of the physical properties 
of sounds depends upon their auditory 
identification by skilled listeners. This 
is but an illustration of the fact that all 
scientific findings by means of an ap- 
paratus, no matter how precise or re- 
fined it may be, in the last analysis 
must be checked and identified in terms 
of the sense data of an observer. Like- 
wise, the formants, for their valid in- 
terpretation, must be correlated and 
identified with the phonemes they are 
supposed to represent acoustically. This
is the procedure followed by workers
in this area of investigation. K 
(1952) plotted formant one versus 
formant two of the same vowel si 


uttered by a man, a woman, and a 
child. He found that the frequencies for 
similar vowels produced by the three 
individuals were not alike but that
nevertheless the speech was intelligible.
G. E. Peterson obtained the same re- 
sult. A study by Peterson and Barney 
(1952) is pertinent to this discussion. 
They used 33 men, 28 women, and 15
children to correlate acoustic measure- 
ments and auditory identifications of 
ten vowel sounds. The average correct 
recognition of the ten vowels amounted 
to 79 per cent. Thus the identification 
of sounds uttered by vocal cavities of
different sizes and shapes is not an
impossible task. Liberman, Harris, 
Hoffman, and Griffith (1957) have 
demonstrated that listeners will identify 
a given phoneme, for example [b], al- 
though the spectrogram exhibited 
several acoustically different patterns. 
The listeners perceived and divided a 
stimulus continuum including [b], [d], 
and [g] into three sharply bounded 
phonemic categories. For a study on 
some of the factors in vowel identifica- 
tion, see Tiffany (1953). The matter of 
reduction to the sense data of an ob- 
server has been stressed because of its 
importance in interpreting spectro- 
graphic work with infants and children. 

Two studies, one by Lynip (1951) 
and the other by Winitz (1958) have 
used the spectrograph to study the 
acoustic characteristics of sounds ut-
tered by infants. Lynip criticized
previous work on the speech develop-
ment of infants on the grounds that the
gathering of speech data must be
devoid of such handicaps as the fal-
libilities of the human ear and also of
the use of phonetic systems and that the
data must be analyzed in an objective
manner. He worked with one infant
from birth to 56 weeks. Lynip’s view is
that the resonance pattern of the infant's
vowel sounds is different from that of
adult vowels. That there are differences
between the acoustic patterns of infants
and adults is not in itself the least un-
usual, as indicated in Peterson and
Barney’s (1952) study in which dif-
ferences among 33 men, 28 women, and
15 children speaking various dialects
were reported. Thus there is some 
question about Lynip’s interpretation of 
his data. Winitz (1958) repeated 
Lynip’s experiment, using 31 vowel 
sounds agreed upon by seven judges, 
with results similar to those reported 
by Koch (1952). 


METHODS OF STUDYING 
SPEECH SOUND ERRORS 


It is obvious that the focus of the 
therapist’s attention should be upon 
the errors made by a child, for here the 
burden of remedial effort will be 
directed. It follows then that methods
of studying speech-sound errors occupy 
a major role in the diagnosis of the 
child’s speech status. 

A number of studies have been con- 
cerned with noting the errors in chil- 
dren’s speech. Four types of error are 
usually reported: substitutions, omis- 
sions, distortions, and additions. A sub- 
stitution error is the use of one con- 
sonant for another, such as [w] for 
[r] or [s] for [z]. An omission means
that a sound is lacking in its proper
place in a word. A distortion is usually 
defined as a modification that only ap- 
proximates the standard sound, such as 
a hissing or whistling [s]. Frequently 
the [s] sounds take on the quality of a 
lateral, [ɬ], in which case it becomes
a substitution. An addition may be il-
lustrated by the pattern “puhlease” for
please.
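Three of the four error types can be identified mechanically by comparing the standard transcription of a word with the transcription of what the child said. The Python sketch below uses the standard library's `difflib.SequenceMatcher` for the comparison; this is an illustrative choice, not a method taken from the studies cited. The transcriptions are hypothetical, and distortions cannot be detected this way because they require a judgment about sound quality.

```python
from difflib import SequenceMatcher

# Map difflib's edit labels onto the error types named in the text.
ERROR_NAMES = {
    "replace": "substitution",
    "delete": "omission",
    "insert": "addition",
}


def classify_errors(target, produced):
    """Compare two phoneme lists and return a list of
    (error type, target sounds, produced sounds) tuples."""
    matcher = SequenceMatcher(a=target, b=produced, autojunk=False)
    errors = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag != "equal":
            errors.append((ERROR_NAMES[tag], target[i1:i2], produced[j1:j2]))
    return errors


# [w] for [r], a missing final sound, and "puhlease" for "please".
print(classify_errors(["r", "ɛ", "d"], ["w", "ɛ", "d"]))
print(classify_errors(["k", "æ", "t"], ["k", "æ"]))
print(classify_errors(["p", "l", "i", "z"], ["p", "ə", "l", "i", "z"]))
```

A run of several target sounds replaced by a different number of produced sounds is reported here as a single substitution, so such cases would still need to be reviewed by the examiner.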

The methods of collecting data on 
speech-sound errors vary. Simple ob- 
servation with live recording (paper 
and pencil) is frequently used in the 
studies. Sometimes the spontaneous 
sounds are recorded in play situations. 
In other studies test items are assembled 
to elicit responses to objects, pictures, 
and questions (Branscom, Hughes, and 
Oxtoby, 1956). In some of the work
selected words are read to the subject, 
who is instructed to repeat them. 

Bryngelson and Glaspey (1951) have 
developed picture articulation tests for 
commonly missed consonant sounds for 
rapid testing of young children. This 
type of test does not, however, provide 
a complete phonetic inventory. Hal- 
stead and Wepman (1949) developed 
a test for aphasics, and Eisenson (1954)
published a graded test for the examina- 
tion of aphasics. In tests of stuttering 
the subjects are requested to read 
selected passages, and stuttered words 
are counted. In the sound discrimina-
tion test pairs of similar sounding words
with but one different phoneme are
presented to the child either verbally or
pictorially.

Wellman, Case, Mengert, and Brad-
bury (1931) tested the ability of 204
nursery-school children to produce the
sounds of the English language by
recording live in the International
Phonetic Alphabet verbal responses
obtained by questions and the
presentation of pictures and toys. The
relative difficulties of consonant
elements, consonant blends,
vowels, and diphthongs were studied. 
Williams and McFarland (1937) fol- 
lowed this research with a study on the 
nature of the commonest errors of pre-
school children and concerned themselves
mainly with the most frequent substi- 
tutions in 104 speech sounds in con- 
sonant elements, consonant blends, 
triple blends, vowels, and diphthongs. 
Metraux (1950) used 116 children of
average or above intelligence in a 
guidance center. There were seven 
ages: 18, 24, 30, 36, 42, 48, and 54 
months. The responses of each child 
were recorded in the international 
alphabet. Correct production, substitu- 
tions, and omissions of sounds in the 
initial, medial, and final positions in 
words were noted. 

Roe and Milisen (1942) published an 
extensive survey on consonant substitu- 
tion and omission errors in the speech 
of 1989 normal children in grades one to 
six. They found that substitution errors 
were more frequent than omissions.
Sayler (1949) continued the Roe and 
Milisen work upward through grades 
seven to twelve with the same result.
Bangs (1942) selected a homogeneous 
group of primary aments from an in- 
stitution and studied the substitutions, 
omissions, and additions in their speech 
by using 65 picture cards as stimuli. He 
reports that retarded children com- 
mitted fewer substitution errors than 
omissions. Irwin (1959) devised a short 
test consisting of the ten most difficult 
consonants reported in the literature
and administered it to a group of re-
tarded children with IQ’s ranging from
25 to 49 and to another group with IQ’s
ranging from 50 to 90. The first group
made more omission than substitution
errors, but the reverse situation was
found for the second. Omission errors
in the final position accounted for the
bulk of errors made by the low IQ
group. Irwin (1956b) found that omis-
sions are more frequent than substitu-
tions in the speech of cerebral palsied
children. Johnson and Leutenegger
(1956) have edited a number of studies
on stuttering, and Johnson, Brown,
Curtis, Edney, and Keaster (1948) have
discussed the speech problems of handi-
capped school children.

Irwin (1956c, 1957e,f, 1958a,b) has
constructed a series of short tests for
use with cerebral palsied children. Since 
these children fatigue quickly and their 
attention span often is limited, each of 
the tests was limited to about 15 or 20 
words, in which speech elements were 
placed in various positions. A further 
aim of the project was to secure a na- 
tionwide sample, so that it is representa- 
tive of the entire population of the 
country. The observer reliability in 
reading the tape recordings averages 
about 90 per cent. Thus the error in this
work amounts to about 10 per cent. The 
reliability of the test, using the Hoyt 
(1951) formula based on an analysis of 
variance, is .91 ± .01. The criterion of
age progression of the scores in terms 
of phoneme type and phoneme fre- 
quency is not always met with these 
children. The ranges of difficulty of the 
items vary from about 20 to 90 per
cent. The correlations of items with
the score of the total test are high,
whereas the intercorrelations of items
generally are low. Thus the discriminat-
ing power and the uniqueness of the
items are both statistically satisfactory.
The validity of these tests was deter-
mined by the method of extreme groups
(Adkins, 1947). Irwin (1957e) has also
presented good evidence of its validity.
Westlake (1951) has built a test for
examining the oral muscular functions
of cerebral palsied children. The test
consists of four parts, organized as
follows:


A. Tongue
   1. Move tip below lower lip
   2. Move tip above upper lip
   3. Move tip from corner to corner
   4. Move tip against lower teeth
   5. Move tip against upper teeth

B. Lips
   1. Extend corners of lips backward from forward rounded
      position (teeth together)
   2. Open and close lips (teeth together)

C. Mandible
   1. Open and close mouth (jaw and lips move together)

D. Respiration
   1. Sustain even phonation of the sound for 10 seconds
   2. Sustain even silent exhalation for 10 seconds


The number of times in one minute 
each of the oral movements occurs 
successfully constitutes the score. Their 
sum is the total score. The number of 
seconds the respiratory responses are 
sustained yields another score. Irwin
(1957c) has determined the reliability
of this test in terms of correct scores 
recorded by each of two observers. The 
mean of one observer was 62.4, of the 
other, 62.5. The ranges were identical. 
Irwin has correlated the scores on West- 
lake’s oral test and a short consonant 
test with 226 cerebral palsied children 
34 to 19 years old. The r is .534 ± .049.
The r for correct speech scores and
respiration was .296 ± .063. It would
seem that oral movement is somewhat 
better as a predictor than respiration of 
consonantal articulation. 
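The analysis-of-variance reliability attributed to Hoyt earlier in this section can be sketched in Python. The sketch uses the commonly stated form of the coefficient, one minus the ratio of the residual mean square to the between-persons mean square in a persons-by-items table; it is an assumption that this is the form applied to the short tests. The function name and the score matrix are hypothetical.

```python
def hoyt_reliability(scores):
    """scores: one row per child, one column per test item
    (for example 1 = correct, 0 = incorrect)."""
    n_persons = len(scores)
    n_items = len(scores[0])
    grand_mean = sum(sum(row) for row in scores) / (n_persons * n_items)
    person_means = [sum(row) / n_items for row in scores]
    item_means = [
        sum(row[j] for row in scores) / n_persons for j in range(n_items)
    ]

    # Partition the total sum of squares into persons, items, residual.
    ss_total = sum((x - grand_mean) ** 2 for row in scores for x in row)
    ss_persons = n_items * sum((m - grand_mean) ** 2 for m in person_means)
    ss_items = n_persons * sum((m - grand_mean) ** 2 for m in item_means)
    ss_residual = ss_total - ss_persons - ss_items

    ms_persons = ss_persons / (n_persons - 1)
    ms_residual = ss_residual / ((n_persons - 1) * (n_items - 1))
    return 1 - ms_residual / ms_persons


# Hypothetical scores of four children on a three-item test.
scores = [
    [1, 1, 1],
    [1, 1, 0],
    [1, 0, 0],
    [0, 0, 0],
]
print(round(hoyt_reliability(scores), 2))
```

The coefficient approaches 1 when children keep the same rank order from item to item, and it falls as the residual variation, the part not explained by child or item differences, grows.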

The speech-sound discrimination test 
offers a method that is both objective 
and easy to score. Speech-sound dis- 
crimination is usually defined as the 
ability to recognize auditory distinctions 
among different phonemes. It repre- 
sents a perceptual factor in speech and 
language ability. The test is constituted 
by a series of words paired in such a 
way that only one sound in each pair 
differs. A simple illustration would in- 
volve the ability of a child to recognize 
and discriminate the difference between 
the initial consonant in hat and fat or 
in the pair “horn-corn.” The subjects 
may be presented with paired words, 
spoken orally by the experimenter, or 
with paired pictures and asked to tell 
whether they are the same or different. 
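The construction and scoring of such a test can be sketched in Python. The sketch checks that the two words of a pair differ in exactly one sound and scores same-different judgments. The function names and the phonetic transcriptions of the example words are hypothetical, and the check assumes both words have the same number of sounds.

```python
def differs_in_one_sound(word_a, word_b):
    """True when two equally long phoneme sequences differ in exactly
    one position."""
    if len(word_a) != len(word_b):
        return False
    return sum(a != b for a, b in zip(word_a, word_b)) == 1


def score_same_different(items, answers):
    """items: list of (first word, second word) phoneme sequences.
    answers: the child's 'same' or 'different' judgment for each item.
    Returns the number of correct judgments."""
    score = 0
    for (first, second), answer in zip(items, answers):
        expected = "same" if first == second else "different"
        if answer == expected:
            score += 1
    return score


HAT = ("h", "æ", "t")
FAT = ("f", "æ", "t")
HORN = ("h", "ɔ", "r", "n")
CORN = ("k", "ɔ", "r", "n")

items = [(HAT, FAT), (HORN, CORN), (HAT, HAT)]
answers = ["different", "same", "same"]

print(differs_in_one_sound(HAT, FAT))
print(score_same_different(items, answers))
```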

An early speech-sound discrimination 
test was constructed by Travis and 
Rasmus (1931). It has served as the 
prototype of subsequent tests. They 
compared functional articulation de- 
fectives with good freshman speakers 
at a university and at the elementary- 
school level. At every age compared the 
cases with disorders made significantly 
more errors than the normal speakers.
Mansur (1950) built a sound discrimina-
tion test, which was later modified by
Pronovost and Dumbleton (1953). It
consists of 36 pairs of discrimination 
words presented in 72 picture items. 
Templin (1957), influenced by this 
work, has reported two speech-sound 
discrimination tests, one designed for 
use with children 3 to 5 years old and 
one with children 6 to 8. The first test 
includes 59 picture items or pairs. Pairs 
of pictures of familiar objects whose 
names are words similar in pronuncia-
tion except for single sounds were
pasted on a card and shown to the
child. An example is “box” and 
“blocks.” The examiner pronounced 
the name of one of the two objects, and 
the child was required to point to it. 
This is called the stimulus word method. 
It is a useful device, but a difficulty 
may arise because of the child’s limited 
vocabulary. By carefully selecting 
words used in preschool word and 
story books, in children’s conversations,
and in the vocabulary sections of pre- 
school intelligence tests, this difficulty 
in large part may be obviated. The dis- 
crimination test for the 6-to-8-year-old 
children was composed of 50 pairs of
words which were read to the children. 
Schiefelbusch and Lindsey (1958) 
have constructed a new test of sound 
discrimination. Eleven consonant sounds
were selected for the initial and final 
positions in words selected for the 
study. They are [s], [U], [r], 1, 
[6], [v], [t], [gl], [k], and [tf]. The 
words with their picture representations 
were arranged on cards. On each card 
were three pictures, two alike in regard 
to rhyming, initial, or final sounds and 
a third that contained a sound fre-
quently substituted for the one em-
ployed in the other two pictures. There
was a total of 90 pictures. This test was
used to determine three discrimination
abilities, one as the children hear the
items spoken by others, another as they
produce them themselves, and a third
part in which they evaluate the items
silently. The picture cards were pre-
sented in three ways:

1. The therapist named the three
pictures on each card and asked the
subject to select the two that sounded
alike.

2. The child was instructed to name
the three pictures and to determine
the two that were alike.

3. The child was instructed to look
silently at three pictures and select the
two that contained the similar elements.


The pictures were presented to
matched groups of children with defec- 
tive articulation and children with 
normal speech. Significant differences 
were found between the speech-defec-
tive and the normal speaking groups in
ability to discriminate sounds. The 
method of presentation did not show 
any conclusive results that would in- 
dicate that speech-defective children 
have greater difficulty when instructed 
to look silently at the three pictures 
and to select the two that contained the 
similar element. 


VOCABULARY MEASUREMENT 


Before proceeding with the evaluation 
of vocabulary tests, something should 
be said about the studies that report the 
appearance in infant speech of the first 
word. There is a number of reports in
the literature on this matter. Bühler
(1930), Leopold (1939), and Lewis 
(1951) have summarized these observa- 
tions. In the main, the results were ob- 
tained by casual observation without 
careful controls or recording apparatus. 

There is a special problem in identify- 
ing and defining the earliest words of 
infants, for many of them are mutila- 
tions or approximations to standard 
words. McCurry and Irwin (1953) have
given the following definitions:


A standard word is defined in terms of
its phonetic listing in Kenyon and Knott's
Pronouncing Dictionary of American
English (p. 133).

A word approximation is defined as a
phonetic pattern in which one or more
of the phonetic elements of the standard
word, either vowel or consonantal ele- 
ments, are present. This means that some 
elements of the standard pattern are 
omitted and other elements are substituted 
or added (p. 133). 


The relevance of these definitions to 
problems of methodology is that the 
reports in the literature on the ap- 
pearance of the first word usually fail 
to make the distinction between the 
standard word and the approximation 
to the standard word. This failure some- 
times renders inconclusive the results 
of the method of observing the first 
words.

A widely used research method in the 
area of speech development is the vo- 
cabulary test. Wechsler (1949) has in- 
cluded a vocabulary test as a separate 
unit in his intelligence scale for children. 
Reliabilities for this unit are reported 
in the manual describing the scale as 
ranging from .77 to the .90’s. The verbal
part of Wechsler’s scale correlates well 
with the Revised Stanford-Binet Test. 
Vocabulary tests for use with normal
children are presented by Smith (1926), 
Van Alstyne (1929), Williams and 
McFarland (1937), Seashore and Ecker- 
son (1940), whose test was used by 
Smith (1941). Other tests are those by 
Ammons and Ammons (1948) and 
Templin (1957) and a series of short 
tests for use with cerebral palsied chil-
dren by Irwin (1956a,c, 1957b,c,d,e).

Since some of these tests do not 
measure up to the criteria for the 
standardization of a testing instrument, 
at this point it might be pertinent to 
indicate briefly what these requirements 
are. There are a number of techniques
for the selection of items in a testing
instrument. Some are based on scaling
methods, and factor analysis is also used. 
Usually a statement of the reliability 
and validity of a test is considered 
sufficient to establish its usefulness, but 
such statements are not always given. 
Moreover, there are other valuable 
criteria for estimating the usefulness of 
a test, such as age progression in the 
scores of normal children, the range of 
difficulty of the items, the discriminat- 
ing power of the items, and their 
uniqueness. The following definitions 
are given of these criteria in order that 
they may be applied to a number of 
tests to be considered. 

One of the first requirements of a test 
is its internal consistency or reliability, 
which may be determined in several 
ways. Among the most acceptable is the 
Kuder-Richardson Formula (1937) or 
Hoyt’s (1941) method based on an 
analysis of variance. The range of dif- 
ficulty of an item is determined by the 
percentage of testees passing. If a large 
percentage passes an item, it is rated 
easy. If only a small percentage passes
it, it is considered difficult. The range of 
difficulty of the items of a test should 
ordinarily be 10 or 15 per cent to 85
or 90 per cent. The problem of age
progression, or the increasing size of 
scores with age, may frequently be 
determined by inspection. It is im-
portant in a standardized test that its
items discriminate the individuals taking 
the test. If all the testees, irrespective 
of their ability, make the same scores on 
an item, that item lacks the power to 
discriminate them. This criterion is cal- 
culated by the correlation of the item 
scores with the scores on the whole 
test. High correlations indicate satis- 
factory discriminating power. Item 
uniqueness means that two items do not 
to any great degree test the same func-
tion. It may be determined by the
intercorrelations among the items. A
small r between two items indicates
their uniqueness. These values should
be lower than the r’s for discriminating
power. 
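
The criteria just defined can be sketched in a few lines of
Python. This is only an illustration under stated assumptions:
items are scored 0 (fail) or 1 (pass), the data layout and the
function names are hypothetical, and Kuder-Richardson Formula 20
is used as the reliability estimate (the text names the
Kuder-Richardson work but not a specific formula number).

```python
from statistics import mean, pvariance


def pearson_r(x, y):
    """Product-moment correlation; returns 0.0 if either variable is constant."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / (sxx * syy) ** 0.5


def item_analysis(scores):
    """scores: one row per testee, each row a list of 0/1 item scores (assumed layout)."""
    n_items = len(scores[0])
    totals = [sum(row) for row in scores]
    items = [[row[j] for row in scores] for j in range(n_items)]

    # Difficulty: percentage of testees passing each item.
    difficulty = [100.0 * mean(col) for col in items]

    # Discriminating power: correlation of each item with the whole-test score.
    discrimination = [pearson_r(col, totals) for col in items]

    # Uniqueness: intercorrelations among items (small r = more unique).
    inter_item = {
        (i, j): pearson_r(items[i], items[j])
        for i in range(n_items)
        for j in range(i + 1, n_items)
    }

    # Internal consistency by Kuder-Richardson Formula 20 (assumed choice).
    # Requires that total scores are not all identical.
    p = [mean(col) for col in items]
    sum_pq = sum(pi * (1 - pi) for pi in p)
    kr20 = (n_items / (n_items - 1)) * (1 - sum_pq / pvariance(totals))

    return {
        "difficulty": difficulty,
        "discrimination": discrimination,
        "inter_item": inter_item,
        "kr20": kr20,
    }
```

Applied to a pass-fail matrix, the returned difficulty values can
be checked against the 10 or 15 to 85 or 90 per cent range, and
the inter-item values compared with the item-total values.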

The most important criterion of a 
test is its validity. An acceptable method 
for establishing the validity of a testing 
instrument is to give it together with a 
previously standardized test to the same 
sample of individuals. A high correla- 
tion is taken as evidence of the validity 
of the new test. Ammons and Ammons 
(1948) validated their picture vocabu-
lary test against Wechsler’s (1949) in-
telligence scale for children. The r
= .82. If an external criterion, such as
another test, is not available, another 
technique for establishing validity is the 
Adkins (1947) method of extreme 
groups. For instance, if the difference 
between the means of the articulation 
scores of two extreme groups, say
mildly involved and severely involved
cerebral palsied children, is significant
statistically, it may be taken as evidence
of the validity of the data. The physi-
cian’s rating of the degree of involve-
ment constitutes the external criterion
and thus meets the requirement for es- 
tablishing validity. Age progression of 
test scores may also be taken as evidence 
for the validity of a test. 
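
The extreme-groups comparison can be illustrated with a short
sketch. The text does not say which significance test Adkins
prescribes; the example below assumes Welch's unequal-variance
t statistic as a stand-in, and the function name and score lists
are hypothetical.

```python
from math import sqrt
from statistics import mean, variance


def welch_t(group_a, group_b):
    """Welch's t statistic and approximate degrees of freedom for two groups.

    Assumed stand-in for the significance test; each group needs at
    least two scores and the scores must not all be identical.
    """
    na, nb = len(group_a), len(group_b)
    va, vb = variance(group_a), variance(group_b)  # sample variances
    se2 = va / na + vb / nb                        # squared standard error
    t = (mean(group_a) - mean(group_b)) / sqrt(se2)
    df = se2 ** 2 / ((va / na) ** 2 / (na - 1) + (vb / nb) ** 2 / (nb - 1))
    return t, df
```

The resulting t is then referred to a t table at the computed
degrees of freedom; a significant difference between, say, the
mildly and severely involved groups would be read as evidence of
validity in the sense described above.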

One of the earliest vocabulary tests 
was published by Smith (1926). Two 
hundred and forty-four children, vary- 
ing in age from 8 months to 6 years, 
were observed. One hundred children 
were recruited from the Baby Examin-
ing Laboratory, 95 from the Preschool
Laboratories of the Iowa Child Welfare
Research Station and the Primary 
Group of the University Elementary 
School, 30 from the Day Nursery, and 
19 from the Home of the Friendless,
Cedar Rapids, Iowa. Thus there were
more children at the upper and lower 
ends of the mental scale than would be 
found in a normal distribution. Test 
items taken from the Thorndike (1921) 
Word Book numbered 203. An object 
or a picture was presented to the child 
and appropriate questions were asked. 
The order of difficulty of the words 
was determined in terms of the fre- 
quency of their knowledge by 61 chil- 
dren. The reliability of the test was 
secured by administering two halves of 
it to 53 children varying in age from 
4 to 6 years. The product moment cor-
relation was .95 ± .02. The correlation
with scores on the Terman (1916)
Vocabulary List was .84 ± .03. The
criterion of age progression, that is, the 
increase in correct scores from the 
youngest to the oldest age, was met
in this study. Nothing, however, is in- 
dicated about the discriminating power 
or the uniqueness of the items. This 
test was widely used and has had a 
pronounced influence on the develop-
ment of other tests.

Williams and McFarland (1937) 
produced a revision of the Smith Vo-
cabulary Test. The number of items
was reduced and cast into two lists,
Form I and Form II, each containing 42
items. The test was given to 252 chil-
dren, 27 to 74 months of age, who
were attending the preschool labora-
tories of the Iowa Child Welfare Re-
search Station and a second group of
64 orphanage children, 28 to 73 months
of age. Here, again, one group included
almost exclusively children of superior 
status and above average mentality, the 
other, children of low socioeconomic 
status and below average intelligence. In 
neither the Smith test nor the revision 
by Williams and McFarland was the 
distribution of cases typical of the 
population. The reliability of the test,
determined by the correlation between
scores on Forms I and II, was .96 ± .003
for 359 children. The order of difficulty
of the items was determined for both
lists. The range is 0 to 99.6 per cent.
There is progression in the scores with 
both chronological and mental age. 
Since age is an external criterion, age 
progression in scores may be taken as a 
form of validity. Validity, in the sense 
of correlation with another vocabulary 
test, is not reported. No statements are 
made concerning the discriminating 
power or the uniqueness of the items 
of this test. 

Van Alstyne (1929) built a picture 
vocabulary test for use with 3-year-olds
but which could be used with children
whose mental ages are between 2 and 5
years. The multiple-choice method was
used to test the children. The child 
was presented with a card with four 
pictures and asked, “Show me the
_______,” for nouns, and “What is
_______ing?” for verbs. There are
45 cards in all, and the test words in- 
clude 31 nouns and 14 verbs. Norms are 
based on the scores of 80 children from 
33 to 39 months of age. Two criteria 
of test construction are reported. The 
reliability correlation of the test was 
0.87, and there is almost perfect correla- 
tion with the Kuhlman Mental Test. 
Other criteria are not reported. 

Ammons and Ammons (1948) de- 
veloped a full-range picture-vocabulary 
test for the purpose of securing quick 
estimates of verbal ability of children. 
It can also be used to test the intelli- 
gence of the physically handicapped and 
the retarded child. Van Alstyne’s 
multiple-choice recognition picture ap-
proach, involving the procedure of the
child pointing to the appropriate pic- 
ture named, was adopted. The materials 
are 64 ink line drawings arranged in 
groups of four, so that there are alto- 
gether 16 plates. The words, organized 
by plates, are in order of difficulty. 
There are 253 words and phrases in the 
list. The subjects were 51 white chil- 
dren all from the same school, two 
boys and two girls from each grade 
level from kindergarten through the 
twelfth grade. The average IQ was 115. 
Test reliability, using the odd-even 
method, was .95 ± .01. Instead of re-
porting a range of difficulty of the items
of the test, an attempt was made to 
determine the 50 per cent level of
difficulty by throwing out items that 
were not passed by 50 per cent of the 
testees. The validity of the test, meas- 
ured by the correlation between the 
total picture-vocabulary test and scores 
on the Binet vocabulary test, was 
.96 ± .01. Templin (1957) found that
there is age progression in scores of 
her subjects when tested with the Am- 
mons Full-Range Picture-Vocabulary 
Test. Seashore and Eckerson (1940) 
issued a vocabulary test available for use 
with children from grades 1 to 12. 
Smith (1941) established norms for it 
on 867 children. The odd-even re-
liability for this test was about .90 for
grades 5 to 9. 
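
The odd-even method mentioned for several of these tests can be
sketched as follows. The sketch assumes 0/1 item scores in a
hypothetical row-per-child layout. It also reports the
Spearman-Brown stepped-up value, which is an addition here; the
text does not say whether the published coefficients were
corrected in this way.

```python
from statistics import mean


def pearson_r(x, y):
    """Product-moment correlation; returns 0.0 if either variable is constant."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / (sxx * syy) ** 0.5


def odd_even_reliability(scores):
    """scores: one row per child, each row a list of item scores (assumed layout)."""
    # Items 1, 3, 5, ... form one half; items 2, 4, 6, ... form the other.
    odd_half = [sum(row[0::2]) for row in scores]
    even_half = [sum(row[1::2]) for row in scores]
    r_half = pearson_r(odd_half, even_half)
    # Spearman-Brown estimate for the full-length test (not stated in the text).
    r_full = 2 * r_half / (1 + r_half)
    return r_half, r_full
```

The first value is the correlation between the two half-tests;
the second estimates the reliability of the whole test.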

A very significant vocabulary study 
was done by Mildred Templin (1957). 
She emphasized the method of dis- 
criminating between the vocabulary of 
use and the vocabulary of recognition 
or understanding and studying the two 
separately. The vocabulary of use in- 
cludes those words that are actually 
produced in oral or written speech. The 
vocabulary of understanding consists of 
words that are recognized or under- 
stood when read or heard. She also 
introduced the method of comparing 
results obtained with children of upper 
socioeconomic status with those of 
lower socioeconomic status. The vo- 
cabulary of recognition was measured 
by the Ammons Full-Range Picture- 
Vocabulary Test and the Seashore- 
Eckerson English Recognition Vo- 
cabulary Test. The subjects were 30 
boys and 30 girls aged 3, 3.5, 4, 4.5,
and 5 years. There was a steady increase 
with age in mean scores of 60 children 
on the Ammons test. When the data 
were combined into whole year inter-
vals, the scores tended to be quite
similar to those reported by Ammons 
and Holmes (1949). In administering 
the Seashore-Eckerson Test, Smith 
(1941) first asked each child to define 
a word; then, if he was unable to do 
so, he was given four possible meanings
from which to choose the correct one.
Templin followed the manual of direc-
tions prepared by Smith for administer-
ing the test. In studying the vocabulary
of use, 50 remarks were secured from
her subjects. There was a steady in-
crease in the number of different words
used from one age to another.

Gesell’s (1928) scale includes items
for indicating language development.
It is limited to the period below ...
months of age. In her standardized
scales of the mental development of
babies from 1 to 24 months Griffiths
(1954) included a section with items on 
speech and hearing. The items are 
largely premeaning sounds. 


METHODS OF STUDYING 
SENTENCE STRUCTURE 


The babbling speech of the young
baby is chaotic, meaningless, and com- 
pletely devoid of conventional form and 
content. In no sense is it a language. 
Quite literally, it is only the raw 
materials from which a language, any 
language, may be formed. It is one of 
the supreme accomplishments of the 
human being that from these meaning- 
less speech sounds the infant eventually 
molds and constructs such a com- 
plicated structure as a compound or 
complex sentence, not to say a full- 
blown meaningful language, and that he 
does it for the most part quite early in
life. It is important then that the de-
velopment of language should be care-
fully studied and information gathered
as to how the child manages to achieve
the skill of combining words into sen-
tences.

There is a problem of defining the
sentence. It is sometimes designated as
verbalization. Templin (1957) defines
the term verbalization as a thought unit,
something “in which the child has some-
thing to express. A unit might be as
brief and incomplete as a single word;
it might be a grammatically complete 
and accurate sentence; or it might be 
several clauses uttered one after another 
as a loosely integrated sentence” (p. 
74). Williams and McFarland (1937) 
preferred to use the term “expression 
unit” rather than the sentence. This is a 
unit “in which the child has something 
to express to someone else.” “This might 
be as brief and incomplete as a single 
word, or might extend over several 
highly connected clauses given together 
as a more or less integrated unit” (p. 
10). Williams goes on to treat the ex- 
pression unit in several ways: (1) 
correctness and completeness of word 
usage, (2) mean length of the expression 
unit, (3) sentence completeness in terms 
of subject-predicate-object relationship, 
and (4) complexity of organization of 
the unit in terms of the grammatical 
classification of simple, compound, and 
complex. For the measurement of the 
expression unit, arbitrary weights of 0,
1, 2, and 3 were assigned to unin- 
telligible, simple, complex, and com- 
pound-complex units. 

Templin (1957) has devoted an ap- 
pendix to rules defining sentences. They 
are adapted from the lists of McCarthy
(1930) and Davis (1937). The appendix 
is quoted in full: 


Rules Followed for Classification of 
Words and Sentences 


A. Rules for Counting Number 
of Words 


1. Contractions of subject and predicate
like “it’s” and “we’re” are counted as
two words.

2. Contractions of the verb and the nega-
tive such as “can’t” are counted as one
word. 

3. Each part of a verbal combination is 
counted as a separate word: thus “have 
been playing” is counted as three words. 

4. Hyphenated and compound nouns are 
one word. 

5. Expressions which function as a unit in 
the child’s understanding were counted 
as one word, thus “oh boy,” “all right,” 
etc., were counted as one word, while 
“Christmas tree” was counted as two 
words. 




B. Classification of Sentence Structure 


I. Complete sentences. 

A. Functionally complete but structur- 
ally incomplete. This includes nam- 
ing; answers in which omitted 
words are implied because they 
were expressed in the question; ex- 
pletives; and other remarks, incom- 
plete in themselves, which are 
clearly a continuation of the pre- 
ceding remark, 

B. Simple sentence without phrase. 

C. Simple sentence containing (1) 
phrase used as adjective or adverb 
in apposition, (2) compound sub-
ject or predicate, (3) compound
predicate. 

D. Complex sentence (one main clause, 
one subordinate clause) with (1) 
noun clause used (a) as subject, (b) 
as object, (c) in apposition, (d)
as predicate nominative, (e) as ob-
jective complement; (2) adjective
clause (a) restrictive, (b) non-re-
strictive; (3) adverbial clauses of
(a) time, (b) place, (c) manner,
(d) comparison, (e) condition, (f)
concession, (g) cause, (h) purpose, 
(i) result; (4) infinitive. 

E. Compound sentence (two inde-
pendent clauses).

F. Elaborated sentence: (1) simple
sentence with two or more phrases,
or compound subject, or predicate
and phrase; (2) complex sentence
with more than one subordinate
clause, or with a phrase or phrases;
(3) compound sentence with more
than two independent clauses, or
with a subordinate clause or
phrases.


II. Incomplete sentences. 


A. Fragmentary or incomprehensible. 
Example: “Well—not this, but—.” 

B. (1) Verb omitted completely, (2) 
auxiliary omitted, verb or participle 
expressed, (3) verb or participle 
omitted, auxiliary expressed. 

C. Subject omitted, either from main 
or subordinate clause. 

D. Introductory “there” omitted. 

E. Pronoun other than subject of 
verb omitted. 

F. Preposition (usually needed sign of 
infinitive) omitted. 

G. Verb and subject omitted. 

H. Main clause incomplete, subordi- 
nate clause or second clause of 
compound sentence complete. 

I. Main clause complete, subordinate 
or second clause incomplete. Ex- 
ample: “I know why.” 

J. Omission from both main and sub- 
ordinate clauses. 

K. Essential words present, but sen- 
tence loosely constructed because 
of (1) omission of conjunction, (2) 
insertion of parenthetical clause, 
(3) changes in form halfway in 
sentence. Example: “We have— 
my brother has a motorcycle.” 

L. (1) Definite, (2) indefinite article 
omitted. 
M. Object omitted from either main
clause or prepositional phrase. 

N. Sentence left dangling. 
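
The word-counting rules in section A of the appendix lend
themselves to a small sketch. Everything named below is
hypothetical: the list of unit expressions contains only the two
examples given in rule 5, the contraction endings are an assumed
list, and a possessive such as "daddy's" cannot be told apart from
a subject-predicate contraction without a human judgment.

```python
import re

# Rule 5: expressions counted as one word (only the appendix's own examples).
UNIT_EXPRESSIONS = {("oh", "boy"), ("all", "right")}

# Rule 1: endings treated as subject-predicate contractions (assumed list).
SUBJECT_PREDICATE_ENDINGS = ("'s", "'re", "'m", "'ll", "'ve", "'d")


def count_words(remark):
    """Count words in one remark under rules A.1 to A.5 (approximate sketch)."""
    text = remark.lower().replace("’", "'")
    # Hyphenated forms stay together as one token (rule 4).
    tokens = re.findall(r"[a-z]+(?:[-'][a-z]+)*", text)
    count = 0
    i = 0
    while i < len(tokens):
        tok = tokens[i]
        if i + 1 < len(tokens) and (tok, tokens[i + 1]) in UNIT_EXPRESSIONS:
            count += 1          # rule 5: unit expression counts once
            i += 2
            continue
        if tok.endswith("n't"):
            count += 1          # rule 2: verb plus negative is one word
        elif tok.endswith(SUBJECT_PREDICATE_ENDINGS):
            count += 2          # rule 1: subject plus predicate is two words
        else:
            count += 1          # rule 3: each part of a verb phrase counts
        i += 1
    return count


def mean_length_of_response(remarks):
    """Mean number of words per remark over a sample of remarks."""
    return sum(count_words(r) for r in remarks) / len(remarks)
```

Such a tally is only a first pass; the classification of sentence
structure in section B still requires a trained judge.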

McCarthy’s (1930) careful study on 
language development in the preschool 
child introduced a method that has been 
followed extensively by subsequent 
workers. She secured 50 verbalizations
by 140 children ranging in age from 18
months to 54 months. The verbaliza-
tions were elicited by means of toys
and books. The responses were analyzed
in terms of their length, their grammati-
cal form, and the parts of speech used.
A similar pattern was used in a study
of twins 2 to 5 years old by Day (1932)
and by Davis (1937), whose samples
consisted of only children, singletons
with siblings, and twins 5.5 to 9.5 years
of age. Other reports on this problem 
are by Shire (1945), Little and Williams 
(1937), and Templin (1957). 

The length of response in terms of 
number of words is considered to be 
the most easily determined and ob- 
jective method of studying the sen- 
tence. It has been widely used. Examples 
are the studies by Smith (1926), 
McCarthy (1930), Day (1932), Fisher 
(1932), Davis (1937), Shire (1945), and 
Templin (1957). LaBrant (1933) used 
the mean length of the clause in her 
work and suggested that the counting 
of predicates is an objective method for 
the study of the sentence. 

Templin (1957) classified the remarks 
of 60 children 3 to 8 years old into six 
major categories: (1) functionally 
complete but structurally incomplete; 
(2) simple without phrase; (3) simple 
with phrase; (4) compound and com- 
plex; (5) elaborated; and (6) incom- 
plete. She has employed several other 
methods of categorizing the children’s
sentences. One is the conventional
grouping of sentences into declarative,
interrogative, imperative, and exclama-
tory. Another makes use of noun, ad-
jectival, and adverbial clauses. Her
treatment of the use of parts of speech 
by children 3 to 8 years old involves a 
sample of 128,000 words. The parts of 
speech listed are noun, verb, adjective,
adverb, pronoun, conjunction, preposi-
tion, article, and interjection. Templin’s
work undoubtedly presents the most
comprehensive treatment found in the
literature of the methodological prob-
lem of sentence structure of children.


FUNCTIONS OF LANGUAGE 


Piaget’s (1926) volume on the lan- 
guage and thought of the child pre- 
cipitated a controversy on the amount 
of egocentric speech indulged in by 
children. Methodology in this area on 
the side of research design is meager. 
It is concerned rather with how the data 
are to be analyzed. Piaget made records 
on the verbal behavior during free play 
of two 6-year-old children at La Maison
des Petits. Two observers followed
each child for about a month at the 
morning class, taking down in minute 
detail and in its context everything that 
was said by the child. The data con- 
sisted of about 1500 remarks. 

Egocentric speech is defined by 
Piaget as (1) repetition, vocalization 
in which the child repeats words and 
syllables for the pleasure of talking; 
(2) monologue, in which the child talks 
to himself as though he were thinking 
aloud; (3) collective monologue, in 
which another child is present but his 
point of view is never taken into ac-
count. Socialized speech is defined as
(1) adapted information, when the 
child really exchanges his thoughts with 
others; (2) criticism, which includes 
remarks about the work or behavior 
of others; (3) commands, requests, and 
threats; (4) questions; (5) answers. 
Piaget estimated that 38 per cent of 
the child’s remarks were egocentric 
and 62 per cent were socialized. 
McCarthy (1930) repeated Piaget’s 
work meticulously, using his definitions 
of egocentric and socialized verbal be- 
havior. In order to stimulate the child’s 
speech, she used pictures of animals, 
illustrated Mother Goose rhymes, and 
toys consisting of an auto, a cat, a 
telephone, a mouse, a music box, and a 
small ball. There were 140 children in 
the investigation, ranging in age from 
18 to 54 months and selected so as to be 
representative of the population. Fifty 
remarks from each of the subjects con- 
stituted the data. McCarthy reports that 
only 3.6 per cent of the vocalizations of 
children are egocentric in nature. Day 
(1932) and Davis (1937) used the 
McCarthy method in their studies. 
These investigations also failed to con- 
firm Piaget's findings, but a study by 
Fisher (1932) supported his results. Her 
subjects were 72 children, 37 girls and 
35 boys whose ages ranged from 22 to 
60 months. The mean IQ of the group 
was the unusually high value of 136. 
The data were obtained from steno- 
graphic records. On three different 
days, when his language was being re- 
corded, each child was followed by a 
trained stenographer from the moment 
he arrived in the morning until he had 
finished his lunch and was ready to 
enter the sleeping room for his after-
noon nap. The length of time of ob-
servation for each subject was nine to 
twelve hours. 

The controversial aspect of these
studies arises not so much in the
methods used to collect the data as in 
the manner of categorizing them. 
McCarthy (1954) has commented that 
the studies that have repeated Piaget’s 
work, using his categories, have literally 
failed to substantiate him, whereas those 
which have attempted new definitions, 
usually in terms of the subject of the 
sentence, agree with his findings. 
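
Since the disagreement turns on how remarks are categorized
rather than on how they are counted, it may help to see how little
the tally itself involves. The sketch below assumes each remark
has already been coded by a judge with one of Piaget's category
names as listed above; the label strings and the function name
are hypothetical.

```python
# Piaget's categories as given in the text (label spellings are assumed).
EGOCENTRIC = {"repetition", "monologue", "collective monologue"}
SOCIALIZED = {
    "adapted information",
    "criticism",
    "commands",   # stands for commands, requests, and threats
    "questions",
    "answers",
}


def egocentric_percentage(labels):
    """Percentage of coded remarks that fall in the egocentric categories."""
    coded = [lab for lab in labels if lab in EGOCENTRIC or lab in SOCIALIZED]
    if not coded:
        raise ValueError("no remarks carry a recognized category label")
    egocentric = sum(1 for lab in coded if lab in EGOCENTRIC)
    return 100.0 * egocentric / len(coded)
```

Two investigators applying different definitions to the same
record would feed different labels into the same tally, which is
where the divergent percentages originate.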

In connection with Piaget’s concepts 
of the functions of language a word 
should be said about other views. This 
is also a somewhat controversial matter, 
for there are two extremely different 
schools of thought. One, exemplified 
by Wundt (1912), holds that the func-
tion of language is to express psychic
processes, such as ideas, feelings, and 
emotions. On the other hand, for 
DeLaguna (1927) it is a form of be- 
havior. She writes: 


Speech is the great medium through
which human cooperation is brought
about. It is the means by which diverse
activities of men are coordinated and cor- 
related with each other for the attainment 
of common and reciprocal ends. Men do 
not speak simply to relieve their feelings or 
to air their views, but to awaken a re- 
sponse in their fellows and to influence 
their attitudes and acts (p. 19). 


An intermediate or eclectic notion of 
the function of language is that of Lewis 
(1951). Language for him has three 
functions: (1) expressive of the emo- 
tions; (2) representative, taking the 
place of things and pointing them out;
and (3) evocative or social, in the 
sense that it arouses a response in the 
listener. The foregoing discussion of
other views was introduced in order to 
emphasize the fact that there are other 
dimensions of language besides Piaget’s 
rather limited treatment. 


RELATIONSHIPS OF THE MEASURES
OF LANGUAGE DEVELOPMENT


Something should be said about the
problem of the interrelations of lan-
guage skills. This is an area in which the
problem of methodology in the design
of a language study does not play a
major role. After a number of test
results have been secured, their in-
terrelationships may be determined by a
statistical treatment. Williams and
McFarland (1937) administered tests
to 38 preschool children and secured 
measures of speech sounds, word usage, 
length of expression unit, completeness 
and complexity of sentences, the Van 
Alstyne vocabulary, and the Williams 
vocabulary. He then calculated zero-
order correlations and partial correla-
tions holding chronological and mental
age constant. Templin (1957) has in-
tercorrelated measures of articulation,
length of response, complexity of re-
sponse, sound discrimination, vocabu-
lary, and number of different words. A
feature of this study is an attempt to use 
a terminal status measure as a base for 
estimating the interrelations of the 
various language skills. The percentage 
of terminal status scores attained on 
each of the major language measures is 
the method employed. Other studies 
concerned with the problem are by 
Harris (1948), Gerwitz (1948), Yedi-
nack (1949), Spiker (1951), and
Schneiderman (1955). 
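
The partial correlations mentioned above, in which age is held
constant, follow from the zero-order values by a standard formula.
The sketch below is illustrative only; the function and argument
names are hypothetical, and the numbering of variables (1 and 2
for two language measures, 3 and 4 for chronological and mental
age) is an assumption made for the example.

```python
from math import sqrt


def partial_r(r12, r13, r23):
    """First-order partial correlation of variables 1 and 2 with 3 held constant."""
    return (r12 - r13 * r23) / sqrt((1 - r13 ** 2) * (1 - r23 ** 2))


def partial_r_two_controls(r12, r13, r23, r14, r24, r34):
    """Second-order partial correlation of 1 and 2 with both 3 and 4 held constant.

    Built by applying the first-order formula to correlations from
    which variable 3 has already been partialled out.
    """
    r12_3 = partial_r(r12, r13, r23)
    r14_3 = partial_r(r14, r13, r34)
    r24_3 = partial_r(r24, r23, r34)
    return partial_r(r12_3, r14_3, r24_3)
```

If two language measures correlate with each other only because
both rise with age, the partial values will fall toward zero.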


THE PROBLEM OF EVALUATION
OF METHODS


The problem of evaluating methods
in speech and language research is
neither easy nor simple. The advantages
and disadvantages of particular methods
have been indicated in the chapter;
no comprehensive attempt is made ...
The adoption of a method, and in
turn a judgment of it, depends in large
measure on the purpose the experi-
menter has in mind. The efficiency of a
method is determined by the nature of
the hypothesis he wishes to test, the 
degree of precision to be attained, the 
nature of information wanted, and the 
costs of conducting the research. If one 
wishes to compare the uncontrolled 
method of observation used by Shinn 
(1900) or by Piaget (1926) with the 
carefully controlled methods employed 
by Spiker (1958) or by Peterson and 
Barney (1952), the extreme contrast is 
readily apparent. But how can a dif- 
ferential evaluation be made of the work 
of Fisichelli (1950), who used a cross- 
sectional method, and the comparative 
study of Sherman and Moodie (1957), 
who were concerned with the relative 
merits of psychophysical methods? Both
used careful controls. The difficulty of 
comparing the advantages of a test of 
sound discrimination, such as that of 
Schiefelbusch and Lindsey (1958), with, 
for instance, a test in another area, such 
as the Williams and McFarland (1937) re-
vision of the Smith Vocabulary Test,
is apparent. 

The cost of conducting an experi-
ment is an important factor in its design.
There is a vast difference between the 
expense involved in doing a job with a 
small centralized group or with a state 
or nationwide sample. Both may be 
meticulously controlled, one with preci- 
sion instruments, the other with non- 
instrumental comparative methods. Yet 
both methods contribute valuable 
information. Each method, when 
buttressed by proper controls for the 
safeguarding of the data so as to yield 
the information wanted, has a legitimate 
place in the armory of the research 
worker. 

The directions that the improvement 
of research techniques in the field of 
speech and language seems to be taking
at present are twofold: (1) the further 
development and refinement of tests 
such as those used not only with normal 
children but also with retarded children, 
those with cleft palates, cerebral palsy, 
aphasia, etc.; (2) the invention of preci- 
sion instruments of both mechanical and 
electrical nature. 

Hall in Travis’ Handbook of Speech 
Pathology indicates one area in which 
progress is needed. She states: 


It is to be hoped that further investi- 
gation will be made along the line of ob- 
jectifying and measuring articulation. A 
valid, reliable, standardized quantitative 
index of articulation would be extremely 
valuable, both for research purposes—so 
that articulation data could be handled 
statistically and have meaning through- 
out the country—and for clinical progress 
with speech therapy (p. 777).


Efforts in this direction are being 
made. The work of Sherman (1957) and 
her colleagues on the relative merits 
of several psychophysical methods for
investigating defective articulation il- 
lustrates Hall’s statement. Bangs (1942) 
has worked with retarded children. Dale 
O. Irwin (1959) has devised a short 
articulation test for the purpose of 
discriminating the speech of children of 
various mental levels. Templin (1957) 
has devised several tests for use with 
normal children, and Spriestersbach, 
Darley, and Rouse (1956) have used 
them with cleft-palate children. 

A special problem is encountered 
with handicapped children that in the 
future must be taken into consideration, 
especially with those with cerebral 
palsy. It is a well-known fact that these 
children fatigue very easily and that 
their attention span is quite brief. The 
problem presented to the test con- 
structor as well as to the therapist is to 
secure optimal results during the brief 
period in which testing can be done 
without fatigue. With this type of 
child, short tests that do not place too 
great a burden on them should be used. 
With this purpose in mind, Irwin 
(1958a,b, 1957€, 1956c) has devised a 
series of tests, each of which requires 
about five to ten minutes for administra- 
tion. Further, he has attempted to stand- 
ardize these tests on a nationwide sample 
of cerebral palsied children, so that 
they would have meaning throughout 
the country. 

Further refinement of tests will un- 
doubtedly result from more careful 
standardization techniques. In standard- 
izing future tests, it should be insisted 
that the reliability, and wherever pos- 
sible the validity, of the test be re- 
ported. In the past such statements have 
often been omitted, with the result that 
no accurate judgment concerning de-
pendability can be made by others who
wish to use the test. As indicated above,
other test criteria that might be met and
reported are the difficulties, the dis-
criminating power, and the uniqueness
of its items. Moreover, whenever a test
is composed of subtests, it is good prac-
tice to report the intercorrelations.

A brief word must be said about
trends in the use of precision instru-
ments. A casual reading of Steer and
Hanley’s (1957) chapter on “Instru-
ments of Diagnosis, Therapy and Re-
search” in Travis’ handbook will reveal
the tremendous progress that has been
made in the invention and application
of apparatus to problems of speech in
the last few years. At present, nowhere
else in the literature is there gathered in
one place the amount of valuable de-
scriptive and illustrative materials as
here. Future trends will probably be
concerned not so much with the inven- 
tion of new instruments as with the re- 
finement of the present supply of 
instruments and applications to new 
speech problems as they arise. There 
is an increasing trend toward the wider 
use of these instruments not only in the 
speech laboratory but in connection 
with clinical diagnosis and the processes 
of therapy. The chapter in the hand- 
book is highly recommended to the re- 
search man, the clinician, and the 
therapist.

Descriptive linguistics is the branch of
anthropology concerned with the struc- 
ture of language. Until very recently 
the psychologist studying child lan- 
guage has been innocent of the findings 
of linguistics. Our evaluation of lan- 
guage in the abstract has been con- 
ventionally exalted: language is, 
presumably, the characteristic human 
behavior, the behavior separating man 
from the animals. At the same time, 
much research and theory concerning 
the child’s acquisition of speech has re- 
duced this behavior to something that 
might reasonably be expected of many 
animals. Partly, no doubt, this is because 
we have wanted to see in language only 
as much as we felt ready to explain. 
Partly it is because many aspects of 
language are visible only from the 
linguist’s vantage point; when the full 
range of variation in human speech has 
been grasped, one begins to see how
profound is the effect of each language
in shaping the minds of those who speak 
it. To the psychologist who has studied 
linguistics, the child’s acquisition of 
speech seems a more interesting topic 
than it has ever seemed before. For 
the study of this topic, we need a new 
set of methods informed with both 
linguistic and psychological knowledge. 
With characteristic optimism, the name 
for these methods was coined in ad- 
vance of their invention. They were to 
be called “psycholinguistic.” Today 
some of them have actually been 
worked out and tried, others we can 
envision only in a general way. 

Every natural language is a system. 
From knowledge of one part, it is pos- 
sible to anticipate many other parts 
correctly. The linguistic scientist studies 
some finite set of utterances (his 
linguistic “corpus”) in search of the
recurrent elementary units and patterns 
of combination that will generate the 
infinite set of utterances belonging to 
the language. Every child, in learning 
his first language, does much the same 
thing. He is not, in his first few years, 
exposed to all possible utterances be- 
longing to the community language but 
only to that small sample brought to 
him by his family and their friends. Ex- 
posure to the sample, however, teaches 
him to understand and to produce 
utterances not actually experienced but 
implied by what has been experienced. 
The child may begin as a parrot imitat- 
ing what others say, but he will end as a 
poet able to say things that have 
not been said before but that will 
be grammatical and meaningful in his 
community. This is the terminal 
achievement that a theory of language 
acquisition must explain.

Every natural language is constructed
from a small number of distinctive
elements (vowels and consonants, levels
of pitch, pauses, and stresses), the
“phonemes” of a language. It is
conceivable that the linguist might work
out a set of rules of phoneme combina- 
tion that would generate all the ac- 
ceptable sentences in the language, but 
any such set of rules would be far too 
complex to be useful. Descriptions that 
can be grasped are achieved by work- 
ing at several “levels.” On the first, the 
phonological level of description, the 
phonemes of a language and also the 
permitted patterns for their combina- 
tion are enumerated. These never in- 
clude all conceivable patterns. In 
English, for instance, we do not begin 
syllables with mk or kn or sr or with 
any of a number of other logically pos- 
sible consonant clusters. 
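
The restriction on syllable-initial clusters can be pictured as a simple
screening procedure of the kind one might apply to words invented by a
child. The following is only a minimal sketch: the list of permitted
clusters is a hypothetical and incomplete sample, the spellings are rough
stand-ins for phonemic symbols, and the names `PERMITTED_CLUSTERS` and
`has_permitted_onset` are our own inventions for illustration.

```python
# Minimal sketch: screen invented syllables for permitted initial clusters.
# The cluster list is a hypothetical, incomplete sample, not a full
# description of English; symbols are rough phonemic spellings.
VOWELS = set("aeiou")
PERMITTED_CLUSTERS = {
    "pl", "pr", "bl", "br", "tr", "dr", "kl", "kr", "gl", "gr",
    "fl", "fr", "sl", "sm", "sn", "sp", "st", "sk", "sw",
}


def onset(syllable: str) -> str:
    """Return the consonant symbols that precede the first vowel."""
    consonants = ""
    for symbol in syllable:
        if symbol in VOWELS:
            break
        consonants += symbol
    return consonants


def has_permitted_onset(syllable: str) -> bool:
    """Single consonants pass (a simplification); clusters must be listed."""
    initial = onset(syllable)
    return len(initial) <= 1 or initial in PERMITTED_CLUSTERS


for candidate in ["spow", "blik", "mkat", "knip", "srat"]:
    print(candidate, has_permitted_onset(candidate))
```

Under these assumptions the invented forms beginning with mk, kn, and sr
are rejected, while forms such as spow* and blik* are accepted.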

On the grammatical level of descrip- 
tion the morphemes of a language, and 
also the rules for their combination, are 
enumerated. The morphemes are the 
smallest individually meaningful ele- 
ments in the utterances of a language; 
a much larger set than the phonemes 
but still a finite set. Free morphemes are 
meaningful forms that can stand alone, 
such as dog and cat. Bound morphemes 
are forms that never stand alone but 
that retain some consistent meaning in 
the various combinations into which 
they enter; e.g., the possessive ’s of 
dog’s and cat’s. 

Grammatical description is often sub- 
divided into the morphological and 
syntactic levels. Morphological rules 
describe the construction of words 
from free and bound morphemes, 
whereas syntactic rules describe the
construction of utterances from words.
In fact, to obtain a usefully simple set
of rules, syntactic description has to
work with classes of functionally
equivalent words. It may be that
descriptive grammar will eventually do
without the word as a unit of analysis
and also, therefore, without any
distinction between morphology and syntax.
The postulates of grammatical
description are by no means so clear and
objective today as are those of
phonological description. For the present,
morphology and syntax are usually
distinguished and are so here. This
chapter undertakes to describe those
few researches that have approached
language as a system, dividing the report
into procedures for the study of the
phonological system, procedures for
work on the morphological system, and
procedures for work on the syntactic
system.

Various morphemes and combinations 
of morphemes are associated with
identifiable nonlinguistic circumstances. 
Bunker Hill Monument goes with a cer- 
tain artifact; singing a hymn is the name 
of a human performance; Mr., as a term 
of address, is associated with certain 
characteristics of the addressee and of 
the relation between addressee and
speaker. In fact, language makes ref- 
erence. The linguistic scientist is little 
interested in what we will call the 
reference system. He has tried to purge 
his analytic methods of any reliance on 
the determination of particular mean- 
ings and reluctantly retains in his 
postulates judgments of the presence 
or absence of meaning and of identity 
and difference in meaning. However, 
the psychologist studying language
acquisition will not wish to exclude
reference, from which linguistic meaning
develops. The study of linguistic
meaning in that full sense which
includes the many varieties of
connotation is identical with the study of
verbal behavior, and this, of course,
includes the greater part of social
science: attitude scaling, all sorts of
paper-and-pencil tests, interviewing, etc.
We see this clearly in the fact that the
Semantic Differential (Osgood, Suci, and
Tannenbaum, 1957), a paper-and-pencil
method for studying connotative meaning,
is used for everything from the
evaluation of psychotherapy to research
on consumer motivation. In order to
keep this chapter focused on methods
that are psycholinguistic in a narrow
sense, we restrict our discussion of
meaning to the reference system.

We call reference a system because, 
like the three strictly linguistic systems, 
it has its regularities. With reference, 
as with phonology, morphology, and 
syntax, the child must and does ex- 
trapolate beyond the information he is 
given. Linguistic forms do not name 
unique space-time points; they name 
categories or concepts. Having heard a 
certain man called Uncle Fred on 
several occasions, the child correctly 
identifies this referent when it turns up 
in an unaccustomed quarter of town 
wearing a new suit. Having been taught
the difference between “Go into that 
room” and “Do not go into that room,” 
the child may understand at once the 
difference between “Come over here” 
and “Do not come over here.” Though 
he has not heard the sentence The 
keeper kicked the ostrich, he may 
know the direction of the action from
his understanding of the general
association between word order and
subject-object relations. We extend our
coverage, therefore, to include some of 
the methods used to study the reference 
system. 

In an important sense, the rules 
written down by the linguist for 
phonology, morphology, and syntax 
and by the lexicographer for reference 
are what naïve adult speakers of the 
language already know and what chil- 
dren born into the community will 
learn. There is also a sense in which 
these rules are not known either to the 
child or to the naïve adult. They are 
not known explicitly; they are not 
formulated by the naïve speaker. They 
are known in that they are obeyed. For 
example, most adult speakers of Eng- 
lish, if asked how they usually form the 
past tense of a verb, will say that they 
add -ed. This is a notion derived from 
the written form of the language, 
which does not accurately summarize 
all the details of speech practice. To 
regular verbs ending in voiced conso- 
nants, such as grab or sag, or ending in 
vowels, such as sow or die, the speaker 
of English adds /d/ for the past. How- 
ever, for regular verbs ending in voice- 
less consonants, such as kick or sip, the 
appended consonant is /t/, as you will
notice if you listen to yourself. When
a verb already ends in a /d/ or a /t/,
as with bat or pad, the past is formed
by appending /əd/. All of us who speak
English follow these rules, but few of 
us would have said in advance that the 
form of the regular inflection for past 
tense is contingent on the character of 
the preceding vowel or consonant: after
voiced consonants or vowels /d/ is
appended, after voiceless consonants /t/,
and with verbs ending in /d/ or /t/ one
adds /əd/. As an explicit rule, this set
of contingencies is “news” to the
ordinary naïve speaker, but, in another
sense, it is not “news,” since he regu- 
larly follows these contingencies, and 
indeed they were abstracted as rules 
from his unwitting practice. 
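
The three contingencies just stated can be written out as an explicit
procedure. What follows is a minimal sketch rather than a complete account
of English: the verb's final sound is supplied by hand as a rough phonemic
symbol, the set `VOICELESS_FINALS` is an illustrative and incomplete list
of our own, and the pronunciation assumed for the invented verb spow* (a
final vowel, as in sow) is likewise an assumption.

```python
# Voiceless final consonants other than /t/ (illustrative, not exhaustive).
VOICELESS_FINALS = {"p", "k", "f", "s", "ʃ", "tʃ", "θ"}


def past_allomorph(final_phoneme: str) -> str:
    """Choose the regular past-tense ending from the verb's final sound."""
    if final_phoneme in {"t", "d"}:
        return "əd"   # bat, pad
    if final_phoneme in VOICELESS_FINALS:
        return "t"    # kick, sip
    return "d"        # voiced consonants and vowels: grab, sag, sow, die


# Final sounds are entered by hand; "spow*" is an invented verb whose
# final vowel is assumed for the sake of the example.
EXAMPLES = {
    "grab": "b", "sag": "g", "sow": "o", "die": "ai",
    "kick": "k", "sip": "p", "bat": "t", "pad": "d", "spow*": "o",
}

for verb, final in EXAMPLES.items():
    print(f"{verb}: /{past_allomorph(final)}/")
```

A child who supplies /d/ for spow* is behaving as this procedure does,
although he could not state any of its conditions.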

The speaking of utterances that fol- 
low rules really does not compel us to 
credit the speaker with “implicit” 
knowledge of the rules. He may simply 
be repeating what he has heard in total 
innocence of any regularities in the 
things he says. It is only when the child 
says something new or applies an ut- 
terance to a new referent so as to con- 
form to linguistic rules that we are 
forced to credit him with more than 
a repertoire of rehearsed sayings. The 
difficulty with this requirement is that 
the psychologist usually cannot know 
exactly what sample of speech has been 
fed into a subject and so cannot ordi- 
narily tell whether a correct utterance 
results from the extension of an implicit 
rule or simply from exposure to that 
utterance. 

When a child is under continual sur- 
veillance, and that usually means he is 
in one’s home, it is possible to have 
some idea of the speech sample to 
which he has been exposed. A 3-year- 
old boy, whom one of us has watched 
acquire English, heard for the first time 
the expression another one in connec- 
tion with two books. “This is a book 
and that is another one” is the sentence 
he heard. Shortly thereafter he heard, 
“This is one of my shoes,” and he sup- 
plied together with the correct referent 
the continuation, “And here is the other
one.” This could be recognized as an
impressive abstraction of a rule of
reference for the reason that the observer
could be reasonably sure that his
subject had not heard another one in
connection with shoes. For this reason,
parents can make valuable observations
as to the acquisition of language,
and the various diaries of language
development (Leopold, 1947;
1936; Stern and Stern, 1928) provide
the data for nonexperimental
psycholinguistic research.

When the child’s past experience of
the language is not known to us, the
clearest evidence of implicit rules
derives, ironically, from his systematic
errors. If he says sheeps or I bringed,
he says something he probably has not
heard from others, and what he says is
a projection of a general rule. Because
English is not so simply regular, the
child makes an error, and the error
reveals his knowledge of regularities.
Most of the time this knowledge will 
lead him to say what is correct, and so 
it is hard to be sure that he is not re- 
peating what he has heard. But when 
he oversimplifies the language, the gen- 
eralizing process is made evident. 

PSYCHOLINGUISTIC RESEARCH METHODS

For the detailed study of language
acquisition, one cannot wait for the
production of errors. It is possible to
study the child’s knowledge of the
language system by asking him to invent
new forms or to interpret linguistic
inventions of the psychologist or to name
referents he cannot have seen before.
He can, for instance, be asked to make
up words to see whether his creations
follow the phonological rules of the
language. He might, on the
morphological level, be shown a picture of a
man performing some unfamiliar action
and told, “This man is spowing.*”
Spowing* is, of course, an invented
word, and its referent is an invented
action. The procedure would continue,
“This man spows* every day. Today
he spows.* Yesterday he ______.” The
blank is for the child to fill from his
knowledge of the formation of the past
tense in English. In this case the
referent is included only to make the
procedure familiar and interesting, like
going through a picture book. The
inflection for the past tense is the
problem, and it could be studied without
pictures. (“I am thinking of a man who
is spowing.* He spows* every day.
Today he spows.* Yesterday he ______.”)
At another time reference might be our 
central concern. Suppose we were in- 
terested in the child’s understanding of 
the phrase a kind of. We could show 
him a heterogeneous collection of 
blocks and tell him that some large class 
of them is called the biks.* “Migs,*” 
we might say, “are a kind of bik.* Can 
you guess which blocks might be 
migs*?” The question to be answered 
is whether the child would select a sub- 
class of the class biks.* This class of 
procedures, which requires the child to 
extend his linguistic rules to new ma- 
terials, seems to us to be the best 
adapted to the investigation of his 
knowledge of the systematic aspects of 
his language, and this chapter is mainly 
concerned with the application of this 
general procedure to the particular
language systems. When the child is asked
to create linguistic forms (e.g.,
spowed*), we are testing his
“productive” control of the rules; when he is
asked only to interpret invented forms
(e.g., to identify the migs*), we are
testing his “receptive” control.

Finally, let us agree with our readers 
on several conventions of presentation. 
The difference between phonetic and 
phonemic transcriptions of speech 
sounds will shortly be described. In ac- 
cordance with custom, we use square 
brackets to enclose phonetic symbols, 
e.g., [p’], and slanted lines for
phonemic symbols, e.g., /p/. When the
sound corresponding to the symbol is
not obvious, we add an illustrative
English word with the letter(s)
corresponding to the sound underlined, e.g., [ŋ],
as in hang. When linguistic forms are 
discussed, they are italicized to indicate 
that it is the forms, not the meanings, 
to which we refer, e.g., dog is a mono- 
syllable, and invented forms are also 
starred, e.g., the verb spow.* Each of 
the four large sections of this chapter 
begins with a very cursory introduction 
to linguistic theory concerning the sys- 
tem in question. It is not possible to 
make them even approximately com- 
plete and so we have, in each case, cited 
what seem to us to be the best sources 
for extended reading. 


The Phonological System 


The linguist writes down the utter- 
ances he hears, using some set of pho- 
netic symbols, probably an alphabet and 
a number of diacritica. In recent years 
close acoustic study has shown that the 
individual “phones” (the vowels and 
consonants) of anyone’s speech are all, 
in some detail or other, unique. This is 
true even of those phones that a speaker 
intends to be the same and thinks of as 
being the same. No linguist can possibly
hear all of the physical differences that
exist in the speech he records, but train- 
ing will have sharpened his perceptual 
discrimination so that he hears much 
more phonetic detail than does the un- 
tutored ear. It is generally not practical 
for the phonetician to preserve in his 
written transcription all of the phonetic 
detail he can discriminate under optimal 
conditions. In practice, phonetic rec- 
ords preserve major distinctions in the 
speech being studied; far more distinc- 
tions than the native speaker pays any 
attention to but not so many as the 
well-trained human ear is capable of 
hearing and, of course, not the infinite 
variety that physical instruments can 
register. The important thing about the 
phonetic transcription is that it be a 
culture-free record of the sounds of 
speech in a given language. A phonetic 
symbol such as [p’] stands for the class 
of phones produced by bringing together
the two lips and strongly exploding
a puff of air (i.e., with “aspiration”)
without, at the same time, activating
the vocal cords (i.e., “voicelessly”).
The symbol stands for a class
of phones and not for a particular 
unique phone because it does not indi- 
cate such additional properties of any 
particular phone as the general level of 
intensity or the duration of the sound. 
The phonetic [p’] also, and most im- 
portantly, implies nothing about the 
functional significance of this class of 
phones for the linguistic community. It 
does not indicate whether the commu- 
nity thinks of a phone of this type as 
different from a phone of the type [b] 
(like [p’], except that it is “voiced” 
and not “aspirate”) nor whether [p’] 
and [b] are used by the community to
distinguish one meaning from another.
It is the business of the phonemic tran- 
scription to record speech in terms of 
the functional distinctions made by the 
community. The phonetic transcription 
is a rough physical record that is pre- 
requisite to the phonemic transcription. 

A brief presentation such as we offer 
here can give only an imperfect and 
approximate understanding of the pho- 
neme. In this paragraph we present an 
analogy intended to convey the general 
lines of the phoneme concept in terms 
that are familiar; in the next paragraph 
we set down the essentials of the defini- 
tion offered by linguistic science in 
terms not so familiar to a psychologist. 
Imagine, now, that we have undertaken 
a study of the metal currency of some 
foreign country. Imagine, further, that
we have a collection of coins belong- 
ing to that currency. These will vary 
from one another in many respects: in 
color, size, brightness, printing, pic- 
tures, milling, smoothness, dirtiness, etc. 
Examined closely, there will be no ex- 
act duplicates among the coins even as 
there is none among phones. In the eco- 
nomic exchanges of the community, 
however, the various potentially dis- 
criminable differences are not of equal 
importance. Variations of size and color 
among American coins, for instance, as- 
sign a given specimen to one or another 
functional class; they make of it a 
penny, a nickel, a dime, a quarter, or 
a half-dollar. Within a class, one mem- 
ber is equivalent to another (though 
potentially discriminable in terms of 
year, shininess, etc.), and members of 
different classes are not functionally 
equivalent. Those of us who habitually 
operate with American coins think of
them as falling into five classes. We
regularly attend to size and color be- 
cause these characteristics determine 
functional class membership, and only 
under extraordinary circumstances do 
we notice characteristics of the coins
that have no functional importance.
Phones like coins are organized into a 
functional system, and the basic classes 
of this system are the phonemes. The 
phonemic structure of an unfamiliar 
language is worked out by the linguist 
from study of the speech practices of 
the community. He cannot tell in advance
which characteristics of the
phones he hears will have functional
significance for the community. In
English we have many consonant pairs
that are alike, except that one is voiced
(the vocal cords are producing a
complex musical tone) and the other is
voiceless (the cords are silent and the
sound is produced by movements of
the air in the mouth), e.g., [b] and
[p’], [d] and [t], [g] and [k]. The
linguist whose native language is
English will notice this dimension of
variation in any unfamiliar speech to which
he is exposed, even as the American
who studies unfamiliar coins will attend
to size and color. However, it is always
possible that in the unfamiliar society
voicing does not distinguish phonemes
and color and size do not distinguish
coin types. It is essential that both the
linguist and the coin classifier begin
their studies with a relatively full
description of their materials that does not
prejudge the local functional system;
for speech, this is the phonetic
transcription. The functional system must,
in both cases, be abstracted from the
behavior of community members. The
procedures used by the linguist for this
purpose are complicated but quite im- 
peccably behavioristic. For phones, as 
for coins, the outcome of this effort is 
the discovery of the functioning units 
of a behavior system. 

The linguist’s phonetic transcription 
of a collection of utterances will re- 
cord more phonetic distinctions than 
are actually functional for native speak- 
ers. Some of the phones that have been 
distinctively recorded by the linguist 
should be grouped together as one 
phoneme. Those that belong together
will have some articulatory and acoustic 
similarity, and this similarity causes the 
linguist to consider the possibility that 
certain phones are one phoneme. How- 
ever, similarity alone cannot establish 
the facts. The final decision depends on 
the way the phones are distributed 
among the utterances and on certain 
judgments obtained from native speak- 
ers. There are two kinds of relationship 
among phones that indicate to the lin- 
guist that the phones in question are 
members of one phoneme. The rela- 
tionships are called “free variation” and 
“complementary distribution.” 

Phones in free variation are, essen- 
tially, phones that are mutually substi- 
tutable one for another; the variations 
among them are unintentional and have 
no significance for native speakers.
Indeed, native speakers are not ordinarily
aware of the variations. For example, 
the force and the pitch and the breathi- 
ness of the initial [k] sound in cat can 
be altered to yield a variety of poten- 
tially discriminable phones. Native 
speakers of English, however, will not 
ordinarily pay any attention to these 
changes and, in fact, think of the
pronunciations as all essentially the same.
There are several behavioral tests from 
which the linguist can learn whether 
phones are free variants in the minds 
of native speakers. The linguist can tell 
his informant that he intends to
“repeat” a given utterance, e.g., cat, and
then deliberately run through the set 
of phones suspected of being in free 
variation. If the native speaker accepts 
these various pronunciations as “repe- 
titions,” then the varying phones belong 
to one phoneme. The linguist can also 
ask the native speaker to repeat a form 
and note what variety of pronuncia- 
tions is offered by the speaker as a set 
of repetitions. Then there is sometimes
the possibility of noting whether pho- 
netically variant forms are used to name 
the same referent. If they are, then it 
is probable that the native speaker 
thinks of the forms as the same. 

Phones in complementary distribution
are essentially variant forms of a
single phoneme in which the variations
are predictable from preceding or
subsequent phones. In English the initial
[k] of keep is articulated farther for- 
ward on the soft palate than the initial 
[k] of cool. It is as if the speaker antic- 
ipated the placement of the front 
vowel in making the [k] of keep and 
the back vowel in the [k] of cool. In- 
deed, these two forms of the English 
phoneme /k/ follow a consistent pat- 
tern in which the more forward [k] 
precedes front vowels and the [k] pro- 
duced farther back precedes back vow- 
els. These two phones are, for English, 
in complementary distribution, by 
which we mean that they occur in 
mutually exclusive phonetic
environments. With a little training, a linguist
can learn to hear the difference between
the two phones, but it is not a
difference to which the native speaker
attaches any significance. Such phones in
complementary distribution are called,
in descriptive linguistics, “allophones” 
(or variant forms) of a single phoneme. 
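
The notion of mutually exclusive environments lends itself to a simple
tabulation. The sketch below is offered only as an illustration under
stated assumptions: each utterance is a list of hand-transcribed phone
labels, the labels `k_front` and `k_back` are our own hypothetical names
for the two varieties of [k], and an environment is taken to be the
immediately preceding and following phone. A linguist would generalize
environments into classes (e.g., front vowels) and would want a far larger
corpus before drawing any conclusion.

```python
def environments(corpus, phone):
    """Collect the (preceding, following) contexts in which a phone occurs."""
    found = set()
    for utterance in corpus:
        padded = ["#"] + list(utterance) + ["#"]   # "#" marks a boundary
        for i in range(1, len(padded) - 1):
            if padded[i] == phone:
                found.add((padded[i - 1], padded[i + 1]))
    return found


def in_complementary_distribution(corpus, phone_a, phone_b):
    """True when both phones occur and never share an environment."""
    env_a = environments(corpus, phone_a)
    env_b = environments(corpus, phone_b)
    return bool(env_a) and bool(env_b) and env_a.isdisjoint(env_b)


# Toy corpus with hypothetical phone labels.
corpus = [
    ["k_front", "i", "p"],   # keep
    ["k_back", "u", "l"],    # cool
    ["k_front", "i", "n"],
    ["k_back", "o", "t"],
    ["p", "i", "n"],
    ["b", "i", "n"],
]

print(in_complementary_distribution(corpus, "k_front", "k_back"))
print(in_complementary_distribution(corpus, "p", "b"))
```

In this toy corpus the two varieties of [k] never share an environment,
whereas [p] and [b] both occur initially before the same vowel and so
cannot be allophones on distributional grounds.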

Examples of complementary
distribution also occur in English printing,
and these are probably more obvious
than examples from our speech. The
upper-case letter, for instance, occurs
at the beginning of a sentence and the
lower-case letter does not. In this
respect, therefore, such letters as s and S
are in complementary distribution; i.e.,
where the one occurs the other does not.
These are printed letters, not spoken
forms (pronunciation of s and S does
not vary in any consistent way). The
linguist would call s and S “allographs”
of one “grapheme” in the writing
system of English. Allophones, like
allographs, are variant forms of one basic
type, with the variations being
predictable from preceding and following
phones, that is, from the phonetic environment.
Finally, there are the phones that be- 
long in distinct phonemes, the variations 
in speech that the native speaker notices 
and uses to discriminate meanings. 
Phones that are phonemically distinct 
are never in complementary distribu- 
tion. They distribute like free variants, 
i.e., they occur in some utterances that 
are otherwise identical. The difference 
between phonemically distinct phones 
and free variants lies with the behavior 
of the native speaker. He will not ac- 
cept phonemic changes as “repetitions”; 
he will not offer them as “repetitions”; 
he will not use utterances that vary in 
this way for the same referent. In short,
using the tests for free variation, a set
of results opposite to those defining
free variation identifies phonemically
distinct phones. It is important to note
that all of these linguistic procedures
are adaptable to children and can be
used to work out the phonemic system
of a child’s speech.
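
The referent test described above can be sketched as a small sorting
procedure. We assume, purely for illustration, a record in which each
transcribed form (a tuple of phone labels) is paired with the referent the
speaker used it to name; the function names, the label `kh` for a breathier
[k], and the toy entries are all our own. Forms that differ in a single
phone but name the same referent suggest free variants; forms that differ
in a single phone and name different referents suggest distinct phonemes.

```python
from itertools import combinations


def one_segment_apart(form_a, form_b):
    """True when two forms have equal length and differ in exactly one phone."""
    return len(form_a) == len(form_b) and sum(
        x != y for x, y in zip(form_a, form_b)
    ) == 1


def classify_pairs(lexicon):
    """Map each differing phone pair to a tentative label.

    `lexicon` maps a transcribed form to the referent it was used to name.
    The labels are candidates only; a real analysis would also need the
    repetition judgments of the speaker.
    """
    results = {}
    for (form_a, ref_a), (form_b, ref_b) in combinations(lexicon.items(), 2):
        if one_segment_apart(form_a, form_b):
            differing = next((x, y) for x, y in zip(form_a, form_b) if x != y)
            same = ref_a == ref_b
            results[differing] = "free variants" if same else "distinct phonemes"
    return results


# Toy record with hypothetical transcriptions and referents.
lexicon = {
    ("d", "i", "n"): "noise",
    ("t", "i", "n"): "metal",
    ("k", "a", "t"): "cat",
    ("kh", "a", "t"): "cat",
}

print(classify_pairs(lexicon))
```

The same tabulation could in principle be kept for a child, provided the
observer records the referent along with each utterance.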

Articulatory and acoustic variations
that serve to differentiate phonemes in
one language may differentiate
allophones or free variants in another
language. In English, as we have
mentioned, most of our consonants can be
paired as voiced and voiceless
phonemes. Pittman (1948) reports that in
the Oto language (Oklahoma) some of
these consonant pairs are in free
variation, i.e., one member of the pair is an
acceptable substitute for the other, as 
if din and tin were the same word. In 
English today we produce vowels of 
relatively long duration before voiced 
consonants and shorter vowels before 
voiceless consonants. (Contrast mode- 
mote, tied-tight, news-noose.) These 
longer and shorter vowel pairs are al- 
lophones of a single phoneme, since 
they occur in complementary distribu- 
tion (the one before voiced consonants 
and the other before voiceless). In Eng- 
lish of the past and in many languages 
today this difference is phonemic, i.e., 
a change of vowel length ordinarily 
signals a change of meaning. In Navaho, 
for instance, the long and short [o] are 
two different vowels as surely as /o/ 
and /i/ are different to us. For the psy- 
chology of language acquisition, it is 
of great importance to know that 
acoustical and articulatory variations 
may function distinctively in one
language and not in another. All languages
do not recognize the same distinctions;
phonemes are not marked off by 
acoustic features that are prepotent for 
the whole human species. Speakers of 
a given language have to learn what 
range of variations is treated as invari- 
ant by their community and what vari- 
ations are treated as distinctive. In short, 
there are perceptual problems of pho- 
neme constancy and acquired distinc- 
tiveness of cues. 

The “distinctive features” of an in- 
dividual phoneme would be those as- 
pects of the process of articulation and 
their acoustic consequences that serve 
to contrast one phoneme with others. 
In English speech the phoneme /b/ is 
always a stop (i.e., it is produced by
an abrupt explosion of air), and in this 
respect it contrasts with a phoneme 
such as /v/, which is a fricative (i.e.,
it is produced by means of a con- 
stricted, continuous stream of air). The 
/b/ phoneme is also voiced, and, in this 
respect, it contrasts with /p/. Jakobson, 
Fant, and Halle (1952) have proposed 
that any phoneme may be described as 
a bundle of concurrent distinctive fea- 
tures. They have given us a list of these 
features (about a dozen) that is pre- 
sumably adequate for the specification 
of the phonemes of all languages. Each 
feature is characterized in both articu- 
latory and acoustic terms, and, perhaps 
because the authors are impressed with 
the binary computer as an analogue to 
human nervous processes, each is con- 
ceived as operating on a two-alternative 
basis. This distinctive feature analysis, 
which is completely general, makes it 
possible to compare the phonemic sys- 
tem of one language with the systems 
of other languages. It is also extremely
economical. It is not yet clear how
adequate the description is to the facts of
speech in many languages, but distinc- 
tive feature analysis has already proved 
very fruitful for a number of studies 
(e.g., Miller and Nicely, 1955; Brown
and Hildum, 1956). 
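
The idea of a phoneme as a bundle of two-valued features can be made
concrete with a very small table. The sketch below uses only the two
contrasts mentioned above (voiced against voiceless, stop against
fricative); the feature names `voiced` and `continuant` are our own
simplified labels and are not the list proposed by Jakobson, Fant, and
Halle.

```python
# Each phoneme is a bundle of binary features (simplified, hypothetical labels).
FEATURES = {
    "b": {"voiced": True, "continuant": False},   # voiced stop
    "p": {"voiced": False, "continuant": False},  # voiceless stop
    "v": {"voiced": True, "continuant": True},    # voiced fricative
    "f": {"voiced": False, "continuant": True},   # voiceless fricative
}


def contrasting_features(phoneme_a, phoneme_b):
    """Return the names of the features on which two phonemes differ."""
    bundle_a, bundle_b = FEATURES[phoneme_a], FEATURES[phoneme_b]
    return {name for name in bundle_a if bundle_a[name] != bundle_b[name]}


print(contrasting_features("b", "p"))
print(contrasting_features("b", "v"))
print(contrasting_features("b", "f"))
```

With two binary features four phonemes are kept apart, which illustrates
the economy of the description: the number of phonemes that can be
specified doubles with each added feature.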

The various phonemes of a language 
do not occur with anything like the 
same frequencies nor do they succeed 
one another with the same frequencies. 
There are always many conceivable 
phoneme combinations that do not oc- 
cur at all. These frequency character- 
istics of the language are eventually 
absorbed by a person who learns the 
language and function as important de- 
terminants of his perception of speech. 
Numerous studies (Miller, Bruner, and 
Postman, 1954; Brown and Hildum, 
1956) have shown that the likelihood 
of correct perceptual identification of 
phonemes is a function of the prob- 
abilities of occurrence of the phonemes 
in the given contexts. This is, perhaps, 
more obvious with handwriting than 
with speech. Handwritten letters of the 
alphabet are more likely to be errone- 
ously identified by a typist when they 
occur in unusual proper names than 
when the same letters occur in familiar 
words in which they are predictable 
from context. The same sort of thing 
is true for the phonemic identification 
of individual phones in speech. Acoustic 
cues combine with probabilities gener- 
ated by context to determine phoneme 
identifications. Methods in use for 
studying the role of probability in adult 
speech perception could easily be 
adapted to work with children, but this 
work has hardly begun. 
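
One simple way to picture how acoustic cues and contextual probabilities might combine is a multiplication of the two followed by normalization. The sketch below assumes this Bayes-style rule and uses invented numbers; the studies cited do not commit to this particular model, and the function name `identify` is hypothetical.

```python
def identify(acoustic_likelihood, context_prior):
    """Combine acoustic evidence with contextual probability.

    Both arguments map candidate phonemes to non-negative weights.
    Returns the normalized probability of each candidate.
    """
    joint = {ph: acoustic_likelihood[ph] * context_prior.get(ph, 0.0)
             for ph in acoustic_likelihood}
    total = sum(joint.values())
    if total == 0:
        raise ValueError("no candidate is compatible with both cue and context")
    return {ph: w / total for ph, w in joint.items()}


# Hypothetical numbers: the acoustic cue slightly favors /p/ ...
acoustic = {"b": 0.4, "p": 0.6}
# ... but the context makes /b/ far more probable.
context = {"b": 0.9, "p": 0.1}

posterior = identify(acoustic, context)
best = max(posterior, key=posterior.get)
print(best, round(posterior[best], 3))  # b 0.857
```

With these assumed values the contextually probable phoneme is chosen even though the acoustic cue alone points the other way, which parallels the typist's misreading of letters in unusual proper names.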

For a general introduction to pho-
nemic procedures, we recommend the
treatment by Gleason (1955) for both 
clarity and accuracy. Work in acoustic
phonetics is very ably evaluated in
Hockett (1955). Connections between 
the mathematical theory of communi- 
cation and phonology are described by 
Miller (1951) and by Cherry (1957).
Brown (1958) provides an extended
psychological discussion of phonology.

Most phonological learning occurs in
the first three years of life, and, there-
fore, it is probably most likely to be
studied by parents in their own chil- 
dren. To conduct such studies, a parent 
will need training in phonetic tran- 
scription. The difficult part of this train- 
ing is the auditory discrimination, not 
the memorization of a set of symbols.
In addition, the parent will need to
study closely the linguistic procedures 
for discovering the phonemes of a lan- 
guage from adult speakers. The meth- 
ods to be used with children are essen- 
tially modifications of these procedures, 
which allow for the limited understand- 
ing of very young children. We de- 
scribe the general character of these 
procedures and also several linguistic 
studies that put them to use. In gen- 
eral, however, linguists who have 
worked on this problem do not give 
as many details of procedure as the 
psychologist is accustomed to and do 
not include all of the controls we 
should like. So the psychologist with 
some linguistic training who tackles this 
problem will have some methodological 
problems to solve on his own. It is cer- 
tain, however, that psychologists as 
well as linguists have important contri- 
butions to make in this work. They will 
see how to place phonological learning
within the general context of intellec-
tual development, will see its relevance
to learning theory and cognitive theory,
and will think of empirical problems
not likely to occur to the linguist. Now,
we offer a rough guide to procedure.

There is today much valuable infor-
mation (Irwin, 1947a,b,c; Irwin and
Chen, 1946) on the frequencies with
which various phones occur in the vo-
calization of children at different ages.
In these studies the investigator makes
a phonetic transcription of the sounds
children produce and constructs fre-
quency polygons listing the phones
along the ordinate. With advancing age,
these polygons change toward an ever-
improving approximation of similar
polygons plotted for adult speech, and 
the results provide a dramatic repre- 
sentation of one aspect of phonological 
development. However, they are not 
directly relevant to the study of the 
child’s phonemic systems. 
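
The tabulation behind such frequency polygons can be sketched as follows. The transcriptions are invented strings with one character per phone, and the distance measure (sum of absolute differences between relative frequencies) is our assumption for illustration, not a statistic taken from the studies cited.

```python
from collections import Counter


def relative_frequencies(transcript):
    """Return each phone's share of all phones in a transcribed sample."""
    counts = Counter(transcript)
    total = sum(counts.values())
    return {phone: n / total for phone, n in counts.items()}


def distance(profile_a, profile_b):
    """Sum of absolute differences between two profiles (0 = identical)."""
    phones = set(profile_a) | set(profile_b)
    return sum(abs(profile_a.get(p, 0.0) - profile_b.get(p, 0.0))
               for p in phones)


# Hypothetical transcriptions, one symbol per phone.
adult = relative_frequencies("tatsanatasi")
child_early = relative_frequencies("aaabaaba")
child_later = relative_frequencies("tataanasa")

# The later sample lies closer to the adult profile than the early one.
print(distance(child_early, adult) > distance(child_later, adult))  # True
```

Such a profile records only how often each phone is produced; it says nothing about which phones contrast for the child, which is why it cannot settle questions about the phonemic system.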

The mere occurrence in child vo- 
calization of a phone, which the adult 
observer would classify as belonging to 
a certain phoneme in the adult lan- 
guage, does not demonstrate that the 
phoneme exists as a cognitive construct 
for the child; neither does any particu- 
lar frequency of such a phone provide 
this evidence. Suppose, for example, that 
a child employs in his vocalization both 
[s] as in sin and [S] as in fish. We
should place these two sounds in dis- 
tinct English phonemes, but we cannot 
tell whether or not the child does so 
from the single fact that he speaks both. 

How might we go about discovering 
whether the two sibilants function as 
distinct phonemes for the child? We 
could use procedures essentially the
same as those the linguist would use
if he were dealing with a naive adult 
speaker of an unfamiliar preliterate lan- 
guage. We are, in the first place, inter- 
ested only in sounds used in meaningful 
forms, forms used selectively to signal 
needs or to identify referents. The 
forms need not be the conventional 
words used by adults, but they must 
be meaningful to the child. Phonemic 
conclusions cannot be based on bab- 
bling or echoic speech. Indeed, many 
linguists (Jakobson, 1941; Velten, 1943) 
believe that phonemic structure begins 
to develop only after the babbling pe- 
riod has ended and that there is usually 
a term of silence separating the two 
processes and occurring near the end 
of the first year. 

If, in a child’s meaningful forms, the 
two sibilants [s] and [S] occur in com- 
plementary distribution, then probably 
they function for him as allophones of 
one phoneme. If these two consonants 
occur in some of the same phonetic en-
vironments, e.g., sill and shill, then they
may be either free variants of a single 
phoneme or members of distinct pho- 
nemes. When the two forms sill and 
shill have clearly different semantic 
rules, whether of reference or expres- 
sion, then the probability is that [s] 
and [S] belong to different phonemes 
for this child. 

The conclusion suggested by the dis- 
tribution of [s] and [S] in the child’s 
meaningful forms can be checked by 
asking the questions used by the linguist 
with adult informants. These questions 
will also distinguish between receptive 
and productive control of phonemic 
differences and, in this way, refine the 
conclusions suggested by distribution
alone. For example, hearing the child
say shill, we might ask him to repeat it 
several times, to say the word again 
and again. If he sometimes says shill 
and sometimes sill, evidently he does 
not have productive control of the dis- 
tinction between [s] and [S]; on the 
productive side they are functioning as 
free variants of a single phoneme. With 
an adult informant, this productive free 
variation would ordinarily imply recep- 
tive free variation, but in the child these 
two kinds of control of a phonemic dis- 
tinction may not be at the same level 
of development. To test receptive con- 
trol, we might offer sill as our repeti- 
tion of his shill, and, if he accepts it, 
then the two sibilants are free variants 
on the receptive as well as the produc-
tive side. It often happens, however, that
such “repetitions” will be rejected in 
the speech of another, even though they 
are unwittingly produced in the child’s 
own speech. The questioning technique
for learning about the child’s receptive 
and productive control might be sup- 
plemented with an experimental test 
described on p. 532 in reporting a study 
by Brown and Horowitz.

Jakobson (1941) has been far in ad- 
vance of all others, both linguists and 
psychologists, in distinguishing the 
learning of phonemic contrasts from 
simple babbling. This author believes 
that phonological development, viewed 
as the progressive differentiation of the 
phonemes of the community language, 
reveals universal human regularities. In 
his 1941 work he describes the sequence 
in which phonemic contrasts seem to 
emerge in any language, and in 1956 
with Halle he sets forth such a sequence 
in terms of their “distinctive feature”
analysis of the phoneme. Jakobson and
Halle (1956) believe that the child be-
gins in a “labial stage” in which his only
utterance is /pa/. This involves the
consonant-vowel contrast.

From the articulatory point of view the
two constituents of this utterance repre-
sent polar configurations of the vocal
tract; in /p/ the tract is closed at its very
end, while in /a/ it is opened as widely
as possible at the front and narrowed to-
ward the back, thus assuming the horn-
shape of a megaphone. This combination
of two extremes is also apparent on the
acoustical level: the labial stop presents a
momentary burst of sound without any
great concentration of energy in a par-
ticular frequency band, whereas in the
vowel /a/, there is no strict limitation of
time, and the energy is concentrated in a
relatively narrow region of maximum
aural sensitivity (p. 37).

This, according to the authors, is the
primal syllable, contrasting an ideal
consonant with an ideal vowel. The
next stage, they report, is likely to in-
volve a nasal-oral opposition, a distinc-
tion between /n/ and /p/. Alterna-
tively, it may involve a grave-acute
distinction between /p/ and /t/. The 
first vowel division is said to occur on 
the compact-diffuse axis and the next 
division on the grave-acute so that the 
child has /a/, /u/, and /i/. The discus- 
sion goes on to outline a complete se- 
quence, conforming to the general no- 
tion that the greatest possible phonemic 
distinctions are made first, with smaller 
differentiations following later. Jakob- 
son and Halle are familiar with the in- 
ternational literature on speech acquisi- 
tion and, presumably, derive their con- 
clusions from this literature. However, 
the Jakobson and Halle report does not
set down the detailed empirical support
for the various generalizations that are
made and does not give any explicit
attention to methodological considera-
tions. Necessarily, therefore, we must
regard their generalizations as hypothe-
ses rather than facts that have been
established.

There are two published studies re-
porting deliberate, fully competent in-
quiries into the learning of phonemic
systems. Leopold (1947) kept a very
detailed record of the speech of his
daughter Hildegarde from her eighth
week until her seventh year. The re-
sulting four-volume publication is the
ideal model for the descriptive diary
of a child’s language development.
Leopold is trained as a phonetician and
so was able to transcribe Hildegarde’s
speech with phonetic symbols. The vital
importance of such training for the po- 
tential diarist is apparent if one listens 
to the untrained American parent imi- 
tating the speech of a very young child. 
The sounds heard from the child are 
often those that do not occur in adult 
American English, but the imitating 
parent regularly assimilates them to his 
own standard versions of English pho- 
nemes and, not uncommonly, trans- 
forms what he hears into familiar 
words. The untrained adult cannot 
even approximate an accurate phonetic 
record.

Leopold’s method was to scribble 
notes on bits of paper whenever Hilde- 
garde spoke and later, at his leisure, to 
enter them into a chronological record. 
This is an enormously difficult thing to 
do and requires considerable practice. 
It is difficult to stay in the role of par-
ticipant-auditor in one’s own home.
Since one cannot always be with a
child, nor always recording when pres- 
ent, a diary cannot be a complete rec- 
ord of the child’s speech. It is, however, 
possible during the first few years of 
life to exhaust the child’s finite speech 
repertoire in sample observation pe- 
riods. 

Leopold’s record is informed through- 
out with a linguistic and psychological 
sophistication that is unique among 
diaries of child speech. He makes care- 
ful definition of all his linguistic units, 
telling us, for example, the behavioral 
criteria for a “word.” Psychologist- 
observers have almost never done this, 
and so it is impossible to make mean- 
ingful comparisons between the “vo- 
cabulary” reported for one child and 
that reported for another. Leopold 
summarizes his chronological record by 
describing the development of each of 
the major speech systems and, in ad- 
dition, compares these results for Hil- 
degarde with the speech development 
of his second daughter Karla and with 
what can be gleaned from earlier pub- 
lished records. In general, this work 
should be studied by the psychologist 
who is planning a diary record. How-
ever, as an example of procedures that
can be used in working out child pho- 
nology, it is complicated by the fact 
that Hildegarde developed as a bilingual 
child (learning both English and Ger- 
man), and so we turn to a much shorter 
report by Velten for an account of his 
method. 

Using a phonetic transcription, Vel- 
ten recorded (in order of appearance) 
all of the meaningful speech forms used 
by his daughter Joan from her eleventh
month until her thirty-sixth month.
Meaningful forms are, in general, those
used with some consistency of refer- 
ence. From this linguistic corpus, Joan’s 
phonemic system is abstracted accord- 
ing to the following rules. When two 
meaningful forms differ with regard to 
a single phone, e.g., [ba] and [da], it is 
possible that the two phones ([b] and 
[d]) belong to the same phoneme for 
the child or that they belong to distinct 
phonemes. If [ba] has the same mean- 
ing as [da], if the one form is offered 
as a repetition of the other, then the 
two consonants are free variants. If, 
however, [ba] has a different meaning 
than [da], then [b] and [d] must be- 
long to distinct phonemes. In Velten’s 
actual materials, as it happened, [ba] 
seemed to mean bottle whereas [da] 
seemed to mean down. There was also 
[za] for that, and Velten concludes that 
these three consonants were organized 
by his daughter as distinct phonemes. 
Joan’s words also made use of the con- 
sonants [p], [s], and [t], which are the 
unvoiced mates of [b], [z], and [d], 
respectively. It was clear from her 
words [ap], [as], and [at] that the 
three unvoiced consonants were pho- 
nemically distinct from one another, 
but one must also ask whether each 
voiced consonant was distinguished 
from its unvoiced mate: ([b] from [p], 
[d] from [t], [z] from [s]). In Joan’s 
vocabulary the voiced consonant was 
always in initial position and the voice- 
less consonant, in terminal position; i.e., 
the consonants were in complementary 
distribution and, therefore, to be re- 
garded as allophones (voiced and voice- 
less versions) of single phonemes. There 
were, for instance, the words [ba] and
[ap] but neither [pa] nor [ab]. Joan
did not have two words of different
meaning in which the only phonetic
distinction marking the difference of
meaning was the contrast between
voiced and unvoiced. This feature,
which is “distinctive” for the adult 
speaker of English, was not yet distinc- 
tive for Joan. 
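
The rules Velten applies can be restated as a small decision procedure. The sketch below is our paraphrase in Python, not Velten's own formulation. The forms are those reported above; the glosses for [ap], [as], and [at] are not given in this account, so placeholder glosses stand in for them, and the helper names `environments` and `relation` are hypothetical.

```python
# Joan's forms as reported in the text. Glosses for "ap", "as", and "at"
# are placeholders (an assumption); the text says only that they differed.
CORPUS = {"ba": "bottle", "da": "down", "za": "that",
          "ap": "gloss-1", "as": "gloss-2", "at": "gloss-3"}


def environments(phone):
    """Set of contexts in which a phone occurs, with '_' marking its slot."""
    return {form[:i] + "_" + form[i + 1:]
            for form in CORPUS for i, seg in enumerate(form) if seg == phone}


def relation(x, y):
    """Classify two phones by the distributional rules described above."""
    shared = environments(x) & environments(y)
    if not shared:
        return "complementary distribution (allophones)"
    for env in shared:
        if CORPUS[env.replace("_", x)] != CORPUS[env.replace("_", y)]:
            return "distinct phonemes"
    return "free variants"


for pair in [("b", "d"), ("b", "p"), ("d", "t"), ("z", "s")]:
    print(pair, relation(*pair))
```

Run on this corpus, the procedure reports [b] and [d] as distinct phonemes and each voiced consonant as being in complementary distribution with its unvoiced mate, which matches the conclusion drawn in the text.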

The psychologist who studies child
phonology thinks at once of a number
of questions concerning the learning of
phonology that have not been answered
by linguistic research but that are open
to study by the experimentally inclined
parent. Velten might, for instance, have
undertaken deliberately to teach Joan
to distinguish the voiced-voiceless fea-
ture. This could have been done by
methods of discrimination training long
familiar in both animal and human psy-
chology. Velten might have taken a
simple contrasting pair of forms like
bo* and po* and used them as names
for two very unlike toys. He could 
have played a game with Joan in which 
she was required to bring to him the 
toy he named. If he then presented 
bo* and po* in random alternation and 
Joan was able consistently to fetch the 
indicated toy, there would have been 
evidence that she had acquired some 
receptive control of the voiced-voice- 
less feature for a particular pair of con- 
sonants in a particular pair of words. 
How widely would this learning have 
generalized? Would Joan promptly 
have shown control of the distinction 
in go* from ko,* do* from to,* and 
zo* from so*? Would she perhaps have 
distinguished the first two pairs, but not 
the third, for the reason that /g/, /k/,
/d/, and /t/ are all stop consonants as
are /b/ and /p/ but as /z/ and /s/ are
not?

By means of questions and informal
experimentation, as we have indicated
above, a distinction can be made be-
tween the child’s receptive and produc-
tive control of phonemic contrasts. One
of us, for instance, spoke to a child who
called his inflated plastic fish a fis. In
imitation of the child’s pronunciation,
the observer said, “That is your fis?”
“No,” said the child, “my fis.” He con-
tinued to reject the adult’s imitation un-
til he was told, “That is your fish.”
“Yes,” he said, “my fis.” The /s/ and
/S/ were not perceived as equivalent in
the speech of an adult, but the child,
in his own speech, used only the [s].
This sort of observation has often been
made (Leopold, 1947), and it is the
view of most linguists that in the de-
velopment of the phonological system,
as in the development of other linguistic 
systems, receptive control precedes pro- 
ductive control. 

What is the larger importance of 
phonemes? The set of phonemes used 
to distinguish one meaningful form 
from another in a language can be 
learned from a very small speech sam- 
ple, and such learning seems usually to 
be completed in the first few years of 
life. All of the thousands of meaningful
forms learned thereafter are com- 
pounded of these same few phonemes. 
As long as an utterance is phonemically 
invariant, it is likely to be the “same” 
utterance with the same meaning, but 
if the phoneme sequence is disturbed 
a new form and a new meaning are 
created. The special importance of
learning the phoneme structure of a
language is surely the fact that invari- 
ance here is ordinarily a sign of some 
kind of invariance in the nonlinguistic 
world. One person after another may 
say of one animal after another, “See 
the dog,” and though their pronuncia- 
tions will vary in many dimensions 
there is phonemic invariance and also 
an invariance of reference that is es- 
sential dogginess. When the sentence 
changes to “See the cat,” there is a new 
phoneme sequence and also a new ref- 
erence. Phonemes are the terms in 
which the community tells the child 
when things or events are to be re- 
garded as equivalent and when they are 
to be regarded as different. 


The Reference System 


A morpheme is a segment of an ut- 
terance that recurs in various utter- 
ances with approximately the same 
meaning. In addition, morphemes are 
the smallest individually meaningful 
segments, and so any meaningful seg- 
ment that can be further subdivided 
into consistently meaningful parts does 
not itself qualify as a single morpheme. 
The word older, for instance, is made 
of two morphemes because the suffix 
-er consistently means the comparative 
degree in such forms as younger, faster, 
etc. The identification of morphemes
involves “recurrence” of a segment, and 
“recurrence” means not absolute iden- 
tity of pronunciation but the same pho- 
nemes in the same sequence (phonemic 
invariance). However, a phonemically 
invariant sequence does not always 
analyze as a single morpheme. The same
phonemic segment occurs and is mean-
ingful in the forms older and runner,
but these suffixes must be analyzed as 
two different morphemes because of the 
distinction of meaning between the 
comparative -er appended to adjectives 
and the operative -er appended to 
verbs. 

Linguistic forms that make reference 
are always one morpheme or more. Dis- 
tinctive features and phonemes signal 
changes of meaning but do not them- 
selves have meaning; e.g., the phoneme 
/b/ does not “stand for” anything out- 
side of speech. When we think of ref- 
erence, we are likely to think of a full 
deliberate procedure of the kind an 
adult may use in teaching a child the 
meaning of some word. Although some 
privileged children in an affluent and 
leisurely society do receive quite a bit 
of deliberate tuition in reference, many 
more do not but somehow pick up the 
meanings of utterances from speech 
overheard at home and in the streets. 
We may think of reference quite gen- 
erally as a correlation between the use 
of linguistic forms and various nonlin- 
guistic recurrences without requiring 
the full deliberate pointing-and-naming 
operation. Of course, the referent is 
not invariably present when the form 
is used. Ford is often said in the absence 
of such a car. The referent is there only 
when the linguistic form occurs in cer- 
tain constructions, such as “Here is 
your Ford,” “Get a load of that Ford,” 
etc. The referent that is physically 
present when the form appears in these 
contexts is conceived of only when the 
form appears in such sentences as, “Did 
you see a Ford out in front?”


SIGNALING A CHANGE
OF REFERENCE


We have said that the general benefit
a child derives from learning to per-
ceive speech phonemically is that he
knows how to identify utterances as
invariant or as variant. Reference be-
gins with the recognition that a lin-
guistic form has recurred and that there
is probably a recurrence of referent or
that a linguistic form has changed and
that, therefore, there must be a change
of referent. Brown and Horowitz
(Brown, 1957) have conducted a small
experiment that demonstrates this es-
sential matter.

Farnsworth has developed, with the
Munsell Color Company, a series of 85
color chips equally spaced around the
hue dimension; saturation and bright-
ness are constant. For the Brown and
Horowitz experiment, eight alternate
chips were drawn from the red-blue
region of the series. There was the
same very small perceptual gap between
each adjacent pair of the series. The
subject was shown this series of chips 
and told that the experimenter had a 
way of classifying them. It was the sub- 
ject’s job to discover this classification. 
The experimenter moved the series be- 
hind a screen and then exposed them 
one at a time (in random order), nam- 
ing each chip with a nonsense syllable. 
The subject simply watched this proc- 
ess until all eight had been named. He 
was then asked to group the chips as 
the experimenter had grouped them 
with his verbal behavior. 

[Fig. 13-1. Categorizations of eight color stimuli in terms of
nonsense names varying in phonetic characteristics. Panel A:
English-speaking subjects. Panel B: Navaho-speaking subjects.
In both panels the eight chips are named, in order, ma, ma,
ma:, ma:, mo, mo, mo:, mo:, and a marker indicates the point
of division between categories made by subjects.]

As Fig. 13-1 shows, there were four
groups of two chips each. The groups
were named [ma], [ma:], [mo], [mo:].
The colon in [mo:] and [ma:] is a 
phonetic marking used to indicate that 
the vowel is prolonged. The difference 
between [a] and [o] is phonemic in 
English, but the difference between the 
long and short forms of each vowel is 
not. To produce the proper length, the 
experimenter thought of a voiceless 
consonant to follow the short form and 
a voiced consonant to follow the long 
form. Thus he pronounced [mo] by
thinking mote and [mo:] by thinking 
mode. 

Fifteen Harvard students, whose na- 
tive language is English, after hearing 
one series generally divided the colors 
into two groups of four chips each, as 
in Fig. 13-1. The line of division cor- 
responds to the line of phonemic 
change. They did not make a line of 
division when the vowel changed in 
length. With four repetitions, the sub- 
jects persisted in their two-group clas- 
sification. 

In the Navaho language vowel length
is always a distinctive feature. Each
vowel has a long and short form, and
these are different phonemes. Fifteen
monolingual Navahos, given the same
problem of concept attainment, gener-
ally divided the colors correctly into
four classes of two colors each, as in 
Fig. 13-1, and persisted in this division 
through four repetitions. 
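
The two outcomes can be reproduced with a short sketch in which the only thing that differs between listeners is whether vowel length counts as distinctive. The chip order and names follow Fig. 13-1; the function names and the flag `length_is_distinctive` are our assumptions for illustration.

```python
from itertools import groupby

# Names given to the eight chips, in hue order, as described in the text.
NAMES = ["ma", "ma", "ma:", "ma:", "mo", "mo", "mo:", "mo:"]


def phonemic_form(name, length_is_distinctive):
    """Reduce a phonetic name to what the listener treats as the same word."""
    return name if length_is_distinctive else name.replace(":", "")


def grouping(length_is_distinctive):
    """Sizes of the runs of adjacent chips whose names are phonemically alike."""
    keyed = [phonemic_form(n, length_is_distinctive) for n in NAMES]
    return [len(list(run)) for _, run in groupby(keyed)]


print(grouping(length_is_distinctive=False))  # English-like listener: [4, 4]
print(grouping(length_is_distinctive=True))   # Navaho-like listener: [2, 2, 2, 2]
```

The stimuli and the naming are identical in both calls; the difference in grouping comes entirely from which phonetic differences the listener treats as a change of form.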

It is clear that English-speaking sub- 
jects were able to distinguish the pro- 
longed vowel from the short vowel, 
for when the experimenter rejected the 
two-group classification as erroneous 
subjects discovered vowel length and 
the proper grouping of colors. Many 
then remarked that they had from the 
first noticed some variations in the 
vowels but had assumed they were ac- 
cidental. There could be no better state- 
ment of the cognitive status of non- 
phonemic variations. They are, like the 
variations of brightness and legibility 
among coins of a given denomination, 
thought of as unintentional and of no
significance. Navaho subjects did not
assume that the variations in vowel
length were accidental, since in their
language variations of this kind func-
tion to distinguish meanings. Once the
English subjects learned that the vowel- 
length feature was significant, they 
made no errors in identification. 

The groupings produced by both 
English and Navaho subjects were iso- 
morphic with the groupings they heard 
in the experimenter’s speech. Having 
different sets of distinctive attributes 
(phonemes) for speech, they came up 
with different groupings. When the 
English-speaking subjects were cor-
rected, they re-examined the experi-
menter’s speech and discovered the at-
tribute to which he gave significance.
This is a method that tests for percep- 
tual or receptive control of phonemic 
contrasts (see p. 528) and which dem- 
onstrates the fact that phonemic change 
causes subjects to look for a change of 
reference. 


REFERENCE REQUIRING IDENTIFICATION
OF NEW INSTANCES


A very nice report of deliberate in- 
struction in reference is provided by 
Dr. Jean-Marc-Gaspard Itard (Ameri- 
can edition, 1932), who, in eighteenth- 
century France, undertook the educa- 
tion of the feral boy called Victor of 
Aveyron. Victor was a boy of 11 or 
12 found running naked in the Caune 
Woods in 1797 and believed to have 
lived in the forest outside all human 
society from about the age of 7. When 
captured, Victor had no speech and no 
understanding of speech. Dr. Itard had 
recently been appointed physician to a
new institute for deaf-mutes in Paris,
and he obtained permission to take Vic-
tor there to try to civilize and educate
him. Dr. Itard’s methods for teaching
the rudiments of language and for test-
ing Victor's understanding of words
are essentially the same as experimental
methods used today.

In order to teach Victor the referents
of certain words, Itard arranged sev-
eral objects, such as a pen, a key, a box,
and a book, on a shelf in the library.
Each thing rested on a card on which
its name was written, and Victor had
already learned to identify the names.
Itard next disarranged the objects and
cards and indicated to Victor that he
was to match them up again. After a
little practice, the boy did this very 
well. Itard then removed all the objects 
to a corner of the room. He showed 
Victor one name and gave him to un- 
derstand that he was to fetch the object 
named. Victor also learned this very 
quickly. 

The next test went badly at first. 
Itard locked away in a cupboard all of 
the particular objects with which Vic- 
tor had practiced but made sure that 
there were in his study other objects 
of the same kinds—other pens, keys, 
boxes, and books. He then showed Vic- 
tor a word, e.g., livre, and indicated 
that the boy was to bring the referent. 
Victor went to the cupboard for the 
familiar book and finding the cupboard 
locked had to give up the task. He 
showed no interest in any other book. 
The same failure occurred with the 
other words. Victor had understood
each word to name some particular 
thing rather than a category of things.
Itard next spread out a variety of
books, turning their pages to show
what they had in common. He indi-
cated that the word livre could go with
any of them. After this lesson, when
shown the word livre, Victor was able
to fetch a book other than the specific
book of his training. However, he did 
not correctly constitute the book cate- 
gory at once, for he brought a handful 
of paper at one time and a pamphlet 
and magazine at another. As his errors
were corrected, however, he learned to
distinguish books from other sorts of 
publications and also to recognize such 
categories as key, pen, and box. The 
crucial test for understanding of the 
referent category was always Victor’s 
ability to identify new instances, and 
this is still the best experimental test. 

Itard next approached the difficult 
problem of conveying an understanding 
of words that name qualities and rela- 
tions rather than objects that have size, 
shape, and weight. He took out two 
books, one large and one small, and 
Victor promptly labeled each with the 
word livre. Itard then took Victor's 
hand and spread it flat on the front of 
the large volume, showing how it failed 
to cover the full surface. The same 
hand spread out on the smaller book 
did cover that surface. Victor seemed 
puzzled, as if wondering why one word 
should name these two different ob- 
jects. Itard gave him new cards labeled 
grand livre and petit livre and matched 
them with the appropriate books. Now 
came the test to see whether Victor 
had learned specific habits or had ab- 
stracted a general relationship. Itard 
produced two nails, one large and one
small, and asked that the cards grand
and petit be correctly assigned. Victor 
learned this relationship and others be- 
sides. 

Itard had another good idea for verbs 
that name actions. He took a familiar 
thing, e.g. a book, and made it the ob- 
ject of some action—pounding it or 
dropping it or opening it or kissing it. 
In each case he gave the boy the ap- 
propriate verb in the infinitive form. 
The test was for the boy to label such 
actions when their object was changed, 
e.g., to a key or a pen. This, too, Victor 
learned. 

As long as Victor could do no more 
than match some particular referent 
with its name, his performance could 
be the result of rote learning. Evidence 
of the abstraction of general rules of 
reference derives from the boy’s ability 
to identify new instances of a referent 
category (not the particular book of 
his training trials but any book). This 
is possible only if the essential attri- 
butes of books (pages and a cover) are 
separated from such accidental attri- 
butes as a particular title and color of 
cover and if the essential attributes can 
be identified when they appear in con- 
junction with different sets of acciden- 
tal attributes. It is in this respect that 
reference is a system, for we deal with 
the ability of a subject to go beyond 
the information he has been given. 
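
The separation of essential from accidental attributes can be pictured with a deliberately simple model: treat each object as a set of attributes and take as essential whatever all the training instances share. This intersection rule and the attribute labels are our assumptions for illustration; they are not offered as an account of what Victor, or any learner, actually does.

```python
def essential_attributes(instances):
    """Attributes shared by every training instance of a category."""
    return set.intersection(*(set(i) for i in instances))


def is_instance(candidate, essentials):
    """A new object qualifies if it has all of the essential attributes."""
    return essentials <= set(candidate)


# Hypothetical attribute sets for the books shown during training.
training_books = [
    {"pages", "cover", "red", "large"},
    {"pages", "cover", "blue", "small"},
    {"pages", "cover", "green", "large"},
]
book = essential_attributes(training_books)

new_book = {"pages", "cover", "brown", "small"}  # never seen in training
pamphlet = {"pages", "brown", "small"}           # lacks a cover

print(sorted(book))                 # ['cover', 'pages']
print(is_instance(new_book, book))  # True
print(is_instance(pamphlet, book))  # False
```

The test of the model, like the test of the learner, is its behavior on objects that were not in the training set: the new book is accepted although its color and size differ, and the pamphlet is rejected.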

Itard’s test is of course the same as 
the general experimental criterion for 
concept attainment used by Hull 
(1920), Heidbreder (1948), Smoke 
(1932), Bruner, Goodnow, and Austin 
(1956), and many others. All of the 
studies in which subjects are required
to abstract the concept named by a
nonsense syllable are, of course, studies 
of the process of linguistic reference. 
Identification of new instances has also 
been used experimentally to explore a 
child’s understanding of a meaningful 
utterance in the native language. Per- 
haps one example from the work of 
Piaget (1952) will serve to illustrate. 

To Piaget’s mind the study to be de- 
scribed is concerned with the child’s 
conception of quantity. We would add 
only that the conception is inferred 
from the child’s referential use of words 
like more, less, and equal, and so it is 
at least as accurate to say that the study 
is concerned with the child’s under- 
standing of the vocabulary of quantity. 
Piaget is inclined to see through words 
as though they were not there and to 
imagine that he directly studies the 
child mind. In fact, he is often con- 
cerned with the comprehension of ut- 
terances. 

In the present case the experimenter 
ranged side by side two tumblers filled 
with water to exactly the same height 
and asked his young subjects whether 
the tumblers contained equal amounts, 
or, if not, which one contained more. 
Then, while the child watched, Piaget 
poured the water from one tumbler into 
a container of different shape, narrower 
and taller. In this new container the 
water rose to a greater height than the 
water in the remaining tumbler. The 
child was asked whether the amounts 
of water were still equal. The younger 
children usually answered that they 
were not. Some thought there was more 
water in the new container. Piaget de- 
cided that such children were judging 
quantity by the size of the container, 
without reference to the height of the
liquid in the container. Older children 
and adults judge liquid quantity by 
taking account of both container size 
and the degree to which the container 
is filled. The vocabulary of liquid quan- 
tity has reference for us to a relation- 
ship between these two variables. Very 
young children attend to one or the 
other of the two but not to both. It is 
possible to interpret these results as evi- 
dence of inevitable stages in the devel- 
opment of the child’s mind, and it may 
be correct to do so. (In our section on 
the morphological system other exam- 
ples are reported of a progression from 
simple uncontingent rules to more ade-
quate contingent rules.) However, if
we think of Piaget’s findings as relevant 
to the acquisition of word meanings, 
we shall see that the sequence Piaget 
discovered might be the result of par- 
ticular vocabulary training for which 
adults are responsible rather than an in- 
dication of what can be comprehended 
by the child mind at various ages. The 
vocabulary of quantity is likely to be 
introduced with simplified comparisons 
in which one of the variables is con- 
stant. With a set of glasses all the same 
size, more, less, and equal are directly 
tied to the height of the contained 
fluid. With a set of canisters varying 
in size but all filled to the top, quantity 
varies with the volume of the container. 
Moving outside these simplified situa- 
tions, the child, at first, makes an in- 
appropriate extension of his semantic 
rule and has to learn to take account 
of two variables at once. He can also 
solve the problem, of course, by grasp-
ing the reversibility of the pouring
process. Having seen water poured
from one container to another, we
should know that the quantity had not
changed because the process could be
reversed to restore the original situa-
tion. For the very young child, pouring
seems to be an irreversible process that
alters the quantity of the fluid.
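
The contrast between a one-variable rule and a two-variable rule for "more," "less," and "equal" can be made concrete with a little arithmetic. The following is only an illustrative sketch: it assumes cylindrical containers, and the names (`Filled`, `pour`, `judge_by_height`, `judge_by_volume`) and the dimensions are hypothetical, not taken from Piaget's procedure.

```python
import math
from typing import NamedTuple


class Filled(NamedTuple):
    """A cylindrical container holding liquid (illustrative units: cm)."""
    radius: float  # radius of the container
    height: float  # height reached by the liquid


def volume(c: Filled) -> float:
    """Quantity of liquid: depends on both container size and fill height."""
    return math.pi * c.radius ** 2 * c.height


def pour(src: Filled, new_radius: float) -> Filled:
    """Pour all of the liquid into a container of a different radius."""
    return Filled(new_radius, volume(src) / (math.pi * new_radius ** 2))


def judge_by_height(a: Filled, b: Filled) -> str:
    """One-variable rule: compare the liquid heights only."""
    if math.isclose(a.height, b.height):
        return "equal"
    return "more" if a.height > b.height else "less"


def judge_by_volume(a: Filled, b: Filled) -> str:
    """Two-variable rule: compare the actual quantities."""
    if math.isclose(volume(a), volume(b)):
        return "equal"
    return "more" if volume(a) > volume(b) else "less"


tumbler = Filled(radius=4.0, height=6.0)
tall = pour(tumbler, new_radius=2.0)   # narrower container, so the level rises
print(judge_by_height(tall, tumbler))  # the one-variable rule answers "more"
print(judge_by_volume(tall, tumbler))  # the two-variable rule answers "equal"
print(judge_by_volume(pour(tall, 4.0), tumbler))  # poured back: "equal"
```

The last line corresponds to the reversibility argument: pouring the water back restores the original height, so the quantity cannot have changed.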

REFERENCE
REQUIRING A DEFINITION

Correct identification of new in-
stances is one sort of evidence for the
possession of a general rule of refer-
ence. In addition, of course, one can
ask the subject to formulate the rule 
governing his application of a linguistic 
form. This test will often be failed 
when the identification-of-instances
test can be passed. Victor could identify 
books but could not define them, and 
Piaget’s child subjects would certainly 
have been unable to define the various 
terms in their vocabulary of quantity. 
It is tempting, therefore, to think of 
definition as a test of the understanding 
of a reference that is less sensitive to 
lower levels of understanding than the 
identification test very much as recall 
is a less sensitive test of retention than 
recognition. In fact, however, this 
would not be correct, for adequate per- 
formance on the definition test (the 
less sensitive) does not by any means 
guarantee successful performance on 
the identification test (the more sensi- 
tive) in the way that correct recall 
generally implies correct recognition. 
Any definition is an utterance, and any 
utterance can be rote-learned. A child 
might define an orange as a round edi- 
ble citrus fruit of orange color and yet 
be quite unable to identify oranges as 
distinct from apples and pears or any-
thing else.

For the purpose of studying the types 
of definition provided by children, 
many experimenters have used the 
problems of the Stanford-Binet. Feifel 
and Lorge (1950), for instance, used 
these problems for hundreds of chil- 
dren 6 to 14 years of age, classified the 
definitions into five qualitative cate- 
gories, and described age trends for 
types of definitions provided by chil-
dren. Synonyms (orange is a fruit) and
explanations (skill is being able to do 
something well) increased in frequency 
as children grew older, whereas defini- 
tion by use (ball is to be thrown) de- 
creased. The methodological complexi- 
ties in research of this sort arise in the 
classifying of the definitions, which is 
a variety of content analysis and calls 
for the calculation of reliability coeffi- 
cients for independent judges. The ex- 
perimenter must supply definitions of 
the categories in his classification to 
other judges to see whether they can 
sort the children’s responses as he has 
done. Verbal definitions often do not 
suffice to produce reliability of judg-
ment. Even verbal definitions with ex-
amples may not do it. In his study of
the causal explanations of children 
Piaget (1930) offered no reliability co- 
efficient and so no guarantee that his 
system of classifying explanations could 
be used by others. J. M. Deutsche 
(1943) undertook to check the Piaget 
stages in the development of the no- 
tion of causality with a large sample 
of American children. To do this she 
had to be able to sort new causal state- 
ments into Piaget’s 17 classes. Deutsche 
abstracted the best definition of each 
class given by Piaget and combined it
with examples from the original work. 
After studying these materials, Deutsche 
and two colleagues independently 
classified a collection of new statements. 
There was much disagreement among 
the three judges; only 21.76 per cent of 
the items were classified in the same 
way by all three of them. What Piaget 
had written about his classes of ex- 
planation did not convey the ability to 
identify new instances in a reliable 
fashion. With content-analysis, building 
reliability is an important part of the 
procedure, and a high coefficient is an 
essential guarantee that the categories 
of the analysis can be shared. 
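
The kind of agreement figure reported for the three judges can be computed very simply. The sketch below is an assumption-laden illustration, not Deutsche's own computation: the function names and the sample classifications are hypothetical, and raw percentage agreement is only the crudest index of reliability, since it makes no correction for chance agreement.

```python
from itertools import combinations


def unanimous_agreement(ratings):
    """Per cent of items that every judge put in the same class.

    `ratings` holds one tuple per item, with one class label per judge.
    """
    same = sum(1 for labels in ratings if len(set(labels)) == 1)
    return 100.0 * same / len(ratings)


def pairwise_agreement(ratings):
    """Per cent agreement for each pair of judges, keyed by judge indices."""
    n_judges = len(ratings[0])
    result = {}
    for i, j in combinations(range(n_judges), 2):
        same = sum(1 for labels in ratings if labels[i] == labels[j])
        result[(i, j)] = 100.0 * same / len(ratings)
    return result


# Hypothetical classifications of five statements by three judges.
ratings = [
    ("A", "A", "A"),
    ("B", "A", "B"),
    ("C", "C", "C"),
    ("A", "D", "C"),
    ("D", "D", "A"),
]
print(unanimous_agreement(ratings))  # 40.0
print(pairwise_agreement(ratings))   # {(0, 1): 60.0, (0, 2): 60.0, (1, 2): 40.0}
```

Comparing the pairwise figures with the unanimous figure shows whether disagreement is general or is concentrated in one judge.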

It is very important to remember 
that the child’s definitions can be rote- 
learned as well as independently con- 
structed. Because of this fact, changes in 
definition-type with age do not neces- 
sarily reflect age changes in the child’s 
style of cognitive function. They might 
result from the fact that parents and 
teachers consider different types of 
definition to be appropriate to children
at different ages and so change what 
they say to children. This in turn 
would change what children say to ex- 
perimenters. 


SEMANTICS 
WITH THE CONDITIONED 
RESPONSE 


Studies of verbal conditioning were 
made first by Russian investigators in 
several laboratories in the USSR and 
then by American psychologists (Raz- 
ran, 1939). Riess (1946) is one of the 
few to do the verbal conditioning ex-
periment with children, and his pro-
cedure provides a model. As the uncon-
ditioned stimulus, he used a loud buzzer
to produce an unconditioned electro-
dermal response (EDR). This harm-
less stimulus is a great improvement for
research with children over the electric
shock that has often been used with
adults. The conditioned stimuli are
printed words (with children old
enough to read), and classical con-
ditioning is effected when the words
that have been accompanied by the
buzzer elicit the EDR.

In one study Riess was concerned
with measuring the generalization of a
conditioned response from the training
word to a homophone, to a synonym,
and to an antonym for the training word.
Accordingly, his stimulus words came
in sets of four (training, homophone,
synonym, antonym), and he elected to
use five such sets, or 20 critical words
in all. The first step in the procedure
was to determine the unconditioned
EDR to each word as a kind of baseline
from which to determine the condi-
tioned response. Subjects were told to
try to memorize the words they saw. In
the next stage the training words were
presented in a series with neutral words,
and the conditioned response was es-
tablished by reinforcing the training
word with the buzzer. Conditioning
was judged to be established when the
conditioned stimulus elicited an EDR
at least three times the base value for
that word on three successive trials in
the absence of reinforcement. Finally,
generalization was tested by presenting
the homophones, synonyms, and anto-
nyms and measuring the EDR elicited
on five trials for each word.
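
The criterion for established conditioning stated above (an EDR of at least three times the base value on three successive unreinforced trials) can be written out as a short check. This is a minimal sketch under stated assumptions: the function name, the parameter names, and the sample readings are hypothetical, and the EDR values are in arbitrary units.

```python
def conditioning_established(baseline, unreinforced_edrs, ratio=3.0, run=3):
    """Apply the three-times-baseline, three-successive-trials criterion.

    baseline          -- unconditioned EDR to the word (its base value)
    unreinforced_edrs -- EDRs on successive trials given without the buzzer
    ratio, run        -- criterion multiple and required number of trials
    """
    streak = 0
    for edr in unreinforced_edrs:
        # A trial below the criterion breaks the run of successive successes.
        streak = streak + 1 if edr >= ratio * baseline else 0
        if streak >= run:
            return True
    return False


# Hypothetical readings for one training word with a base value of 2.0.
print(conditioning_established(2.0, [6.5, 5.0, 6.0, 7.1, 6.2]))  # True
print(conditioning_established(2.0, [6.5, 6.1, 5.9, 6.4]))       # False
```

In the second series two trials meet the criterion, but the third falls short and the run must begin again.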

Riess performed this experiment with
children in four age groups: children in
their eighth year, their eleventh year,
their fourteenth year, and their nine-
teenth year. Riess advises varying the
size of the letters of the exposed words
to suit the age group. He gave warning
of the buzzers and explained the elec-
trodes as necessary to determine the
subjects’ sweat reaction to the tempera-
ture of the closed and windowless
room.

We mention only one result that is of
peculiar interest in connection with the 
acquisition of linguistic reference. For 
the youngest group, the conditioned 
response generalized most strongly to 
the homophones (words of different 
reference but identical phonemic se- 
quence, e.g., pale-pail, ate-eight). For 
the oldest group, the greatest generaliza- 
tion was to synonyms (pale-light, ate- 
fed). Homophones are alike in sound; 
synonyms do not resemble one another 
as phoneme sequences but do resemble 
one another in their referents or, more 
broadly, in their meanings. For the 
younger children, the important dimen- 
sion of similarity seems to have been 
the sound and appearance of the words 
themselves, but with advancing age and 
experience words apparently become 
transparent and are seen through to 
their meanings, which provide a new 
basis of resemblance. Here is evidence 
that responses trained to one word will 
be extended to others linked with it by 
the reference system. 

Russian psychologists have recently 
been very much interested in the role 
of language (for which they use the 
Pavlovian term “the second signal 
system”) in facilitating the formation of 
nonlinguistic conditioned responses. 
Liublinskaya (1957), for instance, re-
ports the following study with pre-
school children. A conditioned response 
was produced to the color (in another 
case to the size) of a paper cup, under 
which the children found a sweet as a 
reinforcement. With the experimental 
group, the successful solution was ac- 
companied by verbal designation of the 
distinguishing feature (red color, small 
size); with the control group, the dif-
ferentiating feature was not named.
There were many differences in the 
performances of the two groups. The 
experimental group formed the condi- 
tioned response much more quickly; for 
this group, the response lasted over a 
longer period than for the control 
group; the experimental group showed 
a quick, even immediate, generalization 
of the response to such new stimuli as 
a red cup or a red box, but the control
group did not. Of course, all of these 
effects were more pronounced for 
older children than for younger, since 
the older would be more likely to have 
a well-developed understanding of the 
references for red and small. The Rus-
sians are systematically exploring the
role of the reference-making linguistic 
form as a signal, making possible rapid 
abstraction and generalization in the 
formation of new conditioned re- 
sponses. 


The Morphological System 


The grammar of a language consists 
of its morphology and syntax, cate- 
gories that often overlap and are dif- 
ficult to describe independently of one 
another. As we have said, a morpheme
is a minimal unit of meaning in a
language. A phoneme does not in itself
mean anything, although it is used to
signal differences in meaning. In English 
the /p/ phoneme distinguishes the word 
pig from similar words fig or big; but 
there is nothing about /p/ by itself that 
conveys the idea of pigginess. Nor can 
we find any common denominator of 
meaning in pull, porch, pig, pleasure, 
and the other words beginning with 
this phoneme. But if we look at a series 
of words such as hats, ships, cats, 
elephants, we note that the final -s does 
convey a common meaning: that of 
plurality. In a series such as unhappy, 
unaware, untied the un- seems to mean 
not, and in both of these instances we 
are dealing with morphemes. A single 
morpheme may be many phonemes 
long and capable of standing by itself, 
like elephant, or it may be only one 
phoneme long and bound to other 
morphemes, like the -s in elephants. 
The -s of elephants is called one al- 
lomorph of the plural. Although the 
plural is pronounced /s/ in this case, it 
is only one of several phonological 
shapes that in English signal this mean- 
ing. An allomorph is one variant of a 
morpheme that occurs in certain en- 
vironments. The plural allomorph /-s/ 
occurs after all voiceless sounds, except 
after /s/, as in kiss, /š/, as in hash,
/č/, as in latch. The other regular al-
lomorphs of the plural in English are
/-əz/ after the /s/, /š/, and /č/ already
mentioned, and also after the voiced
counterparts of these phonemes, /z/,
/ž/, as in azure, /j/, as in judge; and
/-z/, which is the commonest allomorph
and occurs after all other sounds. That
is, it occurs after all voiced sounds, ex-
cept those that call for /-əz/. Thus we
have regular plurals, such as rags /-z/,
racks /-s/, and rashes /-əz/. These are
called regular because they are used in
the great majority of English plural for-
mations. The particular allomorph that
will be used in forming a plural can
be predicted from the form of the final
phoneme of the singular form. These
allomorphs, like allophones, are varia-
tions in form conditioned by the
phonetic environment. These forms are
also called the productive forms of the
plural. They are productive in the
sense that English continues to form
new plurals from new words, according
to this basic rule.
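
Because the regular allomorph is predictable from the final phoneme of the singular, the rule can be stated as a small procedure. The following is a sketch only: the phoneme inventories are simplified assumptions, the symbols š, ž, and č stand for the final sounds of hash, azure (medial), and latch, "j" stands for the affricate of judge, and the function name is hypothetical.

```python
# Sibilants and affricates that call for /-əz/ (simplified inventory).
SIBILANTS = {"s", "z", "š", "ž", "č", "j"}

# Voiceless consonants that can end an English word (simplified inventory).
VOICELESS = {"p", "t", "k", "f", "θ", "s", "š", "č"}


def plural_allomorph(final_phoneme: str) -> str:
    """Choose the regular plural allomorph from the singular's final phoneme."""
    if final_phoneme in SIBILANTS:
        return "-əz"   # kisses, rashes, latches, judges
    if final_phoneme in VOICELESS:
        return "-s"    # racks, hats, ships
    return "-z"        # all other (voiced) sounds, including vowels


# Each singular is represented only by its final phoneme.
for word, final in [("rack", "k"), ("rag", "g"), ("rash", "š"),
                    ("judge", "j"), ("bazooka", "ə")]:
    print(word, plural_allomorph(final))
```

The order of the tests matters: the sibilants must be checked first, since /s/, /š/, and /č/ are themselves voiceless and would otherwise receive /-s/.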
When we say that English continues
to do something, nothing mystical is
implied. This means, of course, that the
speakers of English do something and
what adult speakers of English do with 
new words added to the language in- 
dicates the way in which they have 
structured the language system, even as 
what children do with new words added 
to their vocabularies indicates the way 
in which they have structured what 
they know of the language. If a new 
word comes into English, we quite 
automatically know what its plural 
should be. There is no question in our 
minds that the plural of bazooka is 
bazookas, with a /-z/, despite the fact 
that there is nothing in English pho- 
nology to prevent us from adding /-s/ 
after a vowel to form [bəzuwkəs]. In
adding /-z/, we show that we know 
something more than a phonological 
rule that tells us what sound combina- 
tions can occur: we know a mor- 
phological rule for the formation of 
the plural. We also know other kinds 
of plurals in English, words such as 
oxen, men, data, alumnae, and sheep,
but no general rule seems to cover these 
instances, and we do not make new
plurals according to these models. De-
spite the fact that some words for
animals, such as sheep and deer, take no
ending in the plural, we are not tempted
to make the plural of a new word for 
an animal by adding nothing. To say 
one wug* and two wug* sounds 
vaguely Chinese to us. 

It may be noted here that the cri- 
terion of phonetic similarity, which 
guides the identification of allophones, 
is not essential for the identification of 
the allomorphs of a morpheme. Before 
we can say that the aspirate [p’] of pot 
and the [p] of spot are allophones of 
the phoneme /p/, we have noted that 
[p’] and [p] resemble one another.
Quite obviously, the /-s/ of books and 
the /-ən/ of oxen are not phonetically 
similar. They are classed as allomorphs
of the same morpheme because they 
both serve the same function, that of in- 
dicating that the noun to which they 
are appended is to be regarded as a 
plural form. 

Morphology, of course, deals with 
very much more than the expression of 
the plural. What has gone before has 
been by way of example. The native 
speaker’s knowledge of morphology 
often enables him to guess the mean- 
ings of words he has never seen before 
and, upon occasion, to make up new 
words. If a child in reading comes
across the word unkindness, his knowl- 
edge of the meanings of the separate 
morphemes, un-, kind, and -ness, might
enable him to guess the meaning of the 
word even though he had never seen it 
before. When new words come into the 
language they are rarely made of en-
tirely new elements; more often they
are made of morphemes that are already 
familiar. A suffix like -ness stands ready 
to transform adjectives and adverbs into 
nouns, should the need arise, as witness 
the recently coined togetherness. 

For the most part, the free mor- 
phemes of the language comprise its 
lexicon. The bound morphemes are 
used to derive new words and to express 
certain grammatical categories. Not all 
languages express the same grammatical 
categories, and each language is unique 
in its choice of morphological and 
syntactical means of expression. Often
grammatical meanings can be expressed 
in two or more ways in the same lan- 
guage; one of these ways can involve 
morphology and the other, syntax; in 
English, for example, genitive relations 
can be shown either by special endings 
(morphology), as in the horse's hoof, 
or by juxtaposition of words and 
special function words (syntax), as in 
the hoof of the horse. It is thus im- 
possible to describe grammar without 
considering both morphology and syn- 
tax. 

Word order in English is very im-
portant. “The girl loves the farmer” and
“The farmer loves the girl” mean quite 
different things; and actor and object 
are indicated only by the position of the 
words. In Latin both Puellam amat
agricola and Agricola amat puellam 
mean “The farmer loves the girl.” Word 
order is not important here, and actor 
and object are expressed by the case 
endings on the nouns. English thus 
uses syntax and Latin uses morphology 
to express this kind of relationship, 
and to speak of English nouns as being 
in the “nominative” or “objective” case
is a fiction, since there is no difference
in the form of words like girl and
farmer, whether they are actor or acted 
upon in any given construction. 

As the child acquires his native 
tongue, he must learn to use those 
patterns of morphology and syntax 
that are required for the production of 
grammatical utterances in that lan- 
guage. Language learning thus involves 
much more than the acquisition of a 
finite number of sounds or words. It 
involves learning great numbers of in- 
tricate patterns for the formation of 
words from morphemes and the forma- 
tion of sentences from words. 

For a general introduction to the 
problems of morphology, and an out- 
line of English morphology, we recom- 
mend the works of Gleason (1955), 
Hockett (1958), and Bloch and Trager 
(1942). In addition to the works already 
cited, numerous detailed accounts based 
on the observation of actual child lan- 
guage have appeared. A picture of 
morphological development in diverse 
languages is provided by Lewis (1936) 
on English, Stern (1928) on German, 
Guillaume (1927) on French, and 
Gheorgov (1908) on Bulgarian. 


NONEXPERIMENTAL 
OBSERVATION 
OF CHILD GRAMMAR 


The works of Lewis, Stern, and 
Gheorgov are, again, parents’ detailed 
accounts of their children’s language 
development, based on informal ob- 
servation and set down in notes and 
diaries. Guillaume’s findings are based 
on observations he made at a French 
nursery school. His subjects were chil-
dren between 2 and 5 years old; after
these children became used to his
presence as an observer, they ignored
him, and he wrote down their conversa-
tions. All of these observers report that
small children produce new forms by
analogy with words they already know
and that the direction of the new forms
is toward eliminating the irregularities
and inconsistencies of the language— 
the equivalent of saying I ringed and
two mouses. Guillaume found that 
French children often regularized
common irregular forms, but irregular
forms, even the commonest, never
served as patterns for analogical forma-
tions.

Observant parents have thus con-
tributed much of what we know about
the grammar as well as the phonology
of child language. Aside from possess- 
ing a trained ear and a consistent system 
of notation, the observer must know 
enough about language as a system to 
recognize new developments when he 
hears them. Any diary should also in- 
clude pertinent information about the 
diarist and his subject. The writer 
should identify himself and his relation- 
ship to the child; he should indicate 
what other people are in contact with 
the child, including older or younger 
siblings, what languages are spoken to 
the child, and what dialects. Since we 
are dealing with verbal behavior, we 
should also expect a record of what was 
said to the child just before and just 
after his utterance. These adult com- 
ments to the child should also be re- 
corded with great care, since they 
comprise the text the child is studying. 
Without this information, certain ques-
tions can never be answered. Reduplica-
tion (words like da da and boo boo)
and the use of -ie suffixes (as in doggie),
for example, seem to be features of 
English child language. One might ask 
if the child says these things because he 
is taught them as explicit words or be- 
cause they are his own invented sim- 
plifications. If a child who had never 
heard adult baby talk produced a re- 
duplicated word or called a horse a 
horsie, we would have the beginning 
of an answer to these questions. Diarists 
are in a position to supply us with this 
information. 

Since it is possible that any correct 
utterance of a child may be something 
he has previously heard, the investiga-
tion of the learning of morphology re-
lies heavily, as we have mentioned, upon
observations of children’s errors in
speaking. These errors are indicative 
of the child’s productive system. The 
work in Russian of K. Chukovskij 
(1956) has employed this method.
Chukovskij’s approach is largely anec- 
dotal and springs from a summer of 
informal observation and conversation 
spent at the seaside some forty years 
ago. Since that time, Chukovskij has 
attempted in an examination of chil- 
dren’s language to explore and set forth 
the “strange regularities” of child 
thought as a whole. His newspaper 
articles have attracted attention in the 
Soviet Union, and schoolteachers and 
parents have sent to him many hundreds 
of letters containing children’s composi- 
tions and citing words and expressions 
coined by Russian-speaking children. 
His many examples also demonstrate 
the extraordinary vigor of the Russian 
morphological system and the readiness 
with which children learning this lan-
guage invent new forms and distort
Russian forms of foreign origin in order 
to make morphological “sense.” A 3- 
year-old child, for example, was sent to 
get some vaseline, which is a borrow- 
ing into Russian in the form vazelin. 
A few minutes later she appeared, look- 
ing for the mazelin,* as she called it. 
This transformation was by no means an 
articulatory error. Maz’ in Russian 
means liniment or oil, and mazat’ means
to grease. Mazelin* is therefore a very 
likely word for a kind of oil, since it 
contains a morpheme referring to oil. 
In the course of her wanderings from 
room to room, this small child reg- 
ularized a foreign form and placed it 
within a constellation of Russian words 
that have morphological reference to 
oil. 

The dexterity with which Russian 
children handle derivational prefixes and 
suffixes is shown by numerous examples: 
one child, wishing to express the fact 
that her grandmother loved her very 
much, referred to her grandmother as 
her lyubovnitsa. Since lyubov’ means 
love, and -nitsa is a feminine ending 
meaning agent (like the -er of English), 
the combined morphemes should mean 
a feminine person who loves. Indeed it 
does, but the Russian word lyubovnitsa 
means, more precisely, paramour. It 
seems unlikely that this child had ever 
heard the word paramour before, let 
alone applied to her grandmother, so 
that it is safe to assume that she had 
created a new word out of productive 
elements of Russian morphology. What 
strikes us most in Chukovskij’s work is 
that Russian child language is charac- 
terized by very great activity in the 
formation of folk etymologies and new
words from derivational morphemes. 
This is in contradistinction to English 
child language, in which we find that 
the inflexional morphemes are learned 
before the derivational ones and in 
which new words tend to be com- 
pounded of free morphemes rather than 
derived. These differences in child lan- 
guage reflect the differing linguistic 
structures of English and Russian and 
may well be indicative of future devel- 
opments in these languages. 

A method similar to Chukovskij’s, but 
dealing with several languages, has been 
employed by Kahane, Kahane, and 
Saporta (1958) in their study of the 
development of categories of the verb 
in child language. Here they have relied 
solely upon the written reports of 
others, but these reports have been the 
work of such astute observers as Leo- 
pold (1947) and Grégoire (1937-1947). 
The advantage here is that these are 
very careful studies, often covering a 
number of years and including samples 
of the child’s language in a great variety 
of natural situations. The limitations of
this method spring from the same source 
as the advantages: from so few cases it 
is difficult to tell what is common to all 
child speakers and what reflects the 
particular idiolect of the often bilingual 
child of the linguist. Since the speech 
occurred in such a variety of situations, 
it is impossible to duplicate these condi- 
tions for the purpose of comparing re- 
sults with other children. In addition, 
too great a burden is placed upon the 
parent and linguist’s subjective feelings 
about what the child is saying or what 
he really means, regardless of what he is 
saying. This point will be elaborated
presently. 

The object of this study was to 
describe the evolution of verbal cate- 
gories in English, French, and German 
child language. The authors concerned 
themselves only with those categories 
that are formally marked in the verbs of 
the adult speakers of each of these lan- 
guages. For instance, in the sentences 
John is loved and Mary is loved one
could say that loved has both a mascu-
line and feminine meaning. Since it is 
never possible in English to tell from 
the form of the participle itself whether 
it is masculine or feminine, we cannot 
say that the category gender is ex- 
pressed in English verb forms. In 
French, however, one says Le livre est 
ouvert (The book (masculine) is
open) but La porte est ouverte (The
door (feminine) is open), and the con-
trasting forms [uver] and [uvert]
provide a basis for saying that there 
exists a marked category of gender in 
French participles. 

The Kahane and Saporta study notes 
with examples two stages in the evolu- 
tion of verbal categories. In the first 
the child means a certain category, but 
does not mark it in his speech, and the 
second occurs when the meaning con- 
trast is accompanied by a contrast in 
form. In the first stage the child might 
say “Bear eat mama up” to mean The 
bear will eat mama up and also to mean 
The bear is eating mama up. In the 
second stage futurity is marked by the 
contrasting form will. It is, however,
very difficult to decide what the child 
means in the earliest stages of language. 
The French child’s saying “tume” at 
one point is interpreted to be a shorten-
ing of the form costume (suit), and this
brief syllable is considered to mean I’ve
put on my Sunday suit, a postulation
that requires some imagination on the
part of the observer. In considering
several languages, these authors have
demonstrated a general feature of child 
language and have shown how in each 
of these languages children progress 
toward using the forms employed by 
adult speakers. 


AN EXPERIMENTAL APPROACH 
TO ENGLISH MORPHOLOGY 


As we have noted earlier, one of the 
difficulties in observing actual speech 
is that we cannot tell if the form the 
child uses represents the workings of a 
productive system or if it is something 
he has heard and is repeating. If a child 
says “feet,” for instance, we have no 
guarantee that he knows the form foot, 
and he may say “shoes” and “climbed” 
without being able to make the plural 
of new nouns or the past tense of new 
verbs. When he says “two mouses,” 
we begin to suspect that he has some 
sort of morphological system that he 
employs in the creation of new forms. 

An experimental investigation of the 
child’s morphological system for Eng- 
lish was made by Berko (1958). In this 
experiment preschool and first-grade 
children were presented with pictures 
labeled with nonsense words and asked 
to supply English inflexions, derivations, 
and compounds of those words. The 
areas under consideration were the 
plural, possessive, verbal, and adjectival 
inflexions and several derivational forms, 
such as the adjectival -y, as in meaty,
and the agentive -er, as in teacher. As 
a typical example, the experimenter 
would show the child a picture of a 
small animal and say “This is a wug.*” 
Two such animals were then shown, 
and the experimenter would continue, 
“Now there are two of them. There are
two ____?” At this point the child
would usually supply the plural form 
wugs.* 

In designing this study, the first and 
most general question had been: do 
children possess morphological rules? 
A preliminary study of 1000 words ap- 
pearing most frequently in the first 
grader’s vocabulary, based on Rinsland’s 
(1945) listing, showed that first graders 
use real words involving all of the basic 
English morphological processes. Ask- 
ing questions about real words might, 
however, be tapping a process no more 
abstract than rote memory. We can be 
sure that the nonsense words were new 
words to the child, and that if he 
supplied the right morphological item 
he knew more than the individual words 
in his vocabulary; he must possess 
general rules enabling him to deal with 
new words. If knowledge of English 
consisted of no more than the storing 
up of many memorized words, the child 
might be expected to refuse to answer 
on the grounds that he had never before 
heard of a wug* and could not pos- 
sibly give the plural, since no one had 
ever told him what it was. This was 
decidedly not the case. The children 
answered the questions. The answers 
were not always identical with those of 
an adult comparison group, but they 
were consistent and orderly answers, 
and they demonstrated that children in 
this age range unquestionably operate 
with clearly delimited morphological 
rules. No differences between the sexes 
appeared, and although first graders 
were considerably better on some items 
than preschoolers the over-all charac- 
ter of the answers of both groups was 
much the same. 

When asked to supply the regular 
plural allomorphs /-s ~ -z ~ -əz/ to new 
words, these children did best with the 
/-z/ form, which is the commonest of 
the three allomorphs. They could also 
usually supply /-s/. They did not, how- 
ever, use the rule for the addition of 
/-əz/ to new words. If a word ended in 
a sibilant, they regarded it as already 
in the plural: they said one tass* and 
two tass.* At the same time, it was 
demonstrated that over 90 per cent of 
them could produce the real form 
glasses, which showed that although 
they did not yet generalize this rule 
they had a model for generalization. 

The possessive of the noun and the 
third person singular of the verb, as in 
horse’s or he passes, are phonologically 
identical with the plural, and children 
operated with the simplified rules 
described above. There was, however, 
one major difference: they scored 
higher on these forms than on the noun 
plurals. They were asked to produce 
three nearly identical forms: a man who 
nazzes,* two nizzes,* and a niz’s* hat. 
On the verb they were 48 per cent 
right; on the possessive they were 49 
per cent right, and on the noun they 
were only 28 per cent right. This was 
an example of items that were phonetically 
identical but learned at different 
times because they function 
differently. 

On the verbs the children performed 
best with the progressive -ing form; 90 
per cent said that a man who knew how 
to zib* was zibbing.* This is an important 
form, and it may be noted that 
although other verbal morphemes have 
several allomorphs there is only one 
form of the progressive and the rules for 
its application are completely regular. 
The allomorphs of the past are 
/-t ~ -d ~ -əd/, and the children's rules 
are again simplified by comparison with 
adult speech. Although it was shown 
that they knew real forms such as 
melted, they were unable to extend the 
comparatively rare /-əd/ to new words. 
They said that a man who bods* every 
day, bod* yesterday. They did best 
when asked to supply the commonest 
form, /-d/. 
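
The simplified rules described for the plural and the past tense can be sketched in a few lines of Python. This is an illustration only, under stated assumptions: the phoneme symbols, the sound classes, and the names INFLEXIONS, adult_form, and child_form are hypothetical and are not taken from Berko (1958). The child rule leaves a stem unchanged wherever the adult rule would call for the rare syllabic allomorph.

```python
# Hypothetical, simplified sound classes chosen for this sketch.
VOICELESS = {"p", "t", "k", "f", "θ", "s", "ʃ", "tʃ"}

# For each inflexion: the final sounds that call for the rare syllabic
# allomorph, followed by the voiceless, voiced, and syllabic endings.
INFLEXIONS = {
    "plural": ({"s", "z", "ʃ", "ʒ", "tʃ", "dʒ"}, "s", "z", "əz"),
    "past": ({"t", "d"}, "t", "d", "əd"),
}


def adult_form(stem, inflexion):
    """Inflect a stem (a list of phonemes) by the full adult rule."""
    trigger, voiceless, voiced, syllabic = INFLEXIONS[inflexion]
    final = stem[-1]
    if final in trigger:
        return stem + [syllabic]
    if final in VOICELESS:
        return stem + [voiceless]
    return stem + [voiced]


def child_form(stem, inflexion):
    """Simplified rule: a stem ending in a trigger sound is left unchanged."""
    trigger = INFLEXIONS[inflexion][0]
    if stem[-1] in trigger:
        return list(stem)
    return adult_form(stem, inflexion)


wug = ["w", "ʌ", "g"]
tass = ["t", "æ", "s"]
bod = ["b", "ɑ", "d"]

print(child_form(wug, "plural"))   # ends in "z", as in the adult form
print(child_form(tass, "plural"))  # unchanged: "two tass"
print(child_form(bod, "past"))     # unchanged: "bod yesterday"
```

Because the possessive and the third person singular share the plural allomorphs, the "plural" entry could stand for all three in this sketch; it does not model the differences in scores among them.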

The picture that emerged was one of 
high consistency among the children 
and of over-regularity from the point 
of view of adult language. The children 
did not treat new words according to 
idiosyncratic patterns. They did not 
model new words on patterns that 
appear infrequently. When they pro- 
vided inflexional endings, their best 
performance was with those forms that 
are the most regular and have the fewest 
variants. With morphemes that have 
several allomorphs, they could supply 
the commonest forms long before they 
could deal with allomorphs that appear 
in a limited distribution range. And, 
unlike the Russian children, they did 
not use suffixes in deriving words. In- 
stead, they tended to put two free 
morphemes together to form compounds 
of the blackboard variety. 
Adults said a very little wug* might 
be called a wuglet*; children called it a 
baby wug.* Adults said that a man who 
zibs* is a zibber*; children called him a 
zibman.* 


MORPHOLOGY 
IN NONLINGUISTIC LEARNING 


Until now in this section we have 
talked about children’s morphological 
productions, but it is also evident that 
before children are able to use certain 
forms they understand them (i.e., receptive 
control precedes productive). 
In the following section we discuss ways 
in which the form of a new word and 
its position in a sentence influence our 
expectation about what it might mean. 
Here we report one experiment in the 
understanding of morphology and its 
relationship to learning. In an experi- 
ment with preschool children Sack 
(1957) tested the effect of the compara- 
tive -er ending on children’s ability to 
learn a principle of comparison. 

Eight Munsell color chips, shading 
from green toward blue, were used. 
Each chip was three just noticeable differences 
(jnd’s) bluer than the one before 
it. The children were divided into 
two groups. Half of the children in each 
group were told that some of the chips 
were called kef.* Children in the other 
half of each group were told that some 
of the chips were kefer,* and they had 
to learn to recognize the chips that were 
kef* or kefer.* 

The first group had before it a task 
of absolute learning. For them, the four 
bluest chips were always kef* or kefer.* 
On being presented with a pair of 
chips, they had to recognize if both, 
none, or one was kef* or kefer.* The 
second group had before it a task of 
relational learning. On being presented 
with two chips, they had always to 
choose the bluer as kef* or kefer,* and 
a chip that was kef* or kefer* in relation 
to one chip might not be in compari- 
son with another. Since this was a prob- 
lem in relational training, one of the 
questions was: would the -er ending 
facilitate learning a principle of com- 
parison and hamper the acquisition of an 
absolute category? Among the children 
who had been given the word kef* the 
absolute discrimination was made more 
easily than the relational one. For the 
groups using the word kefer,* the relational 
discrimination was easier. Young 
children thus utilized the morphological 
signal -er in the performance of a learn- 
ing task. 
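
The difference between the two tasks can be made concrete with a short sketch. The chip numbering (1 for the greenest chip, 8 for the bluest) and the function names are assumptions introduced here for illustration; they are not part of Sack's (1957) procedure.

```python
# Chips are numbered 1 (greenest) to 8 (bluest); this numbering is an
# assumption made for the sketch.
ABSOLUTE_KEF = {5, 6, 7, 8}  # the four bluest chips form a fixed category


def absolute_answer(pair):
    """Absolute task: list the chips of the pair that belong to the category."""
    return [chip for chip in pair if chip in ABSOLUTE_KEF]


def relational_answer(pair):
    """Relational task: the bluer chip of the pair is always the one chosen."""
    return max(pair)


# Chip 6 is always in the absolute category ...
print(absolute_answer((3, 6)), absolute_answer((6, 7)))
# ... but in the relational task it is chosen against 3 and not against 7.
print(relational_answer((3, 6)), relational_answer((6, 7)))
```

In the absolute task a pair may contain two, one, or no members of the category, whereas the relational task always has exactly one correct chip per pair.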

Experimentation with English-speak- 
ing children has indicated that they have 
a productive morphological system and 
that there is very little variation from 
child to child in the nature of the 
system. It has been suggested elsewhere 
that there is perhaps a regular order in 
which children learn the phonemes of 
their language. Here we have seen 
that the child’s morphological system 
evolves according to a very definite 
pattern: the productive system is based 
on the largest general cases and it con- 
tains no irregularities. Further experi- 
mentation should indicate when and 
how the child’s very regular system 
becomes the adult’s somewhat irregular 
system and it should provide information 
on the acquisition of other morphological 
and syntactic patterns. Probably 
this kind of complete picture can be 
obtained only by the use of an experimental 
method that covers the whole 
range of a linguistic system with a large 
number of children. Experiments based 
on the major morphological and syntactic 
patterns and administered to children 
of varying ages should yield scales 
of language acquisition and patterns of 
cognitive organization. 


The Syntactic System 


The postulates of syntactic descrip- 
tion are, at the present time, both too 
complicated and too unsettled to be 
presented here. We must be content to 
sketch one aspect of syntax that has in- 
spired several psychological experiments 
with children. This is the fact that forms 
in any language can be grouped into 
classes of functional equivalents. Mor- 
phemes, words, and phrases fall into 
“form classes,” the members of which 
have similar privileges of occurrence in 
building larger constructions. 

If we were to examine a very large 
sample of English speech, we should 
find a certain number of nearly identical 
utterances. We might, for instance, find 
That street is clean; That house is clean; 
and That face is clean. The forms street, 
house, and face have identical privileges 
of occurrence, and we begin to think 
of them as belonging to one form class. 
If we found next the sentences This is 
his street and This is his house, we prob- 
ably would expect sooner or later to 
find This is his face. If a child heard 
others produce the first five sentences, 
he might very well create the sixth on 
his own. The sentence could be original 
with him, though both the constituent 
forms and the pattern have been learned 
from others. As it happens, he would in 
this case have produced a grammatically 
acceptable English sentence. 
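
The step from five heard sentences to a sixth, original one can be sketched as follows. The representation of a sentence frame as a (prefix, suffix) pair and the names FRAMES and fillers are assumptions made for illustration; the sentences themselves are those given above.

```python
heard = [
    "That street is clean",
    "That house is clean",
    "That face is clean",
    "This is his street",
    "This is his house",
]

# Each frame is a (prefix, suffix) pair with one open slot between them.
FRAMES = [("That ", " is clean"), ("This is his ", "")]


def fillers(frame, utterances):
    """Collect the forms that have been heard in the slot of a frame."""
    prefix, suffix = frame
    return {
        u[len(prefix):len(u) - len(suffix)]
        for u in utterances
        if u.startswith(prefix)
        and u.endswith(suffix)
        and len(u) > len(prefix) + len(suffix)
    }


# Forms that share one frame are treated as members of one form class.
form_class = set()
for frame in FRAMES:
    form_class |= fillers(frame, heard)

# Every member of the class is then tried in every frame.
possible = {prefix + word + suffix for prefix, suffix in FRAMES for word in form_class}
novel = possible - set(heard)
print(novel)  # {'This is his face'}
```

Both the forms and the frames come from the heard sample; only the combination is new.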

As adult speakers of English, we 
organize the meaningful elements of our 
language into form classes. This is 
evident if we make a kind of phenomenological 
report on the grouping 
of constituents in a pair of sentences; 
e.g., She bought a new hat; He likes the 
old man. We probably feel an affinity 
here between He and She and another 
affinity between bought a new hat and 
likes the old man. The first two are 
subjects and the second two are 
predicates. Within the subject class the 
two members can be interchanged, and 
this is also true within the predicate 
class: She likes the old man; He bought 
a new hat. A Gestalt theorist might say 
that the members of the subject form- 
class are organized by “common fate,” 
i.e., similar privileges of occurrence in 
larger constructions, and the members 
of the predicate form-class, by a dif- 
ferent common fate. The linguist would 
say that the division of these sentences 
into subject and predicate is a first-level 
analysis into their immediate con- 
stituents (Hockett, 1958), which be- 
long to different form classes. 

In English subject and predicate are 
very large form classes, each of which 
embraces an immense variety of mor- 
phemes, words, and phrases. The parts 
of speech, the noun, verb, adjective, and 
adverb, are also very large form classes. 
Smaller form classes having familiar 
names include animate nouns, mass 
nouns, particular nouns, proper nouns, 
transitive verbs, and intransitive verbs. 
In general, form classes grow smaller as 
the membership requirements grow 
more stringent. Transitive verbs are 
nearer having completely identical 
privileges of occurrence than are verbs 
in general. 

One part of the linguistic description 
of syntax is the assignment of forms to 
form classes on various levels of generality. 
There can be no doubt that 
with syntax, as with phonology and 
morphology, each of us in learning the 
language does unconsciously what the 
linguist does deliberately. In the case 
of syntax, the importance of the system 
is most evident. Suppose we were to 
hear the sentence: The iggle* squigs* 
trazed* wombly* in the harlish* goop.* 
Though we do not know the meanings 
of any of the major words, we are able 
to assign them to form classes on the 
basis of their order and of certain in- 
flections and marking words. As a re- 
sult, we are prepared to form a vast 
number of utterances that we expect to 
be grammatically acceptable (though 
they might turn out to be peculiar from 
a semantic point of view). “Squigs* are 
iggle,* they say, and squigs* know how 
to traze,* though in a wombly* fashion. 
As for goop,* it is harlish.* It may be 
that I also am harlish.* I might even be 
a squig,* whether iggle* or no, etc., 
etc.” When a new form is introduced in 
such a way as to place it in one or more 
form classes, a population of utterances 
involving the form is placed in readiness 
and a larger population is ruled out. 
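
How order, inflections, and marking words can place unknown words in form classes is sketched below for the nonsense sentence above. The cue tables and the function assign_form_classes are hypothetical and deliberately crude: they cover only the cues present in this one sentence and are not offered as a description of English.

```python
# Marking words and suffix cues; both tables are assumptions of this sketch.
FUNCTION_WORDS = {"the": "determiner", "in": "preposition"}
SUFFIX_CUES = [("ed", "verb"), ("ly", "adverb")]


def assign_form_classes(sentence):
    """Guess a form class for each word from marking words, suffixes, and order."""
    words = sentence.rstrip(".").split()
    tags = []
    for word in words:
        lower = word.lower()
        tag = FUNCTION_WORDS.get(lower)
        if tag is None:
            for suffix, suffix_tag in SUFFIX_CUES:
                if lower.endswith(suffix):
                    tag = suffix_tag
                    break
        tags.append(tag)

    # Order cue: in the run of untagged words after a determiner, the last
    # word is taken as the noun and any earlier ones as adjectives.
    i = 0
    while i < len(words):
        if tags[i] == "determiner":
            j = i + 1
            while j < len(words) and tags[j] is None:
                j += 1
            for k in range(i + 1, j - 1):
                tags[k] = "adjective"
            if j - 1 > i:
                tags[j - 1] = "noun"
            i = j
        else:
            i += 1
    return list(zip(words, tags))


for word, tag in assign_form_classes("The iggle squigs trazed wombly in the harlish goop."):
    print(word, tag)
```

Once each new word has a class, it can be placed in any frame that admits that class, which is the sense in which a population of utterances is placed in readiness.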

The description of syntax, which has 
been given less than cursory treatment 
here, is set forth in detail in the works 
of Harris (1951), Hockett (1958), and 
Gleason (1955). A new and very 
promising approach in terms of rules of 
transformation is outlined by Chomsky 
(1957). The adult speaker who can say 
“The boy hit the ball” can also trans- 
form this sentence into “The ball was 
hit by the boy.” The investigation of 
the child’s syntactic system should in- 
dicate when and how children learn to 
make such transformations. 


FORM CLASSES 
AND WORD ASSOCIATION 


As long as child speech is nothing 
more than the production of practiced 
utterances, we should expect any 
isolated word to suggest to a child the 
word that most commonly follows it. 
We should, in short, expect word as- 
sociation to follow primarily sequential 
paths. As meaningful forms become 
organized into classes of functional 
equivalents, we should expect an 
isolated word to suggest another word 
from the same form class, although that 
other word may never directly follow 
the stimulus word in any utterance. 
Word-association tests with children 
ought to show that with increasing age 
vocabulary is bundled together in terms 
of general functional equivalence rather 
than in terms of particular sequences. 
Ervin (1957), for one, has found such 
evidence. 

A list of common words belonging to 
various form classes was presented to 
children in kindergarten and in the 
first, third, and sixth grades. The responses 
were classified according to 
form class. The two youngest groups of 
children gave 48 per cent noun re- 
sponses to hand, whereas the sixth 
graders gave 79 per cent noun responses 
to this stimulus word. Comparable in- 
creases occurred with verbs and prep- 
ositions, and there was a reduction in 
frequency of words from classes that 
generally follow the stimulus word. 
Similar results were obtained with 
words representing a variety of form 
classes: him, these, there, always, walk- 
ing, sorry, sick, either. The younger 
children tended to give more phrase 
responses; e.g., to such verbs as build 
and give they responded with articles 
plus nouns more often than with verbs. 
With the older children, generally, 
there was an increase in paradigmatic 
responses—words that are functionally 
equivalent. This learning of form classes 
founded on the principle of substi- 
tutability is what makes possible the 
creation of new sentences. 
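
Scoring of this kind can be sketched briefly. The small lexicon and the list of responses below are invented for illustration and are not Ervin's (1957) data; the function name paradigmatic_share is likewise hypothetical.

```python
# A toy lexicon assigning each word to a form class (illustrative only).
FORM_CLASS = {
    "hand": "noun",
    "foot": "noun",
    "glove": "noun",
    "shake": "verb",
    "me": "pronoun",
}


def paradigmatic_share(stimulus, responses):
    """Per cent of responses drawn from the same form class as the stimulus."""
    same = sum(FORM_CLASS[r] == FORM_CLASS[stimulus] for r in responses)
    return 100 * same / len(responses)


# Invented responses to the stimulus word "hand".
print(paradigmatic_share("hand", ["foot", "shake", "glove", "me"]))  # 50.0
```

A rise in this percentage with age is what the shift from sequential to paradigmatic association would look like in such a tally.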

Members of a form class may be 
marked with some phonetic common 
denominator, may be identified by 
some reliable concomitant form (as the 
helps to identify nouns in English), and 
may have more or less consistent seman- 
tic correlates in addition to the shared 
privileges of occurrence that define the 
class. Languages differ in the means used 
to mark various classes. We would ex- 
pect the age at which classes are learned 
to be a direct function of the clarity 
with which class membership is marked. 


IDENTIFICATION 
OF FORM-CLASS MEMBERS 


An approach to form-class recogni- 
tion was made by Porter (1955) in an 
investigation of the concept of the 
verb. Using adult subjects and children 
between the ages of 7 and 13, he attempted 
to discover by what cues we 
identify verbs and if there are dif- 
ferences between adults’ and children’s 
ideas of the verb. Since the verb may 
be identified by its position in the sen- 
tence, by special function words, by 
affixes, or by its lexical meaning, he 
attempted to manipulate each of these 
variables. 

The subjects were presented with a 
sentence typed on a 3 x 5 card and 
asked to identify the verb. Children in 
the third, fourth, fifth, sixth, and 
seventh grades were interviewed individually 
for this experiment. A 
sample sentence with the verb underlined, 
The cow JUMPED the fence, was 
shown them, and they were told they 
were playing a guessing game. Their 
instructions were: “See if you can guess 
the word I am thinking of. It will be 
one like this, JUMPED.” They were then 
given a practice sentence, A chicken 
lays eggs, and after they correctly chose 
the verb the test was administered. 
Adults were simply shown the sample 
sentence and told to pick words like 
jumped. A variety of cards was pre- 
sented, and the available cues for the 
identification of the verb were con- 
trolled. Children were asked on every 
fifth card, “Why did you choose that 
word?” Children were presented with 
three parallel sets of sentences, each 
containing 26 sentences. The first ten 
in each set were English sentences of 
increasing complexity. Sentences 11 to 
15 were nonsense sentences containing 
varying cues. Some were as cue-laden 
as A cudof* biced* the sitev,* others as 
abstruse as Posiv* niret* malug* fohik* 
gujol.* Sentences 16 to 20 were English 
sentences with less obvious cues, and 
21 to 26 were nonsense sentences of a 
different paradigm. Adult subjects saw 
all but the first ten sentences in each 
set. 

In identifying the verb, the cue that 
proved most effective was position and 
the least, lexical meaning. The main 
difference between adult and child 
responses was a tendency for the children 
to choose on the basis of position, 
disregarding other cues. This was true 
only on the five-word sentences, however. 
There, adults chose the third 
word, whereas children chose the second, 
which was the position of the verb 
in most of the English sentences. In 
the sentence Docib* hegof* gufed* 
rupan* tesor* the children erred in the 
direction of choosing hegof* rather 
than gufed* as the verb. In the 
children’s verbalizations it was often 
obvious that they had chosen—and cor- 
rectly—without being able to make 
conscious their principle of selection. 
One child, when asked why he chose 
the word saw in Bill saw a distant house, 
said, “Bill saw a distant house—it 
sounded right when I said it loudly.” 
Many of these subjects were thus able 
to identify new instances of the form- 
class verb without being able to supply 
a definition of that class. 




THE SEMANTICS 
OF ENGLISH PARTS 
OF SPEECH 


One of the aims of psycholinguistic 
research is to investigate the influence of 
language on cognition and perception. 
The determining influence of the vo- 
cabulary of a language is quite clear, yet 
anthropologists like Whorf (1956) have 
felt that grammatical features of the 
language affect the speaker’s cognition 
and that speakers of structurally dif- 
ferent languages have different world 
views. The studies described in this 
section undertake to show how one kind 
of grammatical practice, the allocation 
of words to one or another part of 
speech, affects cognition. 

The studies described above have 
shown that as speakers grow older they 
tend more and more to group together 
members of the same form classes. In 
their free associations they come to 
respond with members of like classes, 
and they recognize new instances of 
these classes. These studies have in- 
dicated that groupings are made on the 
basis of certain formal and structural 
equivalences and in the absence of any 
common meaning. We have all been 
told in school that a noun is “the name 
of a person, place, or thing,” but it is 
obvious that truth is neither a person 
nor a place, and if it is a thing the 
meaning of thing must be very am- 
biguous. It is impossible to demonstrate 
that a consistent semantic underlies each 
of the English parts of speech, that 
verbs always refer to actions, nouns to 
things, adjectives to qualities, and so on. 
Yet it is just these semantic implications 
that we would expect to affect cogni- 
tion. 

In an examination of children’s vo- 
cabularies Brown (1957) investigated 
the possibility that in child language the 
parts of speech are more clearly de- 
finable in semantic terms. He found 
this to be so. In a comparison of adults’ 
and children’s most frequent nouns and 
verbs he found that most of the children’s 
nouns referred to picturable objects, 
such as truck or barn, whereas 
fewer of the adults’ nouns named categories 
having a characteristic visual 
contour. Similarly, in a comparison of 
verbs he found that the children’s verbs 
characteristically name movements and 
that a much smaller percentage of 
adults’ verbs refer to actions. These 
studies confirmed the impression that 
the nouns and verbs used by children 
have more consistent semantic implica- 
tions than those used by adults. 

New words are generally introduced 
to children in a way that makes their 
part-of-speech membership clear. The 
child is told “Look at the dog” or “See 
him running.” If a part of speech has 
reliable semantic implications, it could 
call attention to the kind of attribute 
likely to belong to the meaning of the 
word. A child who had absorbed the 
semantics of the noun and verb would 
know the first time he heard the dog 
that it was likely to refer to an object 
having some characteristic size and 
shape, whereas running would be likely 
to name some type of movement. The 
part-of-speech membership of a new 
word could act as a filter, selecting for 
attention probably relevant features of 
the nonlinguistic world. In an experi- 
ment with preschool children Brown 
(1957) attempted to determine whether 
children experience any filtering of at- 
tributes by introducing newly invented 
words assigned to one or another part 
of speech and then inquiring about the 
meanings the words appeared to have. 

The parts of speech under consideration 
were a verb ending in -ing, a 
noun, and a subclass of the noun known 
as a mass noun. Mass nouns in English 
are nouns like milk and rice; when 
some is used with a particular noun like 
barn, the noun is made plural (some 
barns), whereas a mass noun would be 
in the singular (some rice). Mass nouns 
usually name extended substances having 
no characteristic size or shape. 

Brown’s experiment involved presenting 
three sets of four pictures to 
individual preschool children. A typical 
set consisted of the following: the first 
picture showed a pair of hands performing 
a kneading motion with a mass 
of red confettilike material, which was 
piled into and overflowing a striped 
container. The important features were 
the kneading motion, the material in the 
container, and the container. Each of 
the remaining three pictures reproduced 
one of the three salient features of the 
first picture. In order to represent the 
motion a second time, it was also nec- 
essary to show a mass and a container, 
but these were of different colors. The 
other two sets of pictures involved dif- 
ferent content but always a motion, a 
mass substance, and a particular object. 
The children were shown the first pic- 
ture in conjunction with a new word 
identifiable either as a mass noun, a 
particular noun, or a verb; they were 
then shown the other three pictures and 
asked to point out again what had been 
named in the first picture. The stem 
words used were miss,* sib,* and latt.* 
If the word was to function as a mass 
noun, the experimenter would ask the 
child, “Have you ever seen any sib*?” 
He would then show the first picture 
and say, “In this picture you can see 
some sib.*” He then showed the other 
three pictures and asked the child to 
find another one that showed some 
sib,* and the child was expected to 
point to the other picture containing 
a mass. For the verb, the child was told 
the picture showed sibbing,* and for 
the particular noun he was told it 
showed a sib.* He was accordingly ex- 
pected to select the other pictures 
showing the motion or the container. 

The results showed unambiguously 
that the children chose the container 
when the word was presented as a particular 
noun, the mass substance when 
it was given as a mass noun, and the 
action when the nonsense word took 
the form of a verb. This study suggests 
that the part-of-speech membership of 
a word indicates its general meaning, 
whether action, object, or substance, 
and that with very young children 
learning English the semantic implica- 
tions of the verb, the mass noun, and 
the particular noun are discovered and 
utilized. 


FORM-CLASS SEMANTICS 
IN NONLINGUISTIC SORTING 


An experiment concerning the Nav- 
aho language was conducted by Carroll 
and Casagrande (1958) in an attempt 
to see if Navaho object class would 
affect children’s form or color preferences. 
In the Navaho language it is obligatory 
when using verbs of handling 
to employ different verbal forms, de- 
pending on the shape or other physical 
attributes of the object being talked 
about. The nouns themselves carry no 
class marker, but all speakers know the 
verb stem appropriate for any given 
noun. Because of this classificatory system, 
the experimenters hypothesized 
that Navaho children would discrimi- 
nate form earlier than English-speaking 
children. 

One hundred and thirty-five children 
on the Navaho reservation were sub- 
jects in this experiment. On the basis 
of a bilingualism test, they were di- 
vided into balanced bilinguals, Eng- 
lish-dominant, and Navaho-dominant 
groups. The tests were conducted in 
English or Navaho, depending on the 
child’s dominant language, with the aid 
of an interpreter when necessary. Test- 
ing was done in the hogans and in the 
presence of family members and other 
observers. In the experiment 
3-to-10-year-old children were presented with 
pairs of objects—sticks, ropes, blocks, 
etc. The members of the pairs differed 
from one another in two respects—size 
and shape, shape and verb-form class, 
color and size, color and shape. For 
each pair, the child was shown a third 
object similar to each member of the 
pair in only one of two relevant char- 
acteristics and asked which of the pair 
went best with the third object. The 
child might be shown a pair consisting 
of a yellow stick and a blue rope and 
then be asked which of them went best 
with a yellow rope. If he chose the 
stick, it was on the basis of color; if 
he chose the rope, it was on the basis 
of the Navaho verb form classification. 
Children whose dominant language was 
Navaho tended to choose on the basis 
of similarity of shape or verb-form 
classification rather than on the basis 
of size or color. English-dominant 
Navaho children were, on the contrary, 
more attracted to size and color. The 
performances of Navaho children were 
also compared with results from a 
group of upper middle-class American 
children from the Boston area. The 
youngest Navaho-dominant children 
showed more form preference than 
their American age-mates, but at older 
levels the Navaho-dominant and Ameri- 
can children were about the same. Eng- 
lish-dominant Navaho children showed 
much less form preference than either 
the Navaho-dominant or American 
children. 
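
The logic of a single matching trial can be sketched as follows, using the yellow stick, blue rope, and yellow rope of the example above. The attribute names and the function basis_of_choice are assumptions made for illustration; "form" here stands in for the Navaho verb-form classification of the object.

```python
def basis_of_choice(chosen, standard):
    """Return the attributes on which the chosen object matches the standard."""
    return [attr for attr in chosen if chosen[attr] == standard[attr]]


yellow_stick = {"color": "yellow", "form": "stick"}
blue_rope = {"color": "blue", "form": "rope"}
yellow_rope = {"color": "yellow", "form": "rope"}  # the third object

print(basis_of_choice(yellow_stick, yellow_rope))  # ['color']
print(basis_of_choice(blue_rope, yellow_rope))     # ['form']
```

Each member of the pair matches the third object on exactly one attribute, so a choice reveals which attribute the child relied on.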

The authors postulate that the tend- 
ency of a child to match on the basis 
of form rather than color increases with 
age. This discrimination may be facilitated 
by a society (like the American) 
that gives its children elaborate toys 
that require form discrimination; or it 
may be influenced by the child’s speak- 
ing a language like Navaho, where ob- 
ligatory verbal categories direct his at- 
tention to the forms of the objects 
surrounding him. 

The organization of linguistic forms 
into functional classes represents the 
development of a flexible grammatical 
system, which makes language an in- 
finitely adaptable instrument rather than 
a finite set of rehearsed sequences. It is 
clear that children are able to assign 
forms to the proper form class on the 
basis of syntactic and phonetic cues. In 
addition, however, they are aware of 
rough semantic correlates and use the 
form-class membership of a new word 
in guessing the sort of reference the 
word is likely to make. It is, in addi- 
tion, possible that continuing attention 
to these semantic dimensions for the 
purposes of first-language learning 
causes a generally heightened sensitivity 
to them that affects perception 
and cognition even when language is 
not explicitly involved. 


There is some amusement to be had 
from the invention of language games 
for young children, but the scientist 
who is not wantonly eccentric will pro- 
fess an aim beyond his own entertain- 
ment. What are the large questions that 
the psycholinguist dreams of answering 
with his phonemics and morphemics, 
with his wugs,* zazzes,* and < 

Linguistic science itself aims to find 
a set of procedures and a set of analytic 
units of complete generality, free of 
bias from any particular language and 
adequate to the description of all lan- 
guages. This aim has been accomplished 
for phonology, is near accomplishment 
for morphology, and is well begun on 
syntax. As languages are described in 
the abstract terms of phonemics, mor- 
phemics, and grammar, they become 
comparable one with another. The 
Hawaiian language has fewer segmental 
phonemes than the Chippewayan, but 
(and indeed, one could say “therefore”) 
Hawaiian morphemes are generally 
longer phonemic strings than are Chip- 
pewayan morphemes (Hockett, 1958). 
Russian has greater morphological com- 
plexity than Chinese, but (and again, 
perhaps, “therefore”) Chinese has the 
greater syntactical complexity. The 
student of child language will hope to 
see his work follow close behind de- 
scriptive linguistics, turning up laws 
about the relative advantages, difficul- 
ties, and delays attendant on learning 
languages that have varied structural 
characteristics. 

The psycholinguist is also likely to 
have in mind the far-reaching propositions 
concerning linguistic relativity 
and linguistic determinism, which have 
been championed by Whorf (1956) 
and by many others. Whorf holds that 
persons speaking very different native 
languages (e.g., English and Hopi) will 
have very different cognitive psychologies. 
This is the linguistic-relativity 
thesis, which maintains that cognitive 
style and language structure covary. 
Whorf holds, in addition, that the native 
language is a major determinant 
of cognitive style. The language offered 
the child is held to be a kind of mold, 
giving shape to his mind. Study of the 
relativity thesis involves cross-cultural 
investigations of language and of adult 
nonlinguistic behavior. However, to 
establish language as a formative influ- 
ence on perception and thought, it is 
necessary to study the acquisition proc- 
ess in children belonging to different 
language communities. Several of the 
studies described in this chapter have 
had this determinism thesis in mind; 
they include the work of Brown (1957) 
and of Carroll and Casagrande (1958). 
Whorf’s realization that language is 
more than a perceptual-motor skill, that 
it is a prime agent of cognitive sociali- 
zation, has given the study of language 
acquisition a new interest for psy- 
chology. 

It may seem strange that we psy- 
chologists should need to have language 
described for us by a special linguistic 
science. Since language is everywhere 
about us and within us, we have only 
to look to see what it is. Still it is a 
fact that the descriptions of language 
behavior written by psychologists have 
usually been far too simple and rigid; 
we have been prone to falsify the nature
of language to make it fit a few
principles discovered in animal learn- 
ing. The linguistic emphasis on system 
should make this kind of thing impos- 
sible for the future. Man is not a simple 
sort of monster gorged with words and 
phrases that are shaken out of him like 
pennies from a piggy bank. We experi- 
ence a sample of language function 
when we are young and are thereby 
structurally altered so that we spin out 
the implications of that experience the 
rest of our lives. That makes us a com- 
plicated sort of monster. 

Many research problems in child development
are concerned with the impact
of environmental events on the
child’s development and functioning.
Such events can be defined in terms of 
an “objective” analysis of the stimulus 
conditions or they can be defined sub- 
jectively in terms of the experiencing 
person in the situation. The interview 
is a technique particularly well adapted 
to uncovering subjective definitions of 
experiences, to assessing a child’s per- 
ceptions of the significant people and 
events in his environment, and to study- 
ing how he conceptualizes his life ex- 
periences. 

The interview has a variety of ap- 
plications in developmental research. It 
may be the principal instrument for 
data collection, as in studies on concept 
development and research on attitudes 
and values. Interviewing may be used
in pilot studies for developing hypothe- 
ses to be tested subsequently with other 
techniques; in pretesting personality 
questionnaires and attitude scales the 
interview may be useful in uncovering 
ambiguous or unclear items. More re- 
cently, interviewing has been utilized 
in a significant way in experimental 
studies. In learning or frustration ex- 
periments the child may be interviewed 
after the experimental procedure to ex- 
plore his perception of the experimental 
situation, thus helping to clarify the 
impact of varied experimental proce- 
dures. A similar application of the in- 
terview, the “focused interview” (Mer- 
ton and Kendall, 1946), has been made 
with adults in an attempt to pinpoint 
specific elements of a larger situation, 
such as a film or radio broadcast, which 
have been significant in influencing at- 
titudes. Concurrent with the growing 
interest among psychologists in the de- 
velopment of ego-functions, the inter- 
view situation is being used increasingly 
to observe the child’s manner of coping 
with varied aspects of a defined social 
situation, e.g., attempts at mastery, persistence,
dependency, and passivity.
The major share of literature on in- 
terviewing has been clinically oriented, 
although recently several excellent 
articles and books have given detailed 
consideration to the use of the interview
in social research (Hyman, 1954;
Kahn and Cannell, 1957; Maccoby and 
Maccoby, 1954). There is, however, 
little available literature on methods of 
interviewing children for research. This 
chapter focuses on those aspects of the 
interview technique that are particu- 
larly relevant to research on children. 
The interview is distinguished from 
other research approaches commonly
used with children by its dependence 
on an interpersonal relationship. This 
is a source of strength as well as diffi- 
culty in its use as a research instrument. 
The commonest justification offered for 
the choice of the interview over more 
impersonal techniques, such as the per- 
sonality inventory or attitude question- 
naire, is the presumed facilitating effect 
of the personal relationship in the com- 
munication process. It is likely that in 
many research situations the interper- 
sonal relationship will contribute sub- 
stantially to the validity of the data. 
The direct relationship in the interview 
reduces misunderstandings by provid- 
ing the opportunity to clarify the 
meaning of unclear questions. Further- 
more, very inhibited children may need 
the reassurance of the interviewer's per- 
missive attitude before they are able 
to express negative attitudes or reveal 
emotionally charged feelings or experiences.
We cannot, however, assume uncritically
the greater value of the interview
for all kinds of problems. For example,
a child may be freer in expressing
criticism of some aspect of school
policy on a written questionnaire than
in a direct interview with a strange
adult. Similarly, an adolescent may be
less able to express verbally in direct 
interview the details of his sex infor- 
mation than in writing. The fact that 
the nature of the child's relationship 
with the adult in the interview situa- 
tion exerts a significant influence on the 
content of his responses provides a 
source of variance difficult to control. 

Numerous problems in interviewing 
children are determined primarily by 
age-level characteristics. A major problem
derives from the child’s limitations
in language facility and comprehension
of language. Other problems have their
origins in the conventional role relationships
between children and adults
in our culture. Thus we may have overdependent
behavior at one developmental
stage and extreme negativism at another
period. In establishing comparability
of meaning between one interview
situation and another there may be
special difficulties because of differing
stages of language maturity in children
of approximately the same age. Other 
difficulties in interviewing, although not 
uniquely characteristic of its use with
children, do require special considerations,
e.g., developing adequate rapport
with the child while keeping the relationship
within nontherapeutic bounds
and avoiding communication of the interviewer's
biases.

In planning the research use of the 
interview, therefore, a major concern 
is the establishment and control of the
relationship between the child and the 
interviewer. In addition to the broad 
question of rapport, there are many 
aspects of the relationship to be con- 
sidered, such as the meaning of the 
interview relationship to the child and 
the role relationships established. In this 
chapter we have attempted to formu- 
late some general principles relating to 
the varied aspects of the interpersonal 
relationship and to consider the major 
issues in the construction of interview 
schedules. Although it is not possible to 
present “cookbook” directions for inter- 
viewing, specific attention is given to 
such issues as the choice of language in 
the formulation of questions and the 
degree of directiveness and structure in 
the interview as a whole. From the general
principles developed and the examples
of specific techniques, it should be
possible to derive creative adaptations
to particular research goals and specific
research situations.


USE OF THE INTERVIEW 
AT DIFFERENT AGE LEVELS 


Developmental factors enter into con- 
sideration of interview procedures at 
several points. These factors, namely comprehension
of language, language facility,
affective and role relationships between
children and adults, and normative motivations
(motivational characteristics of
different age levels, e.g., negativism,
independence strivings), must be considered
in evaluating the use of the interview
with children and in suggesting
the adaptations required in interview
procedures at different developmental
levels.

Our first concern is how far down 
the developmental scale we can go in 
applying interview procedures. Because
of its dependence on language, the conventional
interview is not applicable at
a preverbal stage of development or
during a period when the child is still
concerned with mastery of the mechanics
of language; similarly, it is
inapplicable to children having auditory
or language handicaps and emotionally
disturbed children with severe language
inhibitions. There has been a general
reluctance to use the interview with 
children under 6 years of age because 
of the assumption that the preschool
child’s language comprehension, his language
facility, and his lack of motivation
to communicate preclude effective
use of the interview. The research bearing
on this question is very meager.
Radke (1946) in a study of parental at- 
titudes used a direct interview produc- 
tively with intellectually superior chil- 
dren just under 4 years of age in assess- 
ing their perceptions of parental roles 
and control techniques. She was im- 
pressed with “the clarity and coherence 
of the individual reports of the children 
and . . . the degree of correspondence
between reports of the child and reports
of the parents on the same issues.”

Ammons (1950) used a verbal inter- 
view in conjunction with doll play to 
study awareness of racial differences in 
2-, 3-, 4-, and 5-year-old children. In
terms of the cooperativeness of the
children, his findings are positive in
regard to applicability of the interview
to children more than 2 years of age.
Among the 2-year-olds there was a high
percentage of refusals to respond (56
per cent of the children) and a sharply
decreasing number of refusals at other
age levels, diminishing to a negligible
number (10 per cent) of 5-year-olds.

Despite these positive evaluations of 
the effectiveness of the interview in the 
preschool period, considerable reluctance
to use a direct verbal technique
at these ages exists. This reluctance
derives partly from the data on language
characteristics of preschool children,
i.e., data on vocabulary, articulation,
and the functional use of language.
Vocabulary at 2 years of age is limited;
the total vocabulary is estimated at 200
words (McCarthy, 1954). Although
there is a very great increase in vocabulary
between 2 and 3 years of age, the
child's language is still very self-conscious.
One might draw a parallel between
the child’s use of language during
this period and his earlier awkwardness
and uncertainty in locomotion,
when his concern with the mechanics 
of the process transcends its functional 
value. Children between 2 and 3 years 
of age use words in a very limited way 
for exchanging information, ideas, or 
concepts; speech may be a form of 
play, or language may be used in an 
instrumental way to express the child’s 
needs and, to some extent, to control 
the behavior of others in accordance 
with his needs. There may be some 
difficulty at this age in understanding 
the child’s language because of poor 
articulation. According to available research
on comprehensibility of children’s
speech, about 30 per cent of responses
of 2-year-old children are incomprehensible.
By 3½, even though
articulation may not be perfect, only 
a negligible percentage of language re- 
sponses cannot be understood (Mc- 
Carthy, 1930; Metraux, 1950). 

By 4, children’s speech is not only 
more easily understood but there is a 
great change in the function of lan- 
guage. Between 4 and 5, children be- 
come much more interested in exchang- 
ing information, in describing events in 
their experiences, and, in a very con- 
scious way, in directing activities of 
others. Even after children have ac- 
quired a large vocabulary, they remain 
limited in the complexity and subtlety 
of ideas they are able to express directly 
in words. Very complex and subtle con- 
cepts are often expressed in highly sym- 
bolic language, a fact that offers diffi- 
culties to the adult in interpreting chil- 
dren’s interviews. Lois Murphy (1956), 
in discussing obstacles to communication
between children and adults, notes,

Children under five may not be able to
express their feelings in words; their language
may be full of phrases expressing
their wants, and descriptions like “Garage
man fixes broken bumper,” which may
reflect some concern about body damage;
but such directly expressed remarks as, “I
was very angry at my mother for not
giving me two ice creams” are not within
the range of many preschool children. At
best, a grumbling “stinky old mommy”
attempts to convey the feeling (page x).

Similarly, Vigotsky (1939) has 
pointed out that the egocentric lan- 
guage of early childhood is based on a 
grammar of thought and syntax of 
word meaning that differ markedly 
from the conventional socialized speech
of adults.


The meaning of the child’s words is 
derived from the whole complex of sub- 
jective experience which it arouses in him, 
and one word in his egocentric speech may 
be saturated with sense to such an extent 
that it would require many words in socialized
speech to explain it (Hartley et
al., 1952, p. 7).


These observations point up some of 
the hazards and suggest the lower limits 
in terms of age for the use of the inter- 
view. On the whole, research evidence 
suggests that the direct interview can be 
used effectively with 4-year-olds. Un- 
der this age, interviewing can be car- 
ried on productively with special adap- 
tations to children’s linguistic and mo- 
tivational characteristics. We can take 
advantage of the fact that passive vocabulary
develops much earlier than
active vocabulary by using techniques 
requiring only the comprehension of 
language. A valuable adaptation of the 
conventional interview for children under
4 years of age is the picture-choice
technique, in which the problem is defined
verbally and the only response
required of the child is a choice among
a series of pictures (Horowitz, 1943).
Creative use of this technique provides
the possibility of studying rather complex
and subtle feelings and attitudes
without making great demands on the
child's verbal ability. It is a technique
that continues to be interesting to children
into adolescence. Another adaptation
of the standard interview is the
doll-play interview, in which the child
acts out with the dolls his response to 
a verbal question and a scene set up by 
the experimenter. The doll-play inter- 
view is more limited in the age range 
for which it is effective; its greatest use- 
fulness is between 3 and 5 years of age. 
Although it retains interest value for 
girls well into middle childhood, with 
boys it begins to lose its usefulness 
after 5. 

A number of factors other than lin- 
guistic ones inhibit adequate commu- 
nication between the preschool child 
and the adult and affect the validity of 
the interview. During this period, when 
there is persistent testing out of adults, 
the child may simply refuse to respond 
to the interviewer’s verbal approach or 
he may use this opportunity to engage 
in playful teasing behavior that might 
be expressed in deliberate distortion of 
his responses. These are sources of diffi- 
culty that can be handled to some ex- 
tent by skillful manipulation of the sit- 
uation and establishment of a relation- 
ship that would make such behavior un- 
likely and inappropriate. If the inter- 
viewer is casually permissive and avoids 
communicating any anxiety of his own 
about the child’s responding, he is less
likely to arouse teasing negativism. 
Testing-out behavior may be decreased 
if the interviewer is not a complete 
stranger to the child; in interviewing 
done in a nursery school setting the 
interviewer may become familiar to the 
children as an observer in the nursery 
school some time before the first inter- 
view. Playing a game with the child or 
showing him a trick gadget before the 
interview may help establish rapport. 

Although the young child may be 
negativistic, he may at the same time 
be highly suggestible, and special care 
must be exercised by the interviewer 
to avoid influencing the child to give 
the response he thinks will win adult 
approval. Careful wording of questions 
to suggest that any one of several alter- 
native responses is acceptable may be 
helpful, e.g., “Some children think
that . . . . Others think that . . . .
Maybe you have some different ideas
about it. . . . We like to know all the
different ideas that 6-year-old boys
have about this. . . .” Sometimes systematic
use of countersuggestion may
be effective, e.g., presenting the same
question in several forms, slanting the 
response first in one direction and then 
in the opposite direction. (See p. 582 
for further discussion of these tech- 
niques.) 

During the middle childhood or la- 
tency period, the child makes important 
strides in language ability and in his 
whole conceptual development. Chil- 
dren of this age are capable of thinking 
about ideas that are removed from their 
immediate concrete experiences. They 
have well-developed language skills and 
their concepts of time are becoming 
clarified; time is measured in terms of
life history events (Stone and Church, 
1957). In contrast to the early pre- 
school child’s feeling-oriented speech, 
language at this age is socially directed 
and utilitarian. Biber and her associates
(1952), in describing a group of su- 
perior 7-year-olds, note, “Social ex- 
changes through the medium of lan- 
guage in this group are on what ap- 
pears to be practically an adult level 
as far as understanding and purpose are 
concerned.” They feel that language is 
used very little at this age as a means 
of expressing feeling. It tends to be 
used almost exclusively in the service 
of communication of ideas. 

Although language facility may be 
no bar to communication in the middle 
childhood period, clinicians have noted 
among children at this age an intensified 
resistance to revealing their feelings, 
concerns, and attitudes to adults. There 
is a tendency for children in middle 
childhood to exclude adults actively 
from their private world. Unlike the 
preschoolers who often think out loud, 
in the middle childhood period children 
are more likely to keep their feelings 
and thoughts to themselves. This behavior
has been considered a normal
developmental phenomenon, which is 
part of the process of the child’s estab- 
lishing control of his developing ego. 
In the latency period, according to psy- 
choanalytic interpretation, the child has 
achieved a calm based on a temporary 
abeyance of the conflict between bio- 
logical impulses and social demands. 
The defensiveness against revealing sig- 
nificant feelings to adults is considered 
adaptive behavior motivated by fear of 
upsetting this unstable equilibrium. 
The latency period is a time of intense
interest in exploring and mastering
the environment. We can capitalize
on these normative characteristics to
enhance rapport and facilitate verbal
expression, e.g., giving the child materials
to handle or build with during the
interview or playing a game prior to
the interview. This is an age when competitive
games have special appeal. A
game that is moderately involving, but
more dependent on chance than on
skill (e.g., Parchesi), is desirable to
avoid inducing a failure experience and
to avoid fostering a competitive relationship
between the child and adult.
The use of a game during the interview
procedure is appropriate only for certain
kinds of interview content. It may
be quite inappropriate for an interview
focused on personal relationships involving
rivalry feelings or feelings of
personal or intellectual adequacy. There
are difficulties, too, in controlling failure
experiences in a game during the
interview. Any manipulation by the interviewer
to assure the child’s success
may be perceived by the sensitive child
and result in a deterioration in the relationship.

Throughout the literature on inter- 
viewing, there are casual statements 
suggesting that the sex of the inter- 
viewer may influence significantly the 
responses of boys and girls, particularly 
during the middle childhood and ado- 
lescent periods. In the latency period, 
boys’ social code requires overt rejec- 
tion of girls and women. There is no 
clear parallel among girls. Although 
girls may show disdain for boys, this disdain
does not generalize to adult men.
There are no systematic research findings
on the influence of the sex of the
interviewer on the responses of boys and 
girls. Studies of adults have pointed up 
the biasing influence of the interview- 
er’s sex on responses to certain kinds of 
ego-involving material. However, there 
are no firm conclusions that the sex of
the interviewer introduces systematic
bias in all kinds of interview situations.
In the clinical literature on therapy with
children there are contradictory suggestions
as to the significance of the sex
of the therapist. In child therapy
women analysts have worked with apparent
success with boys during the
latency period. On the other hand,
Fleming and Snyder (1947), who
studied this problem specifically in
relation to nondirective therapy with
boys, found that 10-year-old boys did
not respond so well to a female as to 
a male therapist. Although it may be 
generally characteristic of this develop- 
mental period in our society that boys 
have difficulty in relating to a woman 
in an interview situation, it is by no 
means a universal phenomenon and 
undoubtedly is modified by the specific 
experiences of the child. The personal 
characteristics of the interviewer may 
play an equally significant role in terms 
of associations with familiar figures in 
the child’s environment. The influence 
of the interviewer’s sex may also vary 
with class or other social-background 
factors. In the research interview in 
which contact may be limited to one 
session, it is important to be aware of 
the possible inhibiting influences of the 
sex or personality of the interviewer. 
The adolescent’s resistance to adult 
attempts to probe his private world is 
a characteristic that has been discussed 
so much that it has acquired the status
of a stereotype. It is a real factor that 
must be considered in research with 
adolescents. We must be aware, too, 
that there are significant differences be- 
tween different periods of adolescence. 
Whereas the late adolescent period is 
comparable to adulthood insofar as 
interview methods are concerned, the 
early adolescent, roughly between 
12 and 14, tends to be closer in psy- 
chological orientation to the late latency 
child. Here we find a rejection of pa- 
rental identification with a replacement 
by a peer group identification. At this 
age it is likely that the adult whose at- 
titudes express an acceptance of the 
child on an equal-status basis will be 
better able to establish rapport than the 
adult who represents the parents or 
adult-authority figures. However, de- 
spite the emotional resistance at this 
age, there are other characteristics, such 
as intense intellectual curiosity and 
idealism that might well be used to 
motivate cooperation in research. In the 
next period, middle adolescence, 
roughly 14 to 16, there tends to be 
heightened preoccupation with self and 
a greater withdrawal within the self. 
The child at this period is also striving to 
find a satisfactory ego-ideal; these 
strivings may result in a heightened 
awareness of attitudes and opinions of 
others and a tendency to accept and in- 
corporate the opinions of acceptable 
adults. This tendency necessitates
greater vigilance on the part of the in- 
terviewer not to reveal his own atti- 
tudes and not to suggest any response. 
It is important, too, during this stage 
that the interviewer does not threaten
the child through overfriendliness.
Geleerd (1957) suggests in relation to
psychotherapy with adolescents that 
“the ego of the adolescent is already so 
threatened by the increased id drives 
that a very little overeagerness on the 
analyst’s part may be too overwhelm- 
ing.” 

The sex of the interviewer is likely 
to have a significant effect on rapport 
at this age. We might anticipate that 
boys will be inhibited by women inter- 
viewers and girls will feel less free with 
men interviewers during adolescence 
than in the latency period. It is even 
more likely that an interactive influence 
will exist between the interview content 
and the sex of the interviewer. 

The motivations that will be effec- 
tive in involving the child in the inter- 
view and in stimulating free response 
also vary somewhat for children of dif- 
ferent ages. With the young child it is 
not possible to capitalize on any abstract 
motivations relating to research. For 
him the gratifications must be imme- 
diate; he is not capable of conceptual- 
izing the interview as a step toward a 
deferred goal. On the other hand, the 
adolescent, like the adult, may be mo- 
tivated to express his feelings and atti- 
tudes about a particular topic, or he 
may see the interview as an opportunity 
to influence a desired change, e.g., bring 
about changes in school policies. Al- 
though the indirect association between 
the research findings and some desired 
outcome can sometimes be stressed 
meaningfully, ethical considerations 
would prevent the interviewer from 
deliberately misleading the child to an- 
ticipate any direct results. 

The most effective motivations to
communicate with the adult are developed
during the course of the interview.
The interview experience itself
can be a gratifying one to children of
all ages; their feelings of status are enhanced
by receiving the full attention
of an interested adult. After the child
has been in the interview situation and
has had an opportunity to air his
thoughts and feelings to a permissive
adult, the positive feelings developed
may be sufficient to sustain the interview
and to motivate the child to respond
to difficult questions requiring
real effort.


It is convenient to discuss the characteristics
of children that offer difficulties
in interviewing in terms of developmental
stage phenomena. However,
we know that motivational characteristics
cannot be too closely defined
by a given chronological age; there may 
be considerable overlap between devel- 
opmental periods. Also, it must be rec- 
ognized that individual characteristics 
may transcend age-level characteristics. 


INTERPERSONAL RELATIONSHIP 
AND INTERVIEW SETTING 


A research interview has character- 
istics that make it uniquely different 
from other familiar social situations in 
the child’s life. In this setting the child 
has the full attention of an adult who 
is interested in him and in his ideas. 
This person is nonjudgmental and ac- 
cepts the child’s attitudes and feelings 
without criticism or threat. He accepts 
information or answers to questions 
without judging them right or wrong. 
In another sense, the interview relationship
is unique. Unlike normal social intercourse,
the interviewer submerges or
mutes his own personality and identity,
and, while maintaining anonymity, he
remains an empathic and responsive
person.


Some of the basic orientations to interpersonal
relationships growing out of
therapeutic interviewing might well be
adapted to the research interview. The
interviewer must convey to the child
his genuine interest in him and in his
ideas. Children’s sensitivity
to genuineness or lack of sincerity
on the adult’s part has been repeatedly 
pointed out by clinicians, teachers, case- 
workers, and others working directly 
with children. To be able to convey to 
the child genuine liking and acceptance, 
and at the same time to maintain a sense 
of neutrality and objectivity, demands 
of the interviewer very fine skills in 
social relations. He cannot be a com- 
pletely neutral object, but he needs to 
elicit quickly a positive response from 
the child in order to secure cooperation. 
The level of the relationship in terms 
of empathy and responsiveness must be 
carefully controlled and will vary from 
one type of interview to another. An
interview that probes feelings, attitudes, 
and deeply personal orientations re- 
quires a deeper level of relationship in 
terms of warmth, sensitivity, and re- 
sponsiveness than one concerned pri- 
marily with obtaining factual data. 

Although the therapeutic approach 
has much to offer in the way of an 
orientation to research interviewing, 
the basic difference in goals between 
the therapeutic and research interviews 
creates fundamental differences in the 
relationship. The respondent in the research
interview is helping the interviewer
by his cooperation, instead of
being the recipient of help, as in the
therapeutic interview. Another impor- 
tant difference necessitating modifica- 
tions in technique is in the duration of 
the relationship. The research interview 
often involves a single contact with the 
child, as distinguished from the therapy 
situation in which there are often- 
repeated contacts over a period of 
months or even years. There is need, 
therefore, for a different approach to 
develop more rapidly the character of 
the relationship desired for the research 
interview. However, in some research 
situations, such as longitudinal research, 
the contacts between interviewer and 
child continue over a period of years, 
but are less frequent and less in- 
tensive than in the therapeutic rela- 
tionship. In this setting there may be 
advantages in proceeding more slowly 
in establishing the desired relationship. 
In some treatment interviews the transference
relationship is consciously fostered
in the service of the therapeutic
goals. In the research interview there
are ethical as well as methodological reasons
for avoiding the development of
a relationship that could not be adequately
handled by the interviewer in
his limited time with the child. Certainly,
negative transference could
seriously impair the adequacy of com- 
munication between the child and the 
interviewer. On the other hand, there 
is the danger of developing a too 
strongly positive relationship. Over- 
identification with the interviewer 
might distort the responses; the child 
might attempt to give responses he 
thinks the interviewer wants, or that
would be acceptable to him, and to 
inhibit responses he thinks the interviewer
would find unacceptable. Unless
the research objectives require that
the child be able to project onto the 
interviewer feelings that he has toward 
parental figures or other significant 
adults in his environment, it is neces- 
sary that the interviewer clarify his 
identity early and do it in a way to 
differentiate clearly his role from that 
of the parent or teacher. 

Because of its very uniqueness and 
ambiguity, the interview relationship 
may initially be anxiety provoking. To 
avoid the development of anxiety, it is 
important that the interviewer be very 
explicit in defining the nature of the 
situation. It may be done verbally as 
well as nonverbally. The definition of 
the situation involves clarifying for the 
child the purpose of the interview, the 
interviewer's expectations of him, and
the interviewer's role.¹

¹ H. S. Sullivan (1954), in discussing the
psychiatric interview, emphasizes the im-
portance of the definition of the situation:
“Social or cultural definition is very im-
portant indeed in the earlier stages of an
interpersonal relation; in fact, it is finally
important if one of the people concerned
overlooks it, since this means that the rela-
tionship will not be developed in any meaning-
ful sense” (p. 29).

When a child is brought to a strange
setting for an interview, some time must
be planned for him to become familiar
with the environment and with the in-
terviewer. At the beginning the inter-
viewer may assume a relatively neutral
role while the child is given freedom to
explore the environment. However, the
interviewer should be available and sup-
portive without forcing himself on the
child. Play materials may help put the
child at ease because of their familiarity
and pleasant associations. By focusing
attention on the toys, the child's self-
consciousness may be diminished and
he may be stimulated to offer sponta-
neous comments about similarity or
lack of similarity to his playthings at
home. The play materials should be
simple and challenging but not too
difficult for the child to master and not
so absorbing that they detract from the
relationship with the interviewer. The
materials should have a limited time-
involvement so that the child can
quickly satisfy his curiosity about them
or successfully master them before the
formal interview procedure is started.
With older children, toys become less 
effective, and the interviewer is more 
dependent on his skills in interpersonal 
relationships to convey to the child 
quickly his warmth and genuine ac- 
ceptance of him. Some “trick” gadgets 
are often useful to break the ice and 
to help put the child at ease by shifting 
the focus of attention away from him- 
self to the object or game. In helping 
the child discover the possibilities of 
the game, the interviewer can commu- 
nicate warmth and permissiveness, set- 
ting the tone for the interview relation- 
ship. In some kinds of research, such as 
in studies on aggression, it may be 
methodologically important to stand- 
ardize the preinterview experience to 
control for possible effects of antece- 
dent failure, frustration, or fear (Yar- 
row, 1948). 

A useful procedure in establishing 
rapport and stimulating the child to 
talk is to have the child or parent bring 
to the interview some creative prod- 
ucts, such as pictures the child has 
drawn or painted. This situation en-
ables the interviewer to express his ap-
preciation of the child’s productions
and places the child in the role of the 
“expert” with this particular material. 
He is the only one able to tell the in- 
terviewer all about it, thus facilitating 
spontaneous conversation and giving 
him confidence that what he has to say 
is important. The interviewer can then 
divert the conversation to the desired 
area.

Some consideration needs to be given 
to the physical setting in which the in- 
terview is conducted. In general, we 
think of the ideal setting as one that 
reduces anxiety and maximizes rapport. 
A quiet room that fosters a feeling of 
privacy, one free from “convenient” 
distractions, will facilitate the interview 
process. On the other hand, for some 
kinds of emotionally involving material, 
an interview on the spot in the situa- 
tion may yield significant data (Redl, 
1959). (See p. 591 for discussion of this 
technique.) For some research prob- 
lems, a setting may be created to facili- 
tate or stimulate deliberately the expres- 
sion of attitudes or feelings of aggres- 
sion, anxiety, or sympathy. Many obvi- 
ous practical considerations govern the 
appropriateness of any given physical 
setting for an interview; an interview 
about the child’s attitudes toward his 
teacher might not be most freely con- 
ducted in an alcove off the principal’s 
office. Other aspects of the physical sit- 
uation will also affect rapport, such as 
seating arrangements. A different at- 
mosphere with different expectations in 
the child is created when the inter- 
viewer and child are separated by a 
desk than when they are seated at the 
same level side by side. Often a child’s 
concentration of attention is improved
when the physical arrangements impose 
limits, such as being seated at a table 
with paper and crayons. Sometimes 
children in a strange situation will feel 
more comfortable if they are partly 
“protected” by a table or desk than if 
they are “exposed” in the middle of a 
large room. 

In addition to the nonverbal defini- 
tion of the situation, communicated by 
the physical setting and by the inter- 
viewer's initial approach to the child, 
it is usually desirable that there be a 
clear verbal definition. The interviewer 
should be explicit about the purposes 
of the interview as well as such factors 
as the length of time involved and the 
confidentiality of the content. 

With the preschool child, it has been 
traditional to define the interview as 
a play situation. If this orientation is 
given, then it is important to make the 
interview situation a genuinely pleas- 
urable one. The child may, with good 
reason, become resistant if he has been 
given the expectation of having fun 
and the situation becomes a serious one 
or one without clear purpose to him. 
With preschool children, the use of 
props during the interview (e.g., talk- 
ing through the medium of dolls or 
carrying on a conversation via toy 
telephone) is often more effective than 
a straightforward question-answer ap- 
proach. Giving the child an oppor- 
tunity to use crayons or clay while 
talking during the interview may also 
help to decrease self-consciousness.
However, these accessory materials 
must not be so interesting that they 
will offer a convenient escape from the 
interview or become so absorbing that 
the child will see the interview ques-
tions as an annoying distraction from
his play. 

During the middle childhood period, 
after school entrance, when the child 
has some background of experience in 
serious discussion with adults, it is usu- 
ally less desirable to structure an inter- 
view as play unless it is really carried 
out as a free-play interview. During 
this age period the interviewer must 
rely more on the child’s willingness to 
accept adult authority and on the in- 
herent interest of the content of the 
interview questions. An explicit verbal 
definition of the purpose of the inter- 
view is especially important at this age. 
A simple statement, such as, “We are 
interested in finding out what boys 
your age think about different kinds of 
TV programs so that we can plan pro- 
grams that boys like,” may be an ade- 
quate definition of the interview pur- 
pose. 

An important aspect of the definition 
of the interview situation is the clari- 
fication of role relationships. Research 
on interviewing adults has emphasized 
the importance of status roles, of the 
in-group or out-group identification of 
the interviewer, and of his degree of 
expertness in the topic under consid- 
eration. Although there is no relevant 
research, it is likely that different role 
considerations are important with chil- 
dren. The significant roles may be those 
that relate to the child’s daily experi- 
ences—teacher, father, mother, coach 
—roles that have meaning in terms of 
authority, discipline, and trust. The re- 
search interviewer, as a permissive per- 
son who encourages the child to express 
his thoughts or feelings with impunity,
does not easily fit any of these conven-
tional roles. It may be difficult for the
child to conceptualize the interviewer's
role, and role ambiguity may be a source
of anxiety. The interviewer should be
as explicit as possible about his role
early in the interview. He defines his
role verbally as well as by his behavior.
If he is aware of the various possible
roles in which he might be perceived
by the child, he can encourage the role
perception that will facilitate the pur-
poses of the interview. For the pre-
school child, the interviewer seen as
parent-surrogate will probably arouse
the least anxiety. With children known
to have very disturbed parental rela-
tionships, it is important to convey
quickly to the child the attitude of the
“good parent.” Similarly, during ado-
lescence, with resistance to parental
probing already strongly established,
one might assume that the parental role
would hinder communication. A neu-
tral adult who is able to accept the
adolescent on an equal-status basis in 
terms of the adequacy of his ideas and 
knowledge, yet who maintains some 
degree of adult authority, is probably 
the ideal role. The degree of authority, 
the degree of equality, and the degree 
of neutrality will vary somewhat with 
the purpose of the interview. 

Although it is important that the in-
terviewer’s behavior be carefully con-
trolled to provide comparability from
case to case, preoccupation with stand-
ardization of behavior may detract
from spontaneity and result in “stiff” 
behavior. It is unrealistic to assume that 
a rigid behavior pattern can be defined 
for all interviewers in any given situa-
tion to insure equivalent stimulus-value
from one interviewer to another. 
Rather, there must be sensitive aware-
ness of those aspects of behavior that
might be biasing in terms of the par- 
ticular interview situation and content. 
With such awareness, there is some lati- 
tude for expression of the individual 
personality within the defined inter-
viewer role and for the kind of spon-
taneity that will facilitate the child’s
response. The interviewer should be 
capable of adapting his behavior to the 
situation and to the subject according 
to his evaluation of motivations and 
defenses that might significantly influ- 
ence the child’s response. If these 
changes represent conscious adaptations 
from a defined role model, the re-
searcher will be able to evaluate pos-
sible biases introduced by these adap-
tations.

In the preceding discussion of inter- 
personal relationships in the interview 
we have focused on general techniques 
useful with a hypothetical normal child, 
probably of middle-class background. 
When interviewing atypical children or 
children from cultural backgrounds 
different from the interviewer's, some 
adaptations in technique may be re- 
quired. The necessity for adaptations in 
interview technique is obvious when 
dealing with atypical children—high- 
grade mentally deficient, emotionally 
disturbed, or brain-injured children— 
but it is less often recognized that adap- 
tations may be necessary for children 
from different cultural backgrounds. 
For these children it may simply re-
quire adapting language or con-
cepts that are relevant to the life ex-
periences of the child. Sometimes we
may need to be aware of different
modal patterns of relationship with au-
thority figures, e.g., different degrees
of freedom or spontaneity with adults 
characteristic of an ethnic or class 
group. In a comparison of the be- 
havior patterns in play therapy of nor- 
mal children and children with serious 
emotional problems, Moustakas and
Schalock (1955) found a high degree 
of similarity in basic patterns of inter- 
action. There were, however, some dif- 
ferences with important implications 
for interviewing procedures. The dis- 
turbed children were more inhibited in 
verbal interaction with the therapist and 
responded in a way that did not en- 
courage interaction. They also ex- 
pressed overt hostility more frequently 
than normal children. When working 
with children who are withdrawn and 
verbally inhibited, considerable skill is 
necessary to draw them out. In plan- 
ning the interview, sufficient time 
must be allotted for the initial estab- 
lishment of rapport to stimulate ver- 
balization. The technique mentioned 
earlier of beginning the interview con- 
versation with some area of special in- 
terest to the child may be useful. It is 
also important that the interviewer re- 
frain from being overanxious in de- 
manding a response from the child. 
When dealing with extremely disturbed 
children, a major difficulty may be the 
child’s inability to focus on and attend 
to any stimulus for more than a few 
seconds. This limits the kind of inter- 
view procedure that is feasible. Short 
questions and the use of props to rein-
force the question may help to utilize
available attention span. 


METHODOLOGICAL CONSIDERATIONS IN INTERVIEWING


The principal methodological issues 
in the construction of interviews are 
the degree of standardization of the in- 
terview, the extent of interviewer di- 
rectiveness, and the form and structure 
of interview questions. Choice among 
alternative approaches is partly depend- 
ent on the purpose of the interview, 
partly on the interview content, and 
partly on the researcher’s own personal 
predilections. 

The Standardized versus the Free 
Interview. In planning a research in- 
terview, a major choice is between a 
standardized and a free interview. In 
the standardized interview the questions 
are completely formulated prior to the 
interview session and are presented to 
each subject in the identical way, with-
out variation in language or sequence.
In the free interview the structure may 
consist merely of an outline of sug- 
gested topics, to be introduced at any 
point at which they seem meaningful 
by adapting the language to the sub- 
ject. Between these two extremes are 
varying degrees of standardization and 
freedom. 

There has been much discussion in 
the literature on the relative merits and 
demerits of standardized and free inter- 
views. The chief advantage claimed for 
the standardized interview is that in 
presenting the same verbal stimulus to 
each subject it limits the operation of 
interviewer bias. A further advantage 
for research is that it assures some re-
sponse from all subjects to the same
question. The proponents of the free
interview maintain that comparability
of meaning is better achieved by adapt-
ing the language to the respondent and
to variations in the situation. They
stress the potential variations in inter-
pretation of the same question from
different frames of reference. It is also 
contended that greater validity is ob- 
tained in the free interview when the 
questions are introduced and the re- 
sponses given in a more natural con- 
text, in which meaningful associations 
and motivations of the respondents are 
revealed. [For detailed discussion of 
pros and cons of the standardized and
open interviews, see Crutchfield and
Gordon (1947); Kahn and Cannell
(1957); Maccoby and Maccoby
(1954).]

In research with children there are 
many advantages of the free interview. 
It allows for greater flexibility in form 
and sequence of questions, a help in 
interviewing children, particularly if 
the research covers an age range in
which there may be wide variation in 
comprehension of language and con- 
cepts. By lessening formality and mak- 
ing the interview situation a more nat- 
ural one, shyness and verbal inhibitions 
are decreased. 

Directive and Nondirective Ap- 
proaches. The optimum degree of di- 
rection by the interviewer in the move- 
ment of the interview varies with dif- 
ferent objectives and content. In the 
directive approach the interviewer may 
maintain complete control of the con- 
tent areas of the interview by setting 
up the framework and redirecting the 
interviewee’s responses to keep them
within the desired structure. At the 
other extreme, with the nondirective 
approach, the interviewer may simply 
open up an area of discussion and fol- 
low completely the child’s leads in the 
direction of the topic, keeping the in- 
terview moving by reflecting content 
or feelings expressed (Rogers, 1945; 
1951; Axline, 1947). 

In reality, neither extreme in direc-
tiveness is well adapted to research in-
terviewing. The more directive ap-
proach is usually adequate for obtain-
ing factual information about topics of
minimal ego-involvement, e.g., favorite 
games. The directive, highly structured 
interview is comparable to the written 
questionnaire, and there probably is not 
much gain in validity over the ques- 
tionnaire for obtaining factual informa- 
tion, except with children who have 
not yet mastered reading skills. 

Increasing recognition has been given 
to the value of the nondirective 
approach in interviewing adults, par- 
ticularly in personality assessment and 
in research on attitudes and feelings that 
are not fully conscious. Its major value 
is that it gives the respondent an op- 
portunity to select content that is par- 
ticularly significant to him and to 
present it in a context that may offer 
further clues to its meaning to him. 
The nondirective interview also has 
value with children to the extent that 
it is differentiated from the formal ques- 
tion and answer approach, an approach 
that may be unfamiliar to the young 
child and one that is associated by older 
children with testing procedures in 
school. 

Degree of Structure of Interview 
Questions. The degree of directiveness
and the degree of structure in inter- 
viewing are often not clearly distin- 
guished. Directiveness, as we have used 
it, refers to the behavior of the inter- 
viewer, the degree of control he exer- 
cises over the movement of the inter- 
view and the interviewee’s responses. 
Structure refers to the characteristics 
of the questions, the degree of speci- 
ficity of content. High directiveness 
and a high degree of structure usually 
go together, but it is possible for a non- 
directive interviewer to use a highly 
structured stimulus-situation to focus 
on specific content and allow the in- 
terview to proceed nondirectively. 

Possible variations in question struc- 
ture range from a highly structured 
question in which both the stimulus and 
response frameworks are structured to 
the unstructured question in which both 
the stimulus and response aspects are 
relatively ambiguous. The possible 
variations are schematized below with 
an example based on a doll-play inter- 
view with 4- and 5-year-old children in 
regard to mother-child relationships 
(Winstell, 1951). 

The questions with highly structured 
stimulus-situations are best adapted to 
problems in which we wish to focus 
on a very specific topic, such as the 
child’s feelings about specific aspects of 
parental control techniques, e.g., his re-
action to being frustrated in play by
the mother. If we are interested in a 
broader characterization of the mother- 
child relationship, then the less struc- 
tured approach is desirable because it 
gives the child freedom to select the 
most salient framework for character- 
izing his relationship to his mother. 

POSSIBLE VARIATIONS IN STRUCTURE
OF INTERVIEW QUESTIONS

Stimulus Framework                  Response Framework

Structured                          Open
Example: “This little boy is        “What does he say?”
busy playing and his mother
stops him and says it’s time
to go to bed.”

Structured                          Structured
Example: “If this little boy is     “Does he get mad or does he
busy playing and his mother         say, ‘All right, mother?’”
stops him and says it’s time
to go to bed.”

Open                                Open
Example: “This little boy is        “Let’s make up a story about
playing and his mother comes        what she says to him and what
in.”                                he says to her.”

Open                                Structured
Example: “This little boy is        “Does the boy ask his mother
busy playing and his mother         to play with him or does he
comes in.”                          just go on playing as if his
                                    mother was not there?”

There are obvious dangers in interview 
questions with highly structured re- 
sponse possibilities because of young 
children’s high suggestibility. This ap- 
proach may be useful when we wish 
to facilitate the expression of culturally 
inappropriate attitudes. When alterna- 
tive response possibilities are given, the 
form of the question can suggest that 
both are socially acceptable, e.g., “Some 
boys think this way, others think that 
way.” 

Direct, Indirect, and Projective
Questions. In terms of directness, we 
can distinguish three major types of in-
terview question: direct, indirect, and
projective. The direct and indirect
questions are differentiated in terms of
the extent to which the purpose of the
question is made explicit; the projective
question is distinguished from the other
two in regard to the referent of the
actor, i.e., the actor is a hypothetical
child rather than the interview subject.
Questions may vary along a continuum
of directness.

The straightforward direct question
is most commonly used for obtaining
factual information and in simple atti-
tude studies, e.g., “Can you tell me the
names of all the books you have read
in the past month?” Indirect questions
are used to explore feelings or attitudes
when resistance to a direct approach 
is anticipated because of cultural taboos 
or unconscious inhibitions or when 
the cultural standards might prompt in- 
valid responses. In the indirect question 
the child is asked to state what he would 
do or how he feels about a topic, but 
the purpose of the question is disguised. 
For example, instead of asking the child 
which parent he prefers, the question 
might be, “If you were shipwrecked on 
an island and could have either your 
mother or your father with you, which 
one would you take?” In the projective 
question the purpose is further disguised 
in that the child is asked to interpret 
the feelings or predict the actions of a 
hypothetical child. Frequently dolls or 
pictures are presented with the ques- 
tion. For example, “Mother is feeding 
the baby. This little boy comes in and 
sees them. What does he do? What 
does he think about it?” 

The foregoing example illustrates an- 
other significant difference between the 
projective and the direct question.
Typically, the direct and the projective 
questions differ on the dimension of 
situational specificity as well as in the 
identification of the actor. The projec- 
tive question often uses a specific situa- 
tion in an attempt to explore a gener- 
alized attitude, e.g., sibling rivalry. The 
direct question characteristically uses a 
less structured stimulus-situation, e.g.,
“What do you think about the new 
baby in your house?” Thus the ques- 
tions differ not only in whether the 
child is asked directly about his feelings 
but in the generality of the attitude. In 
the projective approach a series of ques- 
tions involving a number of different 
situations might be necessary to estab- 
lish generalized attitudes toward the 
sibling, as distinguished from a reaction 
to a specific situation, e.g., being dis-
placed by the sibling in the feeding
situation.

In the use of the projective question 
we are operating on the assumption that 
the child, in telling what another child 
will do, is expressing his own motives. 
This may often be a valid assumption 
(Getzels, 1951), but it cannot be un- 
critically accepted for all kinds of ques- 
tions and in all sorts of interview situa- 
tions. In some situations the child may 
give the response that represents the 
cultural norm of his group rather than 
his unique personal feelings or attitudes. 
Sometimes a child’s response to a pro- 
jective question may represent a blend 
of fantasy with reality. For example, in 
response to a projective question in- 
volving a hypothetical parent's reaction 
to the misbehavior of a hypothetical 
child, children often attribute to the 
parent a stronger reaction and harsher 
discipline than is characteristic of the
reality situation. This obviously poses
problems in interpretation. There are,
however, many kinds of questions that 
cannot be asked directly, when the 
child is not consciously aware of his 
true feelings or when he cannot express 
them because the cultural inhibitions 
are too strong. In such cases projective 
questions are the best available ap- 
proach. Insofar as we recognize that 
responses to projective questions may 
reflect unconscious feelings or may 
represent symbolic or disguised expres- 
sions of culturally inappropriate senti- 
ments, we will not analyze them at their 
face value but will try to utilize prin- 
ciples of interpretation derived from 
our knowledge of psychodynamics. 

Choice of Interview Approach. One's 
theoretical assumptions about human 
behavior will partly influence the kind 
of interview approach chosen, e.g., with
a psychodynamic orientation, one 
would favor the use of projective ques- 
tions. The kind of research problem 
being investigated also influences the ap-
proach used. If we are studying atti-
tudes with strongly defined cultural
taboos, indirect or projective questions 
are generally preferable. However, if 
our interest is in the extent to which 
the child has internalized the cultural 
patterns of thinking and behavior with 
regard to these taboos, direct questions 
would be more meaningful. 

In the study of personality charac- 
teristics or interpersonal relationships 
an important aspect of the analysis may 
be in terms of the way in which the 
child structures his responses, the par-
ticular content selected, and the se-
quence of his associations. The nondi-
rective approach with the less struc-
tured question is best adapted to this
kind of analysis. 

We have pointed out earlier some 
advantages of the open interview with 
children in creating a freer, more nat- 
ural atmosphere. This conclusion does 
not apply equally to the structure of 
questions. On the whole, questions that 
are completely unstructured in terms 
of the situational components are not 
likely to be useful at the youngest age 
levels. With his limited associational 
ability, the young child may have dif- 
ficulty with an unstructured approach; 
he needs some framework within which 
he can focus and direct his thinking 
and language. Unstructured ques- 
tions in a nondirective interview with 
children over the age of 6 years may 
be valuable in determining the sig- 
nificant dimensions of the child’s psy- 
chological environment, the areas of 
gratification and anxiety, and the
clarities and distortions in his percep- 
tions of the significant people in his en- 
vironment. In short, for assessing con- 
scious and near-conscious personality 
trends and their significant environ- 
mental supports, the nondirective open 
interview may be a very useful method. 
Of the many limitations and difficulties 
inherent in the free and unstructured in- 
terview as a research instrument, the 
major ones are the danger of lack of 
comparability from case to case in the 
kinds of data obtained, the consequent 
difficulty in quantitative pooling of 
data, and the decreased control of inter- 
viewer bias. 

Interviews need not be rigidly limited 
to one kind of approach. The tech- 
niques and form of question may be 
varied at different points in the inter-
view. The interview may begin with an
open question and then proceed non-
directively. After an appropriate time
interval, the interviewer may introduce
increasingly more structured questions
and may assume a more directive role.
There are some dangers, however, in
radical shifts in interview approach.
Rapport in an interview grows out of
the respondent's increasing familiarity
with the interviewer's approach and from
knowing what to expect of the inter-
view situation. Therefore, an abrupt
shift in technique from a nondirective
to a directive approach is not desirable
and may be disruptive of rapport. 


CONSTRUCTION OF INTERVIEW SCHEDULES


Pretesting. In the development of a 
questionnaire or interview schedule, 
pretesting is useful in helping to elimi- 
nate unclear, inappropriate, or anxiety- 
arousing questions and to identify the 
most effective motivational techniques. 
The pretesting should be done on a 
population chosen for its comparability 
to the experimental population in char- 
acteristics relevant to the research prob- 
lem. Considerable pretesting of the 
kind Piaget (1951) has used may be 
necessary to develop an interview 
schedule for children of the preschool 
years. One form of “pretesting” sug-
gested by Piaget’s work involves de-
tailed recording of the spontaneous
conversation of children in situations
designed to elicit verbal comments on
the topic under study. Questions can 
then be formulated with the words and 
concepts that are part of the active
vocabulary of children of the research
sample. Following formulation of the
questions, there is still need for the
more conventional kind of pretesting, in
which the actual interview schedule
planned for the research is used. The
results of the pretest must then be
evaluated. Each question requires care-
ful consideration in terms of its ease
of comprehension, its effect on rapport,
and its effectiveness in eliciting a re-
sponse relevant to the research objec-
tives. Necessary revisions can then be
made in the light of these evaluations.
Although pretesting may add con-
siderably to the time and expense of a
research project, it is usually well worth
the effort in terms of the ultimate re-
turns.

Formulation of Questions. The basic 
considerations in the formulation of 
questions are (1) that the question be 
readily understood by the child being 
interviewed, (2) that the meaning or 
interpretation of the question not vary 
significantly from one child to another, 
and (3) that the form of the question 
not “lead” the child to a given response. 

One way to insure the child’s com- 
prehension in the interview would be 
to limit language used to the equivalent 
of a child’s basic English. There is a
considerable body of knowledge of chil-
dren’s language and concept develop-
ment (Baldwin, 1955; Jersild and Ritz-
man, 1938; McCarthy, 1954; Piaget,
1951; M. K. Smith, 1941) available to 
guide us in constructing questionnaires 
and interview schedules. Before the pre- 
test or first trial of a standardized inter- 
view, it would be desirable to check the 
words with one of the standard lists 
commonly used with preschool or 
school-age children (International
Kindergarten Union List, 1928; a Com- 
bined Word List, Buckingham and 
Dolch, 1936). 
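
The check of question wording against a standard word list can be sketched
as a small routine. The following is only an illustration under stated
assumptions: the function name `words_not_in_list` and the miniature
`sample_list` are hypothetical and are not drawn from any of the published
lists cited above; a real check would load the full list chosen for the
age group under study.

```python
import re


def words_not_in_list(question, known_words):
    """Return, in order of first appearance, the words of `question`
    that do not occur in `known_words` (comparison ignores case)."""
    known = {w.lower() for w in known_words}
    tokens = re.findall(r"[a-z']+", question.lower())
    flagged = []
    for tok in tokens:
        if tok not in known and tok not in flagged:
            flagged.append(tok)
    return flagged


# Hypothetical miniature word list, for illustration only.
sample_list = ["what", "do", "you", "think", "about", "the",
               "new", "baby", "in", "your", "house"]

question = "What do you think about the new sibling in your house?"
flagged = words_not_in_list(question, sample_list)
print(flagged)
```

Words flagged in this way are candidates for rewording before the pretest;
the routine says nothing about whether a listed word is understood in the
sense intended, which remains a matter for pretesting.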

In addition to the specific language 
used, other characteristics of the ques- 
tion may affect comprehension and 
motivation in the interview. Ammons’ 
(1950) study of racial attitudes in pre- 
school children, although not designed 
as a methodological study and based on 
a limited sample of cases and questions, 
offers some interesting suggestions regarding
form of interview questions. He
found few refusals at all ages to questions
with simple response alternatives,
such as “yes” or “no,” or to questions
involving a choice between concrete
alternatives, e.g., “white” or “colored.”
The highest rate of refusals was to
open-ended questions. Children at all
ages were more likely to respond to
questions about their actions than to
questions about what they would say.

Although children may respond easily 
to questions with simple response al- 
ternatives, the validity of such responses 
is sometimes questionable. There is a 
real possibility that young children who 
do not completely understand a ques- 
tion may choose a response on a purely 
chance basis simply in an attempt to 
please the examiner. 

A number of specific techniques, 
which have grown out of clinical and 
research experience in interviewing 
adults, may be helpful in formulating 
questions for interviews with children. 
These are discussed in several books and 
articles (Cantril, 1940; Jahoda, Deutsch, 
and Cook, 1951; Kahn and Cannell, 
1957; Maccoby and Maccoby, 1954; 
Payne, 1949; Parten, 1950). We have
limited discussion here to suggestions
for formulating questions to facilitate 
the expression of responses that the 
child might consider culturally unac- 
ceptable. 


1. Suggest in the question that other 
children might feel the same way. The 
question may sanction a specific atti- 
tude or a general feeling. For example, 
“Some boys think they shouldn’t have 
to take their little brothers along with 
them when they go out to play. How 
do you feel about it?” “All of us get 
mad sometimes when something hap- 
pens that we don’t like. What sorts of 
things make you feel mad?” 

2. Present two alternatives, both of
which might be considered acceptable. 
Example: “If your little brother pulls 
out all the books from the shelves, do 
you punish him so that he won't do it 
again or do you see that your mother 
finds out about it?” 

3. Choose words that will soften an 
undesirable response or present the re- 
sponse in a context that might make it 
more desirable. For example, in the 
preceding example, instead of saying, 
“would you tell your mother on him?” 
the use of the phrase, “see that your 
mother finds out about it” places the 
response in a more acceptable context. 

4. Avoid placing the child in a posi- 
tion in which he has to deny some un- 
desirable behavior by wording the ques- 
tion to assume that he has engaged in 
this behavior. “What sorts of things 
do you and your brother fight about 
most?” This technique must be used 
sensitively and with awareness of 
limits to avoid arousing guilt feelings 
that cannot be adequately handled.

5. Give the child an opportunity to
express a positive response before presenting
a question that will require
negative or critical evaluations. “What
things do you like best about school?
. . . What things aren’t so good?”


Questions combining these different
approaches may be used: “There are
some things most boys like about school
and other things they don’t like at all.
. . . What are some of the things you
like best . . . and what are some of
the things that aren’t so good about
school?”

Question Sequence. The sequence of 
questions may significantly influence 
rapport as well as the validity of the 
responses. The organization of topics 
should be guided by consideration of 
what is most meaningful to the child, 
as illustrated in Piaget’s interviews in 
which the sequence is adapted to the 
child’s concepts and comprehension 
level. The opening questions in an inter- 
view with a child are especially im- 
portant to help set the tone of the 
relationship and to define what is ex- 
pected of him. It is important that he 
be able to respond to the opening ques- 
tions without difficulty or embarrass- 
ment to give him the assurance that he 
can successfully play the required role. 
The more difficult questions and those
that may involve some threat to the
child, e.g., the expression of hostile
feelings, are best introduced late in the
interview, after maximum rapport has
been established. With young children
whose attention span is limited, the
sequence of the interview may be
utilized to control interest. Sequential
variations in kinds of questions or content
areas may help to reduce fatigue
and boredom.

A common sequence, the funnel sequence,
begins with the most general
and follows with more differentiated
and specific questions. This sequence
enables the child to state his frame of
reference and to respond in terms of
categories salient for him without being
influenced by the specific questions
that define the interviewer’s frame of
reference. For some kinds of research
problems with children, a reversal of 
the conventional funnel sequence may 
be desirable; that is, the interview begins 
with specific questions, which are fol- 
lowed by the introduction of general 
questions toward the end of the inter- 
view. This sequence may be desirable 
with very young children who might 
have difficulty structuring an area from 
general questions. By starting with 
questions to which they are able to 
respond, they may be encouraged to 
attempt a response to a more difficult 
general question. 
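
The two orderings just described can be illustrated with a short Python sketch. It assumes, purely for illustration, that each question has been given a numeric specificity rating by the investigator; the `Question` class, the ratings, and the sample wordings are hypothetical and are not drawn from any study cited here.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Question:
    text: str
    specificity: int  # assumed rating: 1 = most general, higher = more specific

def order_questions(questions, inverted=False):
    """Return questions general-to-specific (funnel) or, when `inverted`
    is True, specific-to-general (reversed funnel)."""
    return sorted(questions, key=lambda q: q.specificity, reverse=inverted)

# Hypothetical items; the wording and ratings are illustrative only.
ITEMS = [
    Question("Which game did you play at recess today?", 3),
    Question("Tell me about school.", 1),
    Question("What do you do at recess?", 2),
]

funnel = [q.text for q in order_questions(ITEMS)]
inverted_funnel = [q.text for q in order_questions(ITEMS, inverted=True)]
print(funnel)
print(inverted_funnel)
```

With very young children the reversed order would be chosen, so that the interview opens with the concrete recess question and reaches the general one last.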

In a free interview the interviewer 
can often take the cue from the child 
when a given topic has been exhausted 
and he is ready for the introduction of 
a new area. Often it is desirable to use 
some kind of transition statement to in- 
dicate that a new topic is going to be 
opened up. For example, “You've given 
me lots of good ideas about the kinds of 
games girls like to play; now I'd like to 
get some of your ideas about the kinds 
of books 8-year-old girls like to read.” 
This device may prevent the child from 
carrying over the frame of reference 
from the previous topic to the new one. 

In young children there are tendencies
to perseverate on a given idea or
frame of reference. Thus a child’s response
to successive questions in a series
may not be a true reaction to the con- 
tent of the question but may be a 
perseveration from a previous question. 
There are no data on the conditions 
under which perseveration occurs. It 
may be partly a developmental phe- 
nomenon, but it seems likely that it is 
more frequent when the child is anxious 
or fatigued or when he is unclear about 
a question or what is expected of him. 
The establishment of good rapport may 
decrease the likelihood of perseverative 
responses. The sequence of questions 
and appropriate transitional statements 
may also be used to minimize persevera- 
tive tendencies. 

Length of Interview. In the develop- 
ment of interview schedules for use 
with children it is particularly im- 
portant to plan the total length of the 
interview within a time period ap- 
propriate for the child’s attention span. 
The validity of an interview may be
seriously impaired by a child’s response
when he is fatigued or satiated. Frequently,
under such conditions, the
child throws off an easy response in an
attempt to end the interview quickly
and escape from the situation. The
length of an interview cannot, of course,
be absolutely prescribed because the
child’s attention span will certainly vary
with the content of the material as well
as with the manner of presentation and
other rapport-relevant factors in the
situation. In recognition of the limited
attention span of young children the
range of ideas and concepts covered in
a single interview should be limited. If
it is necessary to cover a variety of
concepts and themes, it is sometimes
better to plan a second interview. The
gain in validity through this procedure
needs to be weighed against the added
cost.

Probing. Often in an interview there 
is need to clarify or elaborate on a 
response to a question. The technique 
of probing is used to stimulate addi- 
tional responses. To avoid creating de- 
fensiveness in the child, probes should 
be natural and without pressure. At 
most ages, if the child senses too great 
anxiety on the part of the adult to ex- 
tract information, he is likely to react 
with negativism. The nondirective 
follow-up question is usually the most 
successful approach. A nondirective 
probe, in the form of a restatement of 
expressed content or feelings, ending in 
a questioning tone, may stimulate the 
child to elaborate his response (Rogers, 
1945, 1951). The danger is that the in- 
terviewer’s biases may be communicated 
in his probes, thus affecting the validity 
of the interview. In a free interview 
without standardized probes the in- 
terviewer’s biases may influence the 
choice of points at which probes are 
introduced as well as the content of the 
probe. The interviewer who under- 
stands his own biases and maintains an 
awareness of them is less likely to trans- 
mit them than the interviewer who lacks 
such awareness. 

Handling Suggestibility. The sug- 
gestibility of young children may 
seriously affect the validity of their 
responses. Several approaches are use- 
ful in counteracting suggestibility. The 
question can be phrased in a way that 
avoids giving the child any indication 
that one response is more acceptable 
than another. It is equally important
that the examiner’s manner and tone of
voice be controlled when presenting a
topic so as not to reveal his biases.
Countersuggestion, in which several
questions on the same topic are systematically
slanted in different directions,
is sometimes effective. For example, in
studying the degree of parental control
over choice of playmates the following
series of questions might be introduced
at different points in the interview:

1. “Do your mother and father let 
you play with anybody you want to 
play with?” 

2. “Do your mother and father often
tell you which children you can bring
home to play with?”

3. “When a new boy comes to live
in the neighborhood do you always ask
your mother if you can play with
him?”

If countersuggestion is not used 
sensitively and subtly, there is danger of 
some uncontrolled effects. If the child 
has committed himself to a position on 
a question early in the interview, this 
position may be maintained whether or 
not it is a valid representation of his 
point of view. Also, the use of counter- 
suggestion may strain rapport. The 
child may refuse to respond if he rec- 
ognizes a question as essentially similar 
to one he has already answered. 
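
One way to examine answers to a countersuggestion series is to translate each yes/no answer into the position it implies and see whether the positions agree. The Python sketch below does this for the three playmate questions above. The scoring rule, the item identifiers, and the function names are assumptions made for illustration; the text does not prescribe a scoring procedure.

```python
# Each item: (identifier, True if a "yes" answer implies HIGH parental control).
# Question 1 is slanted toward low control; questions 2 and 3 toward high control.
ITEMS = [("anybody", False), ("told_which", True), ("always_ask", True)]

def implied_control(answers):
    """Map each yes/no answer (True = yes) to the level of control it implies."""
    result = {}
    for item_id, yes_means_high in ITEMS:
        said_yes = answers[item_id]
        result[item_id] = "high" if said_yes == yes_means_high else "low"
    return result

def is_consistent(answers):
    """True when every answer implies the same level of parental control."""
    return len(set(implied_control(answers).values())) == 1

# A child who says "yes" to everything contradicts himself across the series,
# which may indicate suggestibility rather than a settled view.
yes_to_all = {"anybody": True, "told_which": True, "always_ask": True}
steady_view = {"anybody": False, "told_which": True, "always_ask": True}
print(is_consistent(yes_to_all), is_consistent(steady_view))
```

An inconsistent pattern is only a signal for closer inspection. As noted above, a child may also hold to an early answer for reasons unrelated to his actual view, so consistent answers are not proof of validity.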

Handling Anxiety. For ethical rea- 
sons, as well as from the standpoint of 
effective motivation, the interviewer 
should avoid placing the child in a posi- 
tion requiring responses that are com- 
pletely unacceptable in the child’s cul- 
ture or responses that may arouse guilt 
feelings. Sometimes one can free a
child to admit to knowledge or discuss
feelings on which there are cultural
taboos and yet avoid or minimize guilt
feelings by the form of the question.
Matter-of-fact acceptance by the interviewer
of everything the child says
is the most effective means of avoiding 
the development of anxiety. If the 
interview has been essentially non- 
directive and has not pushed the child 
to express unacceptable feelings, it is 
likely that what he has expressed will 
not be too disturbing. If taboo areas 
have been discussed, some provision 
should be made, either during or fol- 
lowing the interview, to reassure the 
child and to handle any guilt feelings. 
Assurance of confidentiality can be 
given if it seems appropriate, and the 
child can be given an opportunity to 
express acceptable attitudes on a variety 
of related topics. Even though the inter- 
view is not therapeutically oriented, it 
may be necessary to help a child achieve 
some closure on an anxiety-evoking 
topic that has been opened up in the 
interview. There is a sensitive line be- 
tween stirring up more anxiety by 
probing and subtly leading the child to 
an acceptable resolution. 


SPECIAL ADAPTATIONS 
OF INTERVIEW TECHNIQUES 


There are many variations on the 
conventional interview method that 
have been developed specifically for 
use with children. These variations have
grown out of attempts to adapt the interview
to the language, interests, and
motivations of children. We have
selected a few interviewing techniques
that have proved their value with children
and a few other creative approaches
to interviewing that seem to
show promise. These specific techniques 
might be viewed not so much as models 
but as points of departure in the de- 
velopment of a method for a particular 
problem. 

Piaget’s Méthode Clinique. Piaget’s
work (1928, 1929, 1951) has served as 
a model for the combined interview- 
experimental approach to the study of 
concept development. This méthode 
clinique represents a creative approach 
to interviewing children and to the 
formulation of questions appropriate to 
the child’s level of comprehension. The 
first step in this method involves ob- 
servation of children in natural situa- 
tions with verbatim recording of 
spontaneous conversation. The spon- 
taneous concepts and language of chil- 
dren in these natural settings are used 
as the basis for formulating questions, 
although standard questions are not 
used. In the interview proper the interviewer
introduces the subject in a
casual conversational tone and allows 
the child to talk, “noticing the manner 
in which his thoughts unfold.” There is 
considerable stress on empathy with the 
child’s way of thinking. The inter- 
viewer must be attuned to the child’s 
thought processes and reflect this under- 
standing in his questions as well as in 
his probes. Probes in the form of direct 
questions are introduced in the same 
casual conversational manner, follow- 
ing up on leads given by the child. 
Claparède in the introduction to Lan- 
guage and Thought of the Child 
(Piaget, 1951) characterizes the method 
as follows: 


This clinical method, therefore, which 
is also an art, the art of questioning, does
not confine itself to superficial observations,
but aims at capturing what is hidden
behind the immediate appearance of 
things. It analyzes down to its ultimate 
constituents the least little remark made 
by the young subjects. It does not give up 
the struggle when the child gives incom- 
prehensible or contradictory answers, but 
only follows closer in chase of the ever 
receding thought, drives it from cover, 
pursues and tracks it down until it can 
seize it, dissect it and lay bare the secret 
of its composition (p. xiv). 

The lack of standardization of ques- 
tions in this method has been subjected 
to severe criticism on the grounds that 
variations in wording of questions will 
have different meanings. Some support 
for this criticism comes from Nass’s 
(1956) research on children’s concepts 
of physical causality, in which he ob- 
tained data on the effects of variation 
in the form of the question. He con- 
cluded that questions worded to suggest 
the possible operation of animistic, 
supernatural, or dynamic forces yielded 
more such non-naturalistic types of re- 
sponses than questions less suggestively 
worded. 

Use of Dolls in the Verbal Interview. 
The use of dolls in conjunction with the 
direct verbal interview is a technique 
that has demonstrated its value in research
on a variety of problems.* Doll
play was introduced in therapy as an
adaptation of the conventional psychoanalytic
procedure for children. The
earliest use of this technique in research
was made by David Levy (1933, 1936)
in studies of sibling rivalry. Levy’s basic
technique involves stimulating the child
to talk through the medium of situations
set up with dolls. The experimenter
probes the child’s reactions to a
series of problem situations created by
manipulating the dolls and by verbal
definition. In studying sibling rivalry
the examiner sets up a scene of a mother
feeding the baby, with the baby doll in
the nursing position, and a child doll
of the same sex as the subject observing
the feeding. The problem is then presented
verbally. “The sister comes and
sees a new baby at the mother’s breast.
She sees it for the first time. Now what
does she do? Do whatever you think.”
There are several variations on this
basic situation. The situation may
simply be presented to the child without
any suggestions as to the direction
of the child’s activity, or there may be
direct stimulation for the child to
express negative feelings. For example,
in Levy’s study, after the general question,
the experimenter encouraged the
expression of rivalry feelings by verbal
suggestion. “When the sister saw the
baby, she thought ‘the nerve, at my
mother’s breast!’ ” There are also variations
in the degree of directness of
identification of the dolls. Sometimes
the baby doll in Levy’s study is explicitly
identified as the child’s sibling
and the other doll is identified as the
child subject.

*The distinction between “projective” doll
play and the doll-play interview is not a
sharp one. In this chapter the discussion of
doll play is limited to its use in conjunction
with a verbal interview.

Conn (1938) has used a similar tech- 
nique in the clinical exploration of a 
variety of problems, such as children’s 
sexual knowledge, awareness of sex 
differences, and the dynamics of car 
sickness. Conn and Kanner’s (1947)
research on children’s knowledge of the
origin of babies illustrates a number of
variations in the doll-play interview. A
scene is set up with a baby doll lying in
a toy crib and several larger dolls to 
represent siblings standing nearby. The 
interviewer points to one doll and 
asks, “What does he want to know 
about the baby?” If no spontaneous 
curiosity is expressed about the origin 
of babies, he then introduces a more 
structured probe. “Which one of them 
wants to know where babies come 
from?” In the next step the experi- 
menter points to various dolls in succes- 
sion, asking, “What is he saying?” In 
this manner the child is encouraged to 
carry on a dialogue between the dolls. 
Although the basic approach involves 
encouraging the child to talk through 
the medium of the dolls, one variation 
Conn has employed is the use of a direct 
question, “What do you think?” An 
interesting variation, used to overcome 
the child’s inhibitions to expressing unacceptable
impulses, is the “tough boy
technique.” A doll identified as a “tough 
boy” is introduced. This “tough boy,” 
it is explained, is one who does not 
need to be nice and can say things that 
nice boys cannot say. 

Bender (1953) employed an even 
more directive approach in a study of 
the dynamics of aggression and hostility 
in disturbed children ranging in age 
from 3 to 15 years. In this technique 
the examiner stimulated the child’s ex- 
pression of aggression by several dif- 
ferent approaches. In the doll test the 
examiner pushed over a small doll, re- 
peating this procedure three times with 
the question, “What has happened to 
it?” If the child made no attempt to 
imitate the examiner, she offered mild 
encouragement, saying, “Would you
like to play with it?” She then offered
stronger encouragement, “Knock it 
down.” The children’s behavior with 
the dolls and their verbalizations were 
recorded. Bender used several other 
situations to probe attitudes toward ag- 
gression. In the cowboy-soldier test 
the examiner set up a scene with a cow- 
boy and an Indian placed between two 
soldiers and asked, “What are they 
doing?” If the child answered, “Fight- 
ing,” the examiner then probed, “Why 
are they doing it?” She then probed 
further, “Is it right to do it?” In the 
automobile test the examiner placed a 
lead soldier between three toy auto- 
mobiles, which were pointed toward the 
soldier. The examiner asked, “What is 
this? What will happen? Do you want 
to play with this?” 

These adaptations of a clinical therapeutic
method have in common the
use of a highly structured approach in 
focusing on a specific content area and 
in encouraging the discussion of be- 
havior not sanctioned by cultural 
norms. Although the stimulus is highly 
structured, there is considerable latitude 
given to the child in terms of the kind 
of response and level and degree of 
elaboration of the response. 

In a study of awareness of racial dif- 
ferences among 2-to-5-year-old children 
Ammons (1950) used a more controlled 
“doll-play interview.” In the setting of 
a miniature playground he placed a 
Negro and white doll and then inter- 
viewed the subjects with direct ques- 
tions about their experiences as well as 
with indirect questions, using “framed 
situations.” The dolls were introduced 
as follows: “Here are two little boys 
who would like to play on the playground.
Do they look the same? How
are they different?” Later in the interview
contrived situations are introduced,
e.g., “This (white) boy’s mother
comes along and sees the boys playing.
What does she say?” “This little boy
(white) throws sand at this one
(colored). What does this boy
(colored) do?”

A variation of a doll-play interview
and multiple-choice technique was used
by Radke and Trager (1950) in a study
of perceptions of social role in 5-to-8-year-old
children. They presented a
series of plywood formboards with
cutout figures of white and Negro
adults. The children were given various
possibilities of costumes representing
“dress-up,” “work,” and “shabby”
clothes. The examiner introduced the
materials, posing questions as follows:
“What would a man be doing if he
were wearing these clothes?” (pointing
to one at a time). After the child has
identified what each costume means
to him, he is asked which he likes best
and why. The dolls are then presented:
“Now let’s look at these dolls. We have
two men. What do you think? Are they
just alike or are they different?” If
the child answers “Different,” he is
asked how they are different. Then the
examiner says: “This is a colored man
and this is a white man. Now let’s dress
the men. You put on their clothes.
Any one will fit; they’re all the same
size.” After the child has dressed both
dolls, he is asked, “What would this
man be doing wearing these clothes?
Can you tell me a story about this
one?”, pointing to the white doll and
then to the Negro doll. Following the
repetition of the procedure of dressing
the dolls, the examiner presents cutouts
of two kinds of houses, a “good
house” and a “poor house.” The following
instructions are given: “Here are
some houses. Which do you like best?
In which house does the colored man
live? Why do you think so? In which
house does the white man live? Why do
you think so? Do you think these two
men are friends? Which house is like
your house?” The child is given the opportunity
to assign both dolls to the
same kind of house or one to a good
house and one to a poor house.

Among the real advantages of the 
doll-play interview, especially for 
young children, is its interest value. By
shifting the focus from the strangeness
of the unaccustomed adult-child interaction
to the dolls, inhibiting shyness
will be lessened. Frequently, too, a
complex situation can be conveyed 
more adequately in a setting or scene 
than by a purely verbal approach. A 
basic assumption in the use of this tech- 
nique is that the child is freer to ex- 
press socially unacceptable feelings 
through the medium of the dolls than 
if he were asked directly. There is 
a convincing amount of clinical ex- 
perience to substantiate this assumption, 
but the meaning and basic validity of 
verbal responses given under these circumstances
need some clear-cut methodological
investigation. One possible
consequence of having the expression of 
unacceptable actions or ideas facilitated 
in this way may be that the attitudes 
or feelings that are expressed under 
these circumstances are an exaggerated 
reflection of the intensity of the child’s 
feelings. This is particularly likely to 
be the case where there is strong pressure
to express feelings, as in Bender’s
study (1953) previously described. Al- 
though it may not be a serious draw- 
back for therapeutic work, it has 
obvious limitations for diagnostic and 
research purposes. Unless we can de- 
velop a “conversion index,” which per- 
mits the simple translation of intensity 
of response in this situation to intensity 
of feeling, there is a real possibility of 
seriously biased findings. 

Story and Sentence Completion. The 
story completion and sentence comple- 
tion techniques are usually more ac- 
ceptable than doll play to children 
beyond the preschool age. Both tech- 
niques have been used extensively as 
variations on the written questionnaire. 
Their use as an adjunct in interviewing 
has been limited. 

The story-completion method may 
vary from a completely verbal approach 
to one combining verbalization and the 
use of dolls. In a study of father rela- 
tions of war-born children, aged 4 to 8, 
Faust, in collaboration with Stolz 
(1954), presented a series of incomplete 
stories verbally and accompanied the 
verbal presentation with appropriate 
doll action. The themes selected were, 
for the most part, quite direct and 
structured to be relevant to a specific 
hypothesis. For example, one story
designed to test the hypothesis that 
the war-separated child would not feel 
so free as a nonseparated child to admit 
his misdeeds to his father had the fol- 
lowing content: “A child is playing in 
his room with his daddy’s watch, al- 
though he knows that he is not to play 
with his daddy’s things. The child 
drops the watch and it breaks. His 
parents do not hear the watch drop and
the child wonders what he should do.”
Faust found the use of dolls effective 
in obtaining story completions from 
preschool-age children but discovered 
that the dolls hindered rapport with 
the older boys. They responded better 
when the incomplete stories were read 
from an illustrated book, with instruc- 
tions to complete the story verbally. 

Basically similar to the story-comple- 
tion technique is the sentence-comple- 
tion approach. This technique has been 
used fairly extensively in recent years 
with young school-age children and 
adolescents (Sanford, 1943; Stein, 1947; 
Rohde, 1946; Rotter, 1951; Graham 
et al., 1951; Kates, 1951). There have
been many adaptations on the sentence- 
completion technique as a variation on 
direct interviewing. Most frequently, 
this technique is used as a written test 
rather than an interview. Often the 
questions are very direct; e.g., “When 
I do or think something which I know 
is wrong, I... .” Indirect statements 
referring to the actions of hypothetical 
persons are also used. 

The story- and sentence-completion 
techniques can capitalize on young 
children’s inherent interest in stories. 
The situation of an adult reading a story 
approximates a familiar life-situation 
with pleasant associations for most chil- 
dren. Older children may be challenged 
by the creative possibilities in the task 
of completing a story. As in structured 
doll play, these approaches allow some 
freedom of response while permitting a 
relatively high degree of focus on a 
specific theme. 

Pictures as an Adjunct to the Inter- 
view. Another variation on a structured 
technique is the use of pictures to
portray a very specific situation. There
are several variations in method. On the 
simplest level the child is asked to make 
a choice among alternative pictures. 
He may be asked specific questions re- 
lating to the pictures, or he may be 
given the task of completing the story 
defined by the picture and verbally 
interpreted by the interviewer. These 
techniques have been used very ef- 
fectively in studying peer relations and 
in getting at a child’s perception of 
parents, teachers, and members of other 
racial and religious groups. 

Fauls and Smith (1956) studied sex- 
role learning in 5-year-old boys by 
use of a multiple-choice picture tech- 
nique. There were six pictures, three 
portraying female appropriate activities, 
and three showing male appropriate 
activities. One picture showed a boy 
about to play baseball; in another the 
boy was engaging in doll play. The 
interviewer introduced the pictures 
with a verbal interpretation: “This boy 
is going to play baseball. His mother 
and father are watching as he goes to 
play. There he is in this picture holding 
a doll in his arms and looking at it. 
Which of these do you do? Which do 
you like best? Which does his mother 
want the boy to do? Which does his 
daddy want the boy to do?” 

In studying the complex problem 
of self-identification in young children 
Horowitz (1943) developed a picture-choice
technique for preschool children.
She used several series of pictures, three
for racial identification, three for 
identification with reference to familial 
position, and one each for age, size, 
sex, and eating habits. The children 
were asked to make a choice among
several pictures, each of which was
defined by a standard verbal description.
For example, to study age identification,
four pictures of children of
different ages were presented: infant,
nursery school child, elementary school
child, and adolescent. The children
were asked, “Which one is you? Which
one is (name of subject)?” Horowitz
suggests with regard to methodology
that it is important that “the materials
be simple and as nearly unequivocal as
possible, and the choices given be really
parallel and mutually exclusive, so that
if the subject makes one choice, he is
excluding the other possibilities presented.”

Biber and Lewis (1949) used a picture-interview
technique to study children’s
feelings and their functioning
in different kinds of classroom atmospheres.
The child was presented with
a series of pictures, each depicting a
different kind of problem situation occurring
in the classroom. The interviewer
told what was happening in the
picture and asked the child to elaborate 
or tell what would happen next. For 
example, in studying reactions to 
trouble situations, they used a picture 
of a sick boy, and presented it with the 
following explanation: “This little boy 
is not feeling well. He is coming up to 
the teacher and telling her he has a 
stomach-ache. What do you think the 
teacher is saying to him?” 

The authors point up a number of 
methodological implications in this kind 
of approach. One must be aware 
of the possibility that the responses may 
reflect either the real practices in the 
classroom or the feelings that are displaced
from the child’s broader life experience.
Thus in their responses the
children may be projecting onto the
children may be projecting onto the 
teacher-figure expectations derived 
from their out-of-school activities with 
adults. In using any kind of semiprojec- 
tive technique, one must be aware of 
potential distortion and attempt to 
utilize pictures and question forms that 
will minimize this possibility. Biber and
Lewis also point out the difficulties in
developing situations and questions that
are equivalent for exploring a given
area of feeling. For example, they found
the intensity of response quite different 
toward two trouble pictures they had 
considered essentially equivalent. A 
situation depicting a lost bracelet 
aroused much more intense feeling than 
one portraying illness in a child. This 
observation emphasizes the necessity for 
careful pretesting of questions and situa- 
tions. 

Another variation of the picture tech- 
nique is that used by Radke, Trager, and 
Davis (1949) in a study of children’s 
perceptions of the social roles of 
Negroes and whites. The children were 
asked to interpret pictures portraying 
white and Negro children in a variety 
of social situations. Some pictures were 
ambiguous and others were highly 
structured. The pictures presented 
varied alternatives regarding interracial 
relations. For example, one picture 
showed white and Negro children on a 
playground clearly playing together. 
Another showed a Negro child on the
fringe of a group of white children on
the playground. The interview opened 
with the general question, “Tell me 
about this picture.” The subsequent 
questions became increasingly specific, 
e.g., “What is this little boy doing?
[...] the others. Why?” The authors em-
phasize that a child’s response to any
single picture is not in itself a sufficient 
basis for interpreting his attitudes; 
rather, the constellation of responses to 
a series of pictures must be used.

E. Hartley and associates (1948) used
a combined interview-picture method in 
research on children’s perceptions of 
overlapping social roles. A series of 
pictures of the same person in different 
roles, accompanied by verbal defini- 
tions, was presented to the child. For 
example, one series consisted of four 
pictures: (1) a father, mother and a 
child, (2) a mailman, (3) a mailman at 
work, and (4) the same man as a father 
with his child. The interviewer’s com- 
ments and questions relating to each 
picture were as follows: 


(1) “This is Jimmy with his father 
and mother.” 

(2) “Who is this? This is the mail- 
man, isn’t it?” 

(3) “What is he doing? Is he still a 
father when he is a mailman?” 

(4) “Who is this? Is he still a mail- 
man when he is a father at home?” 


In this technique, beginning with a 
simple concept familiar to the child, the 
complex concept of overlapping roles 
is developed by a step-by-step pro- 
cedure. This approach would seem to be 
promising for the study of complex at- 
titudes, feelings, and concepts in young 
children. For example, children’s un- 
derstanding of multiple motivations and 
feelings underlying parental discipline 
might be explored by presenting pic- 
tures of the scolding mother and the 
loving mother. Similarly, concepts of 
overlapping motivations determining
children’s aggressive behavior might be
studied by some adaptation of this tech-
nique.

A combination of interview and pic- 
tures has been employed successfully in 
studies of children’s perceptions of their 
peers and others in their environment. 
It has been used largely with school-age 
and adolescent children. Harris (1946) 
adapted this technique in studying the 
social perceptions of children in a 
kindergarten. In an interview setting 
she showed the young child a series of 
sketches, depicting a child in a variety 
of activities, e.g., helpful or aggressive. 
The situation was defined as a guessing 
game. Each picture was presented with 
the instructions: “This is a picture of 
someone at school. Who do you think 
it is?” If the child named a member of 
the kindergarten group, he was asked 
with reference to the picture: “What 
is... doing?” The picture choices 
were followed by a free interview 
about each child in the group, e.g., 
“John goes to your school, doesn’t he? 
Tell me something about him.” If the 
child did not respond, the experimenter 
asked, “You know John, don’t you? 
Tell me what he does at school.” The 
child was given crayons and paper or 
beads to string and told he could play 
while he talked with the experimenter. 

A variation on this technique involves 
presenting the child with a series of 
personality or trait descriptions and 
requesting him to select the picture 
of the child who best typifies a par-
ticular trait (Campbell, Yarrow, and
Yarrow, 1958). Further exploration of 
specific attitudes can be made with 
follow-up questions. This technique has 
been used with school-age and early
adolescent children. 

Pictures, like dolls, are a useful prop 
for the interviewer in involving the 
child and in maintaining interest. By 
the use of such techniques, it is possible 
to focus more specifically on a given 
topic and to give it a concreteness not 
readily attained by purely verbal means.
As in the doll-play interview, there is 
no sharp line between the use of pic- 
tures to stimulate fantasy in a projective 
technique and the use of pictures as an 
adjunct to a focused interview. In an 
enthusiasm for projective techniques, 
schematic or ambiguous pictures are 
sometimes used when they are inap-
propriate for the research problem.
Such unstructured material in stimulat- 
ing fantasy is likely to produce re- 
sponses not directly relevant to the 
limited research questions being studied. 
If our objective, in using pictures, is to 
facilitate the child’s verbalization on a 
well-defined topic, then structured pic- 
tures may be more useful than am- 
biguous ones in helping the child to 
focus on specific content. We can also 
limit the range of response to a picture 
by verbally reinforcing the content 
area illustrated in the picture, e.g., “This 
boy is trying to get the wagon away 
from the girl.” These suggestions are 
applicable if we are not interested in 
eliciting projective responses or un- 
conscious material but are specifically 
directing our efforts toward obtaining 
data on conscious attitudes and con- 
cepts. (The use of pictures in projective 
techniques is discussed in Chapter 15 in 
this book.) 

Group Interviews. For certain kinds 
of problems, such as the study of modal 
values or attitudes of a given age group,
an interview conducted in a group
setting has real value. For latency-
period children and those in early
adolescence, the group atmosphere
might have a facilitating effect.

L. Murphy (1956) describes an inter- 
esting integration of doll play and group 
interview. Three or four children are
brought to the experimental room and
given an opportunity to play house,
each child selecting his own role. The
fantasy is focused around the handling
of two baby dolls. Questions are thrown
out to the group naturally at ap-
propriate points in relation to the on-
going play: open questions about
parental roles, such as, “What do mom-
mies do? What do daddies do?”; more
specific questions about feelings, such 
as, “Everybody gets mad sometimes. 
What makes you mad?” The experi- 
menter enters into the situation by 
assuming one of the roles not taken by 
the children. Murphy feels that the
group situation not only has a facilitating
effect on the production of fantasy but
also decreases the amount of
anxiety that might otherwise be aroused
by this kind of direct questioning.

A group interview with puppets has 
been developed by R. Hartley (1952) in 
eliciting children’s feelings and attitudes 
about a variety of common adjustment 
situations: the arrival of a new baby, 
peer aggression, and resentment about 
separation from a parent. Puppet plays 
were presented to groups of five or six 
children, using a family of dolls—a 
mother, a father, a boy, a girl, a baby, a 
monkey, and an unidentified character 
adaptable to any special role required. 
After the basic situation was presented 
by manipulation of the puppets, with
appropriate verbalization (scripts were 
developed for each of the problems), 
the identification character expressed his 
feelings about the situation, e.g., harsh 
treatment by a parent. The children 
were then encouraged to verbalize their 
feelings and asked to indicate how the 
situation should be handled. Hartley 
points out the necessity for using this 
technique sensitively to avoid arousing 
in the children disturbing emotions 
with which the interviewer cannot 
cope. When using this technique, care- 
ful consideration should be given to the 
content of the plays as well as to the 
composition of the groups, e.g., anxious 
or timid children might be over- 
whelmed by aggressive themes or 
vigorous expression of hostility by 
other members of the group. 

The Life Space Interview. A creative 
variation on the standard interview 
approach has been introduced by Redl 
(1959) in research and therapy on dis- 
turbed delinquent children. This ap- 
proach, the “life-space interview,” 
attempts to assess the child’s reactions 
and feelings about a given situation, not 
retrospectively after some time lapse, 
but immediately after the occurrence 
of the incident and preferably in the 
physical setting in which it has oc-
curred. In this concrete setting the
child’s perception of the event and his 
feelings about it are explored. Clearly, 
the degree of affective involvement is 
much greater in this situation than in 
the aseptic office environment in which 
interviewing is ordinarily carried out. 
Presumably, the depth of feeling the 
child will express under these circum-
stances will differ significantly from
that expressed in the conventional inter-
view environment.

There are many possible adaptations 
of this interview technique in research 
with young children. One would be to 
try to “catch” a child’s reaction im- 
mediately after an event has occurred, 
e.g., his first sight of a new sibling, his
first day in school, or an aggressive
encounter with another child. Another
variation would be to create or re-create 
experimentally a situation involving 
frustration or parental discipline and 
then to interview the child in the ex-
perimental setting. Although these sug-
gested adaptations of the life-space
interview are especially appropriate for
use with the preschool child because of
his limitations in time perspective, they have
relevance for the entire age span, par-
ticularly when it is desired to obtain the
child’s feelings about a previous ex- 
perience. There are, of course, some 
kinds of research problems in which 
maximum recall is not desired; for ex- 
ample, when we are primarily interested 
in the child’s selective perceptions and 
distortions about a past event. 


NONVERBAL COMMUNICATION 
IN THE INTERVIEW 


Observation of behavior can be used 
meaningfully to supplement the verbal 
data obtained during the interview. 
Children are less likely than adults to 
attempt consciously to control or dis- 
guise their behavior, and they have not 
internalized so completely as adults the 
techniques that are often used uncon- 
sciously to cover up significant feelings, 
attitudes, and motivations. Observation 
of the child’s behavior during the or-
dinary interview may be used in several
ways. The child’s reactions to a specific
question may be helpful in interpreting
the significance of the question to him
or in evaluating the validity of his re-
sponse. On another level, the child’s
reactions to the variety of stimuli im-
pinging on him during the conventional
interview may offer clues to motiva-
tional and personality characteristics,
e.g., ways of handling stress, patterns of
dominance and submission in inter-
personal relationships, and charac-
teristic defenses. The validity of these
clues will depend on the adequacy of
the behavior sample, the sensitivity of
the interviewer in making relevant ob-
servations, and his skill in drawing in-
ferences from raw behavior about
motivational and personality functions. 

The interview may be used as a 
standardized setting for observing 
specific aspects of behavior. For ex- 
ample, in a controlled interview there 
may be systematic observation of the 
child’s capacity to relate to a stranger, 
or the observation may focus on de- 
pendency in relating to the interviewer 
in a problem-solving situation. Inter- 
view settings have been creatively 
utilized as experimental or semiexperi- 
mental situations in which the subject 
may be systematically subjected to 
special stimulus-conditions relevant to 
the research, e.g., stress. 

Although lip service has traditionally 
been given to the value of behavioral 
observation during the interview, there 
have been few systematic attempts to 
develop reliable behavioral indices. 
Recently a number of efforts at
systematizing diagnostic clues have been
based on expressive behavior in the
communication process (Birdwhistell,
1952a,b; Wolff, 1946; Smith, 1956).
Despite these attempts at systematiza- 
tion, the use of nonverbal cues in the 
interview to assess personality charac- 
teristics or to judge the validity of 
verbal responses remains essentially an 
intuitive art. 

Nonverbal cues characteristically
used in assessing behavior can be sum-
marized under the following major
categories:


1. Physical—posture, e.g., freedom-
constraint; gestures, facial expression,
e.g., somber-gay; speed of movement; 
vigor of movement; energy output; 
motor coordination; area of free move- 
ment, e.g., whether the child is expan- 
sive or restricted in his use of space. 

2. Physiological—vasomotor insta- 
bility, e.g., blushing, perspiring; mus- 
cular tension. 

3. Formal characteristics of language 
—e.g., loquacity, fluency, tempo, speed 
of verbal output. 

4. Interactional behavior—ways of 
relating to the interviewer, e.g., at- 
tempts to control the situation, resist- 
ance, compliance, dominance, passivity, 
dependency, withdrawal. 

5. Personality style—e.g., spontane- 
ous, inhibited, compulsive, loosely or- 
ganized, focused, scattered. 


All of these nonverbal cues are de- 
pendent on subjective judgments, and 
adequate criteria of validity are dif- 
ficult to establish, particularly for the 
categories interactional behavior and 
personality style, which are on a more 
complex level, are further removed 
from direct observation, and require 
high-order inferences from behavior.

Objectivity of observations can be
increased by introducing some stand- 
ardization in the observation procedures 
by the use of observation guides or 
standardized observational or rating 
techniques. These techniques will in- 
crease the likelihood that the same 
aspects of behavior will be noted by 
different interviewers and from one in- 
terview situation to another. Obviously, 
all of the relevant cues cannot be in- 
cluded in a guide or rating scale, and in 
some kinds of research problems there 
can be latitude for free observation. 
Observer reliability can be determined 
by standard procedures, e.g., having a 
second observer make simultaneous ob- 
servations behind a one-way screen. For 
some kinds of research, the use of an 
observer to record interview content 
may be a standard part of the research 
design. (See detailed discussion of ob-
servational techniques in Chapter 3.)


RECORDING INTERVIEW 
RESPONSES 


An adequate transcript of the inter- 
view content is a basic requirement for 
valid analysis of the data. Our major 
concern is to record the interview con- 
tent and the respondent’s behavior ac- 
curately in such a way that the record- 
ing procedure will not interfere with 
the interview relationship. The main 
questions center around the timing and 
the content of the recording, i.e., 
whether it should be done during or 
after the interview and whether it 
should be verbatim or selective or in 
terms of a pre-established code. 

All the available methodological data
emphasize the loss in accuracy when
the recording of an interview is based
on recall of content after completion 
of the interview (Symonds and Die- 
trich, 1941; Payne, 1949). It is im- 
portant, therefore, that it be done at 
the time of the interview. Children’s 
reactions to the examiner’s writing dur- 
ing the interview will vary somewhat at 
different age levels and with different 
kinds of interview material. Children of 
preschool age tend to adapt rather 
quickly, and the interviewer’s writing 
does not, in general, seem to inhibit 
their communication. During preadoles- 
cence and adolescence it is more likely 
to have an inhibiting effect. It may be 
reassuring to the child if the interviewer 
is himself comfortable about writing 
and can do it unobtrusively. At all ages 
children’s questions about the writing 
must be answered casually and without 
evasiveness. Some variation on the 
simple statement, “I am writing down 
some of your ideas so I won’t forget 
them,” is applicable to most interview 
situations. With older children, assur- 
ance of the confidentiality of the 
material might be in order. On the 
whole, if the relationship is a good one, 
the content not too threatening, and 
the recording mechanics not too ob- 
trusive, children tend to accept the 
examiner’s writing rather easily. 

The amount of detail recorded will 
depend on its ultimate use. In studying 
children’s vocabularies or language de- 
velopment, or in some kinds of studies 
of concept development, verbatim re- 
cording may be essential. In dynam- 
ically oriented research on personality 
development, verbatim recording may 
be important to the extent that the idio- 
syncratic choice of words is significant 
or the sequence of ideas may have mean-
ing in the analysis. For studies in which
the focus is on very specific limited
content, brief recording on an inter-
view guide or direct recording on a
rating or code sheet may be the most 
efficient procedure. 

When a detailed account of the inter- 
view is desired, a common procedure 
is to jot down key words and phrases 
or summary notes in the appropriate 
sequence and fill in the details im- 
mediately after the interview. This is 
not nearly so satisfactory as full record-
ing during the interview by mechanical
means or by an observer-recorder
present in the interview room or behind 
a one-way screen. The use of a me- 
chanical recording device is probably 
the most satisfactory solution to verba-
tim recording. This approach has
proved especially useful in the free,
unstructured interview. However, some 
concern has been voiced among re- 
search workers as to the effects of the 
electronic recording procedure on rap- 
port in the interview situation. There is 
a feeling among many clinically 
oriented interviewers that use of a re- 
cording machine may be inhibiting to 
the respondent. Although this problem 
has not been systematically studied, the 
impression has been growing among re- 
searchers who have used electronic re- 
cording with adults, as well as with 
children, that there is fairly quick 
adaptation if the recording device is 
not too intrusive. In fact, it is thought
that the interview may proceed more 
smoothly when the interviewer is free 
to devote his full attention to the con- 
tent of the responses and to observation 
of the respondent’s behavior. When a
recording device is used with children,
their questions about the technique
should be answered in a way to satisfy
their curiosity quickly so that they will
not be distracted by the machine. Fre- 
quently children are eager to hear their 
own voices, and the promise of a play- 
back at the end of the interview may 
serve as added motivation for coopera- 
tion. There are limitations in the use 
of electronic equipment in recording 
children’s interviews. Even the most
sensitive recording equipment some-
times fails to give a comprehensible
record of speech that is full of articu-
latory distortions and in which there
is much variation in loudness. A
stenographic record by a stenographer
trained to understand children’s speech
may be more reliable. Before deciding
on the use of electronic equipment, it
is best to pretest the recording under
conditions simulating the actual in-
terview.


ANALYSIS OF INTERVIEW DATA 


Data collection and data analysis are 
not completely independent processes. 
Ideally, collection procedures are 
planned within a research design in 
which the analysis problems and tech- 
niques have been anticipated. The form 
in which the data are collected and re- 
corded will determine the kinds of 
analyses that will be possible. Although 
the interview is particularly well 
adapted to the study of the unique as- 
pects of a child’s world, its use is not 
limited to individual case study. It is 
possible to pool data that give insight 
into the unique worlds of individual 
children and to build up meaningful 
norms for a developmental stage or
derive generalizations significant for 
broader theoretical issues. 

Analysis of interview data recorded 
in terms of simple categories, such as 
“approve, disapprove,” presents no 
unique problems; it is basically similar 
to analysis of any directly quantifiable 
data. We can tally the frequencies with 
which a given response occurs for an 
age or sex group or for experimental or 
control groups and then apply the 
statistical procedures appropriate for 
the research problem, e.g., correlational 
analysis or analysis of the significance 
of differences between control and ex- 
perimental groups. On the other hand, 
qualitative data of the sort obtained in 
open-ended interviews require special 
techniques for translating the data to a 
form amenable to statistical treatment. 
The coding procedures may be quite 
complex, involving a number of steps 
in the transformation of qualitative data 
to quantitative form. Methods of 
analysis of qualitative material obtained 
from adults have been described in a 
number of articles (Berelson, 1954; 
Cartwright, 1953; Guetzkow, 1950; 
Schutz, 1950; Spiegelman et al., 1953; 
White, 1951). These procedures of 
qualitative analysis are adaptable to 
analysis of children’s interviews. It is 
important, however, when coding chil- 
dren’s interview responses to be es- 
pecially sensitive to possible differences 
between children and adults in concepts 
and frames of reference. There is danger 
that the categories chosen by the adult 
coder may not adequately represent the 
child’s frame of reference, thus de- 
creasing the validity of the coding. 
Appropriate definition of the codes and 
careful selection and training of coders
will help minimize such distortions. 
Knowledge of language and concepts 
characteristic of the age level being 
studied is a desirable qualification of the 
coders. 
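
The tallying of simple response categories mentioned above can be sketched briefly. The responses, group labels, and function names below are invented for illustration, and the Pearson chi-square statistic is shown only as one example of a procedure that might be chosen; it is not attributed to any study cited here, and a sample this small would not justify the test in practice.

```python
from collections import Counter

# Hypothetical coded responses: (group, category) pairs invented for illustration.
responses = [
    ("experimental", "approve"), ("experimental", "approve"),
    ("experimental", "disapprove"), ("experimental", "approve"),
    ("control", "disapprove"), ("control", "approve"),
    ("control", "disapprove"), ("control", "disapprove"),
]


def tally(pairs):
    """Count how often each category occurs within each group."""
    table = {}
    for group, category in pairs:
        table.setdefault(group, Counter())[category] += 1
    return table


def chi_square(table):
    """Pearson chi-square statistic for a group-by-category frequency table."""
    groups = sorted(table)
    categories = sorted({c for counts in table.values() for c in counts})
    row_totals = {g: sum(table[g].values()) for g in groups}
    col_totals = {c: sum(table[g][c] for g in groups) for c in categories}
    n = sum(row_totals.values())
    stat = 0.0
    for g in groups:
        for c in categories:
            expected = row_totals[g] * col_totals[c] / n
            stat += (table[g][c] - expected) ** 2 / expected
    return stat


table = tally(responses)
print(table["experimental"]["approve"], table["control"]["approve"])
print(round(chi_square(table), 3))
```

The statistic would then be referred to a chi-square table with the appropriate degrees of freedom; for qualitative interview material, this step becomes possible only after the responses have been coded.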

In this chapter we can sketch only 
the main steps in coding qualitative 
data. For detailed discussion of coding 
procedures, the references cited above 
should be consulted. 

Before developing a coding outline, 
it is necessary to have a tentative design 
for analysis based on the major ques- 
tions of the study. In the development 
of the coding scheme a first step is to 
list the variables relevant to the prob-
lems or hypotheses of the study. The
variables may evolve out of a systematic 
theoretical framework, or they may be 
empirically derived from careful in- 
spection of the data. Empirically de- 
rived variables are likely to be more 
limited than variables derived from 
theoretical considerations. A theo- 
retically based set of variables often 
helps alert the coder to covert mean- 
ings in the data that might be over- 
looked in a purely empirical approach. 

A major step in coding is the transla- 
tion of the research variables into ap- 
propriate categories for analysis. There 
are many systems of categorization,
ranging from the use of simple dichot- 
omous categories to those on a com- 
plex continuum. The most appropriate 
categorization system will depend some- 
what on the problem and on the form 
of the available data. The simplest 
type of coding system uses dichotomous 
categories, in which the data are ana- 
lyzed simply in terms of the presence 
or absence of a given characteristic, 
e.g., anxiety about separation from the
mother or no anxiety. On a somewhat 
more complex level the responses are 
rated or ranked in terms of intensity, 
usually on a three-to-five-point scale:
very high anxiety, moderately high 
anxiety, etc. 

In many kinds of analyses the cate-
gories are qualitatively different rather
than on a simple continuum. For ex- 
ample, if we are interviewing children 
about their first reactions to entering 
nursery school, the interview data might 
be analyzed with reference to different 
foci of anxiety: anxiety about separa-
tion from the mother, anxiety about
being displaced by a younger sibling,
or anxiety about own abilities in rela-
tion to peers.
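
The three kinds of category systems just described (dichotomous, rated on a continuum, and qualitatively different) can be sketched as data types. This is only an illustrative sketch: the class names, the five scale labels, and the sample records are assumptions; the text supplies only the anxiety examples.

```python
from collections import Counter
from dataclasses import dataclass
from enum import Enum, IntEnum
from statistics import median
from typing import Optional


class SeparationAnxiety(Enum):
    """Dichotomous category: presence or absence of a characteristic."""
    ABSENT = 0
    PRESENT = 1


class AnxietyIntensity(IntEnum):
    """Ordered rating; the five labels are hypothetical."""
    VERY_LOW = 1
    MODERATELY_LOW = 2
    MODERATE = 3
    MODERATELY_HIGH = 4
    VERY_HIGH = 5


class AnxietyFocus(Enum):
    """Qualitatively different categories; no order is implied."""
    SEPARATION_FROM_MOTHER = "separation from the mother"
    DISPLACEMENT_BY_SIBLING = "displacement by a younger sibling"
    OWN_ABILITIES = "own abilities in relation to peers"


@dataclass(frozen=True)
class CodedResponse:
    child_id: str
    anxiety: SeparationAnxiety
    intensity: Optional[AnxietyIntensity] = None
    focus: Optional[AnxietyFocus] = None


# Invented records for illustration only.
records = [
    CodedResponse("c01", SeparationAnxiety.PRESENT,
                  AnxietyIntensity.MODERATELY_HIGH, AnxietyFocus.SEPARATION_FROM_MOTHER),
    CodedResponse("c02", SeparationAnxiety.PRESENT,
                  AnxietyIntensity.VERY_HIGH, AnxietyFocus.OWN_ABILITIES),
    CodedResponse("c03", SeparationAnxiety.PRESENT,
                  AnxietyIntensity.MODERATELY_LOW, AnxietyFocus.SEPARATION_FROM_MOTHER),
    CodedResponse("c04", SeparationAnxiety.ABSENT),
]

# Ordered ratings can be ranked, so a median is meaningful.
median_intensity = median(r.intensity for r in records if r.intensity is not None)

# Qualitative categories can only be counted.
focus_counts = Counter(r.focus for r in records if r.focus is not None)

print(median_intensity, focus_counts[AnxietyFocus.SEPARATION_FROM_MOTHER])
```

Keeping the ordered and the qualitative categories in separate types makes it harder to average values that have no order.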

In the actual coding procedure the
selection of meaningful units of analy-
sis is a major methodological considera-
tion. We need to set up criteria for
choice of psychologically meaningful 
response units in the interview. In deal- 
ing with interview responses, the prob- 
lems are basically similar to those in- 
volved in choice of meaningful behavior 
units in observational studies (Barker 
and Wright, 1954; Wright and Barker, 
1950). For some categories, a response 
to a single question may be the ap- 
propriate unit of analysis; for more 
complex categories, responses to several 
questions may be taken as the unit of 
analysis. Cartwright (1953) makes a 
useful distinction between the unit of 
analysis and the context unit—the 
frame of reference within which a 
single analysis unit is interpreted. The 
context unit may be much larger than 
the analysis unit. The interpretation of a 
response will differ when our frame of
reference is a single question and when
it is analyzed in the context of the total 
interview. 

Frequently it is possible to use the 
same unit of analysis in several different 
codes, especially responses in open- 
ended interviews. A response such as 
the following to a single open-ended 
question about school could be analyzed 
in terms of several categories: “Well, 
it’s hard because the other kids are 
studying all the time trying to get high 
marks and I don’t like to have to study 
so much.” We can use this response to
code separately the child’s attitude
toward school, his feelings about his 
peers, and his achievement orientation. 
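
A single unit of analysis carrying several codes can be represented quite simply. In the sketch below the record layout, the identifier, the question wording, and the category labels assigned to the response are all hypothetical; the text states only that the response can be coded on these three dimensions, not which categories it would receive.

```python
from dataclasses import dataclass, field


@dataclass
class AnalysisUnit:
    unit_id: str
    question: str
    text: str
    # Maps a code dimension to the category assigned by the coder.
    codes: dict = field(default_factory=dict)


unit = AnalysisUnit(
    unit_id="child07-q3",                # hypothetical identifier
    question="Tell me about school.",    # hypothetical question wording
    text=("Well, it's hard because the other kids are studying all the time "
          "trying to get high marks and I don't like to have to study so much."),
)

# The same response is coded separately on three dimensions.
# The category labels are illustrative judgments, not taken from the text.
unit.codes["attitude_toward_school"] = "finds school hard"
unit.codes["feelings_about_peers"] = "sees peers as competitive"
unit.codes["achievement_orientation"] = "dislikes heavy study"


def values_for(units, dimension):
    """Collect the category assigned on one dimension across all units."""
    return [(u.unit_id, u.codes[dimension]) for u in units if dimension in u.codes]


print(values_for([unit], "feelings_about_peers"))
```

Storing the codes by dimension lets each dimension be tallied independently while the original response remains available as context.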

In assigning a given unit of analysis 
to the appropriate category, clear-cut 
definitions of the categories are es-
sential. Conceptual as well as concrete
definitions of the categories are de- 
sirable. When fine differentiations are 
required between dimensions of a cate- 
gory, illustrations are useful to define 
its limits. Definitions should be broad 
enough to encompass the psychological 
meaning of the category, yet sufficiently 
definitive to avoid overlapping with 
other categories. Responses may be 
coded in terms of manifest content, or 
the coding may require complex in- 
ferences about attitudes, feelings, or 
personality characteristics. For ex- 
ample, in a study of perception of peers, 
the child’s response to a question about 
another child may be, “I like him; he’s 
always making me laugh; he’s lots of 
fun, always cheers you up.” On one 
level this might be coded simply as a 
positive response to the other child. 
On a manifest level, in terms of percep-
tion of valued characteristics in chil-
dren, this response might be coded as
indicating attribution of high value to 
having a sense of humor or being cheer- 
ful. On another level, this response 
could be coded in terms of valuing 
others in relation to their need-gratifica-
tion potential, i.e., the ability to gratify
one’s own needs.

The reliability of the coding can be 
established by the standard procedures 
for rater reliability—by having two or 
more raters independently code the 
same sample of data and then determin- 
ing the degree of correspondence 
among their ratings. The validity of a
code is much more difficult to establish. 
Ideally, validation requires comparison 
of the coded data with completely in- 
dependent data about the child, e.g., 
data based on direct observation or 
parental reports about the child. In 
many circumstances it is not possible 
to obtain completely independent 
data to use as criteria of validity. More- 
over, in order to use such criteria 
to establish coding validity, we must 
first assume the validity of the inter- 
view. Practically, the best way to assure 
coding validity is to choose competent 
coders and provide them with adequate 
training and well-defined codes. Dis-
cussion among coders after some ex-
perience with the coding outline will
help clarify difficult categories and im-
prove reliability and validity. This pro-
cedure permits informal determination
of the reliability of the categories as 
well as of the coders. Categories on 
which there are consistent disagree- 
ments among the coders may require 
redefinition or may have to be dis- 
carded. 
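
The degree of correspondence between two coders can be summarized in several ways. The sketch below computes simple percent agreement and Cohen's kappa, a chance-corrected index that is our choice for illustration and is not named in the text; it also counts which categories figure in disagreements, as a rough aid in spotting categories that may need redefinition. The codings and category labels are invented.

```python
from collections import Counter

# Hypothetical codings of the same ten response units by two coders.
coder_a = ["pos", "pos", "neg", "neu", "pos", "neg", "neu", "pos", "neg", "pos"]
coder_b = ["pos", "pos", "neg", "neg", "pos", "neg", "neu", "neu", "neg", "pos"]


def percent_agreement(a, b):
    """Proportion of units on which the two coders assign the same category."""
    return sum(x == y for x, y in zip(a, b)) / len(a)


def cohens_kappa(a, b):
    """Agreement corrected for the agreement expected by chance.

    Undefined (division by zero) if chance agreement equals 1, e.g. when
    both coders use a single category throughout.
    """
    n = len(a)
    observed = percent_agreement(a, b)
    count_a, count_b = Counter(a), Counter(b)
    expected = sum(count_a[c] * count_b[c] for c in set(a) | set(b)) / (n * n)
    return (observed - expected) / (1 - expected)


def disagreements_by_category(a, b):
    """Count how often each category is involved in a disagreement."""
    counts = Counter()
    for x, y in zip(a, b):
        if x != y:
            counts[x] += 1
            counts[y] += 1
    return counts


print(percent_agreement(coder_a, coder_b))
print(round(cohens_kappa(coder_a, coder_b), 4))
print(disagreements_by_category(coder_a, coder_b).most_common())
```

A category that appears in most of the disagreements is a natural candidate for discussion among the coders before the main coding begins.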

Although it might be assumed that 
coders’ knowledge of the research hy-
potheses would improve the validity of
the coding, there are some conditions 
under which it may be important that 
the coders be unfamiliar with the hy- 
potheses being tested. It may also be 
necessary to conceal from the coders 
the identity of the subjects, i.e., whether 
they are in the experimental or control 
group. There is some question whether 
coding should be done by the person 
who has collected the data because of 
the possibility of his judgments being 
influenced by factors outside the re- 
corded material, i.e., subjective impres- 
sions obtained during the interview. If 
his impressions represent valid recall of 
significant nonverbal cues in the inter- 
view, the validity of the coding might 
be increased; however, such impres- 
sions may also tend to bias the coding 
because of distortions in recall or “halo 
effects.” Furthermore, the determina- 
tion of reliability is made more difficult 
by the introduction of bases for judg- 
ment that are not part of the recorded 
data. The relative advantages and dis- 
advantages of using coders with outside 
knowledge of the data need to be 
evaluated for a specific research prob- 
lem. 
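
Where it is decided that coders should not know which subjects belong to the experimental or control group, the records can be relabeled before coding. The following is a minimal sketch under assumed conventions: the field names (`subject`, `group`, `transcript`), the label format, and the function name are hypothetical, and the key would be held by someone other than the coders.

```python
import random


def mask_subjects(records, seed=None):
    """Shuffle records and replace identifying fields with neutral labels.

    Returns (masked, key): `masked` goes to the coders; `key` maps each
    label back to (subject, group) and is withheld until coding is done.
    """
    rng = random.Random(seed)
    shuffled = list(records)
    rng.shuffle(shuffled)  # order should not reveal group membership
    masked, key = [], {}
    for i, rec in enumerate(shuffled, start=1):
        label = f"R{i:03d}"
        key[label] = (rec["subject"], rec["group"])
        masked.append({"label": label, "transcript": rec["transcript"]})
    return masked, key


# Invented records for illustration only.
records = [
    {"subject": "John", "group": "experimental", "transcript": "..."},
    {"subject": "Mary", "group": "control", "transcript": "..."},
    {"subject": "Jimmy", "group": "experimental", "transcript": "..."},
]

masked, key = mask_subjects(records, seed=1)
print([m["label"] for m in masked])
```

Masking of this kind removes only the explicit labels; a transcript may still reveal the subject or the condition through its content.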


ADVANTAGES AND LIMITATIONS 
OF THE INTERVIEW


The choice of a research technique 
for use with children, whether an ob- 
servational approach, a questionnaire, 
or an interview, depends on the kind 
of problem and the level of data needed 
to answer the research questions. Some 
of the values of the interview, as well 
as its limitations, have been suggested. 
A comparison of the interview with
other techniques frequently used in 
studying similar problems with children 
may help point up some values of the 
technique. 

Compared with the observational ap- 
proach, which has been used with much 
greater frequency in child development 
research, the interview has some
clear-cut advantages. It permits study of
subjective phenomena not amenable to 
direct observation, such as aspirations, 
wishes, dreams, and anxieties. The in- 
terview also enables meaningful study 
of a wider time span in the child’s life 
than is possible with any cross-sectional 
observational technique. In comparison 
with the observational approach, the 
interview is much less vulnerable to 
time-sampling errors and is probably 
less sensitive to fluctuations in mood or 
to the impact of recent experiences. 

In comparing the interview with the 
written questionnaire, a number of ad- 
vantages are apparent. In the interview 
situation it is possible for the inter- 
viewer to be aware of the child’s lack 
of comprehension or misunderstanding 
of a question and to be in a position to 
clarify the question. Better control of 
the influence of question sequence and 
the context of questions is possible in the 
interview than in the questionnaire. In 
the questionnaire the respondent may 
read all of the questions before re- 
sponding. Thus his response to any one 
question may be influenced by the con- 
tent of later questions. In regard to the 
depth of data obtainable, there are con- 
tradictory opinions, based on limited 
findings, on the relative advantages of 
the interview and the questionnaire. For 
questions that might arouse emotional 
resistance, it has been generally as-
sumed that the direct person-to-person
relationship in the interview will reduce
resistance and facilitate the child’s re-
sponses, although some findings with
adults suggest that some kinds of
ego-[...] data are given more freely in
an anonymous questionnaire (Ellis, [...]).
Certainly, further research is [...] in
which the character of the relationship
with the interviewer is [...] controlled.
To the extent that the interview permits
meaningful probing on leads given by the
respondent, it is likely to yield data at
a deeper level than most written
questionnaires. [See Ellis (1946) for an
evaluation of the uses and limitations of
the written questionnaire.]

A cogent consideration in the choice 
of an interview over other techniques 
is whether the problem is one requiring 
an interpersonal relationship for ob- 
taining the desired data or one in which 
the relationship may contribute sub- 
stantially to the validity of the findings. 
The ultimate value of the interview as 
a research tool is dependent on the 
interviewer’s knowledge of develop- 
mental psychology and his ability to 
apply this knowledge sensitively in re- 
lating to children. 


REFERENCES 


Ammons, Carol H., and R. B. Ammons. 1952. 
Research and clinical applications of the 
doll-play interview. J. Psychol. 21, 85-90. 

Ammons, R. B. 1950. Reactions in a projective 
doll-play interview of white males two to 
six years of age to differences in skin color 
and facial features. J. genet. Psychol., 76, 
323-341. 




Axline, Virginia. 1947. Play therapy. Boston: 
Houghton Mifflin. 

Baldwin, A. L. 1955. Behavior and develop-
ment in childhood. New York: Dryden.

Barker, R. G., and H. F. Wright. 1954. Mid- 
west and its children. Evanston, Ill.: Row, 
Peterson. 

Bender, Lauretta. 1953. Aggression, hostility, 
and anxiety in children. Springfield, Ill.:
Thomas. 

Berelson, B. 1954. Content analysis. In G. Lind- 
zey (Ed.), Handbook of social psychology. 
Cambridge: Addison-Wesley. 1, 488-522. 

Biber, Barbara and Claudia Lewis. 1949. An 
experimental study of what young school 
children expect from their teachers. Genet. 
Psychol. Monogr., 40, 3-97. 

Biber, Barbara, Lois Murphy, Louise Wood- 
cock, and Irma Black. 1952. Life and ways 
of the seven to eight year old. New York: 
Basic Books. 

Bingham, W. V., and B. V. Moore. 1941. 
How to interview. (Rev. ed.). New York: 
Harper. 

Birdwhistell, R. N. 1952a. Body motion re-
search and interviewing. Hum. Organiz., 11,
37-38.

Birdwhistell, R. N. 1952b. Introduction to kinesics (An
annotation system for analysis of body 
motion and gesture). Washington, D. C.: 
Foreign Service Institute. 

Buckingham, B. R. and E. W. Dolch. 1936. 
A combined word list. Boston: Ginn. 

Campbell, J., L. Yarrow, and Marian Radke 
Yarrow. 1958. A study of adaptation to a 
new social situation. J. soc. Issues, 14, 3-7. 

Cantril, H. 1940. Experiments in the wording 
of questions. Public Opinion Quarterly, 4, 
330-332. 

Cannell, C. F., and R. L. Kahn. 1953. The 
collection of data by interviewing. In L. 
Festinger and D. Katz (Eds.), Research 
methods in the behavioral sciences. New
York: Dryden. Pp. 327-380. 

Cartwright, D. 1953. Analysis of qualitative 
material. In L. Festinger and D. Katz (Eds.),
Research methods in the behavioral sciences.
New York: Dryden. Pp. 421-470. 

Conn, J. H. 1938. The child speaks to the
psychiatrist: An introduction to the method
of the play-interview. Occup. Ther., 17,
231-244.




Conn, J. H., and L. Kanner. 1947. Children’s
awareness of sex differences. J. Child Psy-
chiat., 1, 3-57.

Crutchfield, R., and D. H. Gordon. 1947.
Variations in respondents’ interpretations of 
an opinion-poll question. Int. J. Opin. Atti- 
tude Res., 1, 1-12. 

Dennis, W. 1942. Piaget’s questions applied 
to a child of known environment. J. genet. 
Psychol., 60, 307-320. 

Despert, J. Louise. 1940. A method for the 
study of personality reactions in preschool 
children by means of analysis of their 
play. J. Psychol., 19, 17-29. 

Ellis, A. 1946. The validity of personality
questionnaires. Psychol. Bull., 43, 385-440. 

Ellis, A. 1947. A comparison of the use of
direct and indirect phrasing in personality
questionnaires. Psychol. Monogr., 61, No. 2
(Whole No. 284).

Ellis, A. 1953. Recent research with personality
inventories. J. consult. Psychol., 17, 45-49.

Fauls, L. B., and W. D. Smith. 1956. Sex-role 
learning in five year olds. J. genet. Psychol., 
89, 105-119. 

Faust, Margaret S. 1951. A study of father- 
child relations using a story-completion 
technique. Unpublished master’s thesis, Stan- 
ford Univer. 

Fleming, L., and W. U. Snyder. 1947. Social
and personal changes following non-direc- 
tive group play therapy. Amer. J. Ortho- 
psychiat., 17, 101-116. 

Geleerd, E. R. 1957. Some aspects of psycho-
analytic technique in adolescence. In R.
Eissler et al. (Eds.), Psychoanalytic study 
of the child. New York: International 
Univer. Press, 12, 263-283. 

Getzels, J. W. 1951. The assessment of per- 
sonality and prejudice by the method of 
paired direct and projective questions. Un- 
published doctoral thesis, Harvard Univer. 

Getzels, J. W. 1958. The method of paired direct and
projective questionnaires in the study of atti- 
tude structure and socialization. Psychol. 
Monogr., 72, 1, 30. 

Grace, H. A., and J. Lohmann. 1952. Chil-
dren’s reactions to stories depicting parent-
child conflict situations. Child Develpm., 23,
61-74.

Graham, F. K., W. A. Charwat, A. S. Honig,
and Paula C. Weltz. 1951. Aggression as a
function of the attack and the attacker. J.
abnorm. soc. Psychol., 46, 512-520.

Guetzkow, H. 1950. Unitizing and cate-
gorizing problems in coding qualitative
material. J. clin. Psychol., 6, 47-58. 

Hammond, K. R. 1948. Measuring attitudes by 
error-choice, an indirect method. J.
abnorm. soc. Psychol., 43, 38-48. 

Harris, Esther K. 1946. The responsiveness of 
kindergarten children to the behavior of 
their fellows. Monogr. Soc. Res. Child 
Develpm., 11, No. 2. 

Hartley, E. L., M. Rosenbaum, and S.
Schwartz. 1948. Children’s perceptions of
ethnic group membership. J. Psychol., 26,
387-398.

Hartley, E. L., and S. Schwartz. 1951. A
pictorial doll-play approach for the study
of children’s intergroup attitudes. Int. J.
Opinion Attitude Res., 5, No. 2.

Hartley, Ruth E. 1959. Development of con-
cepts of women’s social roles. Unpublished
manuscript. New York: City College of
New York.

Hartley, Ruth E. 1952. Understanding children’s play.
New York: Columbia Univer. Press.

Hartley, Ruth E., L. K. Frank, and R. M. Goldenson.
1952. New play experiences for children.
New York: Columbia Univer. Press.

Horowitz, Ruth. 1943. A pictorial method for 
study of self identification in preschool 
children. J. genet. Psychol., 62, 135-148. 

Hyman, H. 1954. Interviewing in social re- 
search. Chicago: Univer. Chicago Press. 

International Kindergarten Union. 1928. A
study of the vocabulary of children before 
entering the first grade. Baltimore: Williams 
and Wilkins. 

Jahoda, Marie, M. Deutsch, and S. W. Cook. 
1951. Research methods in social relations. 
New York: Dryden. 

Jersild, A. T., and R. Ritzman. 1938. Aspects 
of language development, the growth of 
loquacity and vocabulary. Child Develpm., 
9, 243-259. 

Kahn, R. L., and C. F. Cannell. 1957. The
dynamics of interviewing. New York: 
Wiley. 

Kates, S. L. 1951. Suggestibility, sensitiveness 
to parents and peers, and extrapunitiveness, 
intropunitiveness, and impunitiveness in 
children. J. Psychol., 31, 233-241. 




Levy, D. M. 1933. Use of play technique as
experimental procedure. Amer. J. Ortho-
psychiat., 3, 266-277.

Levy, D. M. 1936. Hostility patterns in sibling
rivalry experiments. Amer. J. Ortho-
psychiat., 6, 183-257.

Levy, D. M. 1937. Studies in sibling rivalry. Res.
Monogr. Amer. Orthopsychiat. Ass., No. 2.

McCarthy, Dorothea. 1930. The language de-
velopment of the preschool child. Inst. Child
Welf. Monogr., No. 4. Minneapolis: Univer.
Minnesota Press.

McCarthy, Dorothea. 1954. Language development in chil-
dren. In L. Carmichael (Ed.), Manual of
Child Psychology. (2nd ed.) New York:
Wiley. Pp. 492-630.

Maccoby, Eleanor E., and N. Maccoby. 1954. 
The interview: A tool of social science. In 
G. Lindzey (Ed.), Handbook of Social Psy-
chology. Cambridge: Addison-Wesley. 1,
449-487.

Merton, R. K., and P. L. Kendall. 1946. The
focused interview. Amer. J. Sociol., 51, 541-
557.

Merton, R. K., M. Fiske, and P. L. Kendall. 1956.
The focused interview. Glencoe, Ill.: Free Press.

Metraux, Rhoda W. 1950. Speech profiles of
the preschool child, 18-54 months. J. Speech
Dis., 15, 37-53.

Miller, F. M. 1952. The participant observer 
and over-rapport. Amer. sociol. Rev., 17, 
97-99. 

Moustakas, C., and H. D. Schalock. 1955. An 
analysis of therapist-child interaction in play 
therapy. Child Develpm., 26, 143-157.

Murphy, Lois B. 1956. Methods for the study 
of personality in young children. New 
York: Basic Books, Vol. I. 

Nass, M. L. 1956. The effects of three vari- 
ables on children’s concepts of causality. 
J. abnorm. soc. Psychol., 53, 191-196. 

Parten, Mildred B. 1950. Surveys, polls, and 
samples. New York: Harper.

Payne, S. L. 1949. Interviewer memory faults. 
Publ. Opin. Quart., 13, 684-685. 

Piaget, J. 1928. Judgment and reasoning in the 
child. New York: Harcourt, Brace. 

Piaget, J. 1951. The language and thought of the
child. New York: Humanities Press.

Piaget, J. 1929. Introduction to the child’s con-
ception of the world. New York: Harcourt,
Brace.




Radke, Marian J. 1946. The relation of paren- 
tal authority to children’s behavior and at- 
titudes. Inst. Child Welf. Monogr., No. 22. 
Minneapolis: Univer. Minnesota Press. 

Radke, Marian J., and Helen G. Trager. 1950. Children’s
perceptions of the social roles of Negroes
and Whites. J. Psychol., 29, 3-33.

Radke, Marian J., Helen G. Trager, and Hadassah Davis.
1949. Social perceptions and attitudes of
children. Genet. Psychol. Monogr., 40, 372-447.

Redl, F. 1959. Strategy and techniques of the 
life space interview. Amer. J. Ortho- 
psychiat., 29, 1-18. 

Rogers, C. R. 1945. The non-directive method 
as a technique for social research. Amer. J. 
Sociol., 50, 279-283.

Rogers, C. R. 1951. Client-centered therapy. Boston:
Houghton Mifflin. 

Rohde, A. R. 1946. Explorations in personality 
by the sentence-completion method. J. appl. 
Psychol., 30, 169-181. 

Rotter, J. B. 1951. Word association and 
sentence-completion methods. In H. H.
and Gladys L. Anderson (Eds.), An in- 
troduction to projective techniques. New 
York: Prentice-Hall. Pp. 279-311. 

Sanford, R. N., et al. 1943. Physique, per-
sonality and scholarship. Monogr. Soc.
Res. Child Develpm., 8, No. 1. 

Schutz, W. 1950. Theory and methodology 
of content analysis. Unpublished doctoral 
dissertation, UCLA. 

Sears, R. 1947. Influence of methodological 
factors on doll play performance. Child
Develpm., 18, 190-197.

Seaton, J. K. 1940. A projective experiment 
using incomplete stories with multiple- 
choice endings. Genet. Psychol. Monogr., 
41, 149-218. 

Sheatsley, P. B. 1951. The art of interviewing 
and a guide to interviewer selection and 
training. In Marie Jahoda et al. (Eds.), Re- 
search methods in social relations. New 
York: Dryden. 2, 463-492. 

Smith, H. L. 1956. Linguistic science and the 
teaching of English. Cambridge: Harvard 
Univer. Press. 

Smith, Mary K. 1941. Measurement of the 
size of general English vocabulary through 
the elementary grades and high school. 
Genet. Psychol. Monogr., 24, 311-345. 




Spiegelman, M., C. Terwilliger and F. Fearing. 
1953. The reliability of agreement in con-
tent analysis. J. soc. Psychol., 37, 175-187.

Stein, M. I. 1947. The use of a sentence-com- 
pletion test for the diagnosis of personality. 
J. clin. Psychol., 42, 320-329. 

Stolz, Lois M., et al. 1954. Father relations of 
war-born children. Stanford: Stanford Uni- 
ver. Press. 

Stone, L. J., and J. Church. 1957. Childhood 
and adolescence: A psychology of the grow- 
ing person. New York: Random House. 

Sullivan, H. S. 1954. The psychiatric inter- 
view. New York: W. W. Norton. 

Symonds, P. M., and D. H. Dietrich. 1941. 
The effects of variations in the time in- 
terval between an interview and its record- 
ing. J. abnorm. soc. Psychol., 36, 593-598. 




Vigotsky, L. S. 1939. Thought and speech.
Psychiatry, 2, No. 1, 29-51.

White, R. K. 1951. Value-analysis: The nature 
and use of the method. Soc. Psychol. Study 
soc. Issues. 

Winstell, B. 1951. The use of a controlled play
situation in determining certain effects of
maternal attitudes on children. Child De-
velpm., 22, 299-311.

Wolff, W. 1946. Personality of the preschool
child: The child’s search for his self. New
York: Grune and Stratton.

Wright, H. F., and R. G. Barker. 1950.
Methods in psychological ecology. Law-
rence: Dep. Psychol., Univer. Kansas.

Yarrow, L. J. 1948. The effects of antecedent
frustration on projective play. Psychol.
Monogr., 62, No. 6.




chapter 16


Projective 
Techniques 


William E. Henry 
The University of Chicago 


A large number of projective tech- 
niques may be used for research in 
child development. Certain of these 
were designed with children specifically 
in mind. Others, though originally de- 
veloped for adults, may still be profit- 
ably used with children. 

The techniques may be grouped in 
a variety of ways so as to highlight 
their similarities and differences. Frank 
(1939) has made one categorization 
based upon the nature of the response 
required of the subject. He proposed 
the terms constitutive, interpretative, 
cathartic, and constructive. The con- 
stitutive techniques are those that re- 
quire the subject to assign a form or 
structure to a stimulus relatively un-
structured. Thus in the Rorschach he
must name an object, that is, assign a
specific form, to a stimulus essentially
ambiguous in nature. Similarly, in the 
use of finger paints, he must create a 
picture or a design. An interpretative 
technique relies more upon the subject 
telling what a stimulus means to him. 
In the Thematic Apperception Test he
must give a meaning to a picture, as 
he does also in the Szondi when he in- 
dicates whether or not he likes certain 
photographs of persons. In the cathartic 
techniques the stress is upon the expres- 
sion of feeling and affect, as when the 
child plays with dolls or toys. The 
constructive methods require the child 
to create an order and organization by 
the manipulation of, for example, 
blocks, as when he is asked to build a 
specific unit, his own home, out of 
blocks and toys. 

Other persons have suggested some- 
what different ways of categorizing 
these instruments. Each system throws 
additional light upon various aspects 
of them. For more detailed material on 
these various classifications, the reader 
is referred to Helen Sargent (1945), to 
Campbell (1950, 1957), and to Cattell 
(1951). For our purposes here, another 
set of categories based upon the re- 
sponse process of the subject is used. 
This is the system proposed by Gardner 
Lindzey (1960). The categories are as- 
sociative, construction, completion, or- 
dering, and expressive. 

The associative techniques are those 
in which the subject responds to a 
stimulus—a word, a picture, a sound— 
with the first word, image, or thought 
that occurs to him. In the Word As- 
sociation Test, used more commonly 
with adults than with children, the sub- 
ject is presented with a list of words, 
to each of which he is to respond with 
the first word that comes to mind. In 
some cases he may be asked to give sev- 
eral words, a chain of associations to 
each stimulus word. The Rorschach is 
probably the best known of the meth- 
ods utilizing this associative form of 
response. Useful with subjects from
nursery-school age on, the Ror-
schach is a set of ten inkblots, irregular
and variegated in shape and color, to 
which the subject again responds with 
the first things that come to mind, as 
in the Word Association Test. In the 
Rorschach, however, the subject may 
give many responses to each of the ten 
cards, and in each case he identifies as 
best he can the area of the card about 
which he is talking as well as the char- 
acteristics of the section of the card 
that made him think of that particular 
response. 

The Cloud Pictures of Struve (1932) 
and William Stern (1938) are also as- 
sociative in form. They are a series of 
pictures of clouds on which the sub- 
ject is to outline objects that he per- 
ceives and identify them with some 
label or name. Also associative, but 
based upon vague sounds heard rather 
than upon words or pictures seen, are 
two forms of auditory projective tests. 
The Verbal Summator of Skinner 
(1936) presents vowel sounds on a rec- 
ord. The subject is asked to identify 
these sounds as though they were ac- 
tual words heard somewhat indistinctly. 
Another form of this idea is the Tauto- 
phone, devised primarily for adult psy- 
chiatric settings by Shakow and Rosen- 
zweig (1940). Davids and Murray 
(1955) have developed a more elaborate 
and more promising form, based upon 
patterns of sounds that are not merely 
formally meaningless but are also re- 
lated to certain personal dispositions, 
such as optimism, distrust, anxiety, 
trust, egocentricity, and so forth. 

The construction techniques are 
those that call upon the subject to 
construct a product, typically a story,
out of a picture stimulus. There is 
no particular demand here for the first 
idea that occurs to the subject. Rather 
the stress is upon the development of 
some sort of plot. This will have asso- 
ciative qualities but will also involve 
the cognitive task of organizing ideas 
in some sort of a sequence to constitute 
a story. The Thematic Apperception 
Test of Murray (1943) is probably the 
best known of this kind of method. This 
instrument consists of a set of 30 cards 
of varying content and degrees of am- 
biguity. One card is blank. The subject 
is shown each card individually and 
asked to create a story that describes 
what is going on in the picture, what 
led up to it, what the people are think-
ing and feeling, and how it will all
come out. Certain of the cards are to 
be used for all subjects, regardless of 
age and sex, and certain are specifically 
intended for boys, girls, or adult males 
or females. 

The Make-A-Picture-Story Test 
(MAPS) of Shneidman (1948, 1949, 
1952) is a close relative of the TAT 
but adds one special feature. In it the 
subject is provided with background 
scenes and a large number of human 
and animal figures from which he may 
select. Once he has chosen the figures 
he wishes to use, he then constructs 
a story, as in the TAT. Van Lennep’s 
Four Picture Test consists of four col- 
ored scenes, which are presented simul- 
taneously, rather than sequentially, to 
the subject. He must then arrange these 
four scenes in any order he chooses 
and incorporate them into a single
story (Van Lennep, 1948, 1951).

A number of modifications of the
TAT have been devised for such special
purposes or subjects as children, Ne-
groes, adolescents, interacting groups,
the family, and persons in other cul- 
tures. Modifications have also been 
made to facilitate the study of such 
specific variables as achievement, affili-
ation, anxiety, and aggression, with one set,
the Blacky Pictures (Blum, 1950), for-
mulated on the basis of a series of psy-
choanalytic variables related to stages
of psychosexual development. Some at- 
tention is also given to the analysis of 
dreams under this rubric. Although 
dreams are not usually considered a 
projective technique, they meet most 
of the basic criteria of such techniques, 
except for the obvious fact that the 
stimulus to the resulting product is not 
under the control of the examiner. 

The completion techniques present 
the subject with some incomplete situa- 
tion, which he is to complete in any 
manner he chooses consistent with the 
instructions. The Sentence Completion 
Test is the basic and best-known test 
in this form. It consists of a series of 
incomplete sentence stems, which the
subject may complete with any words 
he chooses to make a sentence. The 
particular sentence stems vary widely 
and may be made up readily by an in- 
vestigator to study one or more vari- 
ables. 

The Rosenzweig Picture Frustration
Test (Rosenzweig, 1949) is also in this
completion form and presents a series
of cartoons, all dealing with potentially
frustrating situations. The subject is
to complete each cartoon by giving the
response of the person portrayed as
frustrated. There are 24 such scenes,
each dealing with some element of
frustration.

The Sargent (1955) Test of Insight 
into Human Motives is closely related 
to both of these instruments. This test, 
adapted for children by Engle (1958),
consists of a series of armatures, or in- 
complete stories, and the subject is 
asked to tell what the central character 
does, and why, and how he feels. 

In the ordering or choice techniques 
the subject looks at a series of picture 
stimuli provided by the examiner and 
ranks or orders them according to some 
principle set by the examiner. The 
Szondi (Deri, 1949) is the best known 
of this type. It presents a series of 48 
photographs of individuals taken from 
eight psychiatric diagnostic categories. 
The subject is presented with these 
photographs in six sets of eight pic- 
tures. In each set he is to view all eight 
photographs and to select the two he
likes the best and the two he likes the least.
He is then again asked to indicate his
most liked and least liked from those
previously selected by him as liked or
disliked.

The Picture Arrangement Test of 
Tomkins and Miner (1957) provides
both ordering and completion features, 
although the primary work to date has 
gone into the ordering elements. The 
test consists of 25 cards showing car- 
toonlike figures, each involved in three 
different but related activities. The sub- 
ject orders these three in the sequence 
in which he feels the activities took 
place. In addition, he is to give a sen- 
tence for each picture descriptive of 
what is going on in it. 

The expressive techniques differ 
from those in the previous categories 
in several ways. In some of these tech-
niques the subject provides data by the
manipulation of various objects or 
equipment. Although this process fre- 
quently results in an end product (a 
painting, a drawing, a block construc- 
tion), which can be analyzed subse- 
quent to its creation by the subject, it 
is also characteristic of these methods 
that the actual process of manipulating 
the materials—whether or not they are 
recorded in some way for future ex- 
amination—is seen as relevant data. 
Thus with some forms of play tech- 
niques the subject may make several 
play constructions during an hour’s
time—rearranging, breaking them up, 
making new ones, verbalizing about 
them. This process is seen as crucial 
data about various areas of the subject’s 
life, even though only a movie camera 
and a sound recorder would provide 
a full record for subsequent examina-
tion. Similarly, the various forms of
psychodrama and role playing involve 
a process of acting out, often with other 
actors and an audience, some particular 
role or plot selected by the examiner. 
The relevant data are again principally 
the process of acting out these roles, 
and no end product is normally avail- 
able. In the case of drawings or paint- 
ings, there is normally an end product 
that may be analyzed to provide various 
assessments of the personality. Even 
here, however, there is considerable in- 
formation to be derived from observing 
the process of creating this end prod- 
uct. 

In the expressive techniques possi- 
bilities also exist of some direct expres- 
sion of attitudes and viewpoints, an 
expressive process that is often felt to
be not merely diagnostic but in some 
way therapeutic. The use of play ma- 
terials in child therapy is well known. 
Here, as often in the psychodrama, the 
central interest has been in the thera-
peutic effects of the expression of
acting-out elements rather than in the
diagnostic use of either that process 
or any possible end product. 

Various modifications of these three 
processes—drawing, playing, acting out 
—have been made with a view to maxi- 
mizing the diagnostic and personality- 
evaluation elements. In this chapter we
concern ourselves principally with three
later modifications in such forms of the
drawing process as Machover’s (1949)
system of the analysis of drawings of 
the human figure and Buck’s (1948) 
form in which the person is to draw 
a house, a tree, and a person. Each of 
these provides the subject with stand- 
ard equipment and a system for the 
analysis of the end product. Napoli 
(1951) has made important contribu- 
tions to the earlier work of Ruth Shaw 
(1934) in developing drawing and, in 
particular, finger painting as a projec- 
tive technique. 

Lois B. Murphy (1956) has made a 
notable contribution to the research 
use of projective techniques in her de- 
velopment of various “projective situa- 
tions.” These share some of the char- 
acteristics of the playing and the role 
taking in, for example, the various 
Group Play Techniques. These are 
planned and semistructured experimen- 
tal situations in which certain attributes 
of persons are explored. They include 
the Potential Leadership Game, the 
Imposed Leadership Game, and Family
Play. It should be noted that each of
these, as in role playing, has to some
extent an axis of social interaction. De-
signed to explore less interactive and
more impulse-dominated elements are
such methods as the Balloons Test, re-
lated to the performance of aggression,
and the Cold Cream and Dough Situations,
related to sensory and impulse con-
cerns. The Blocking Techniques, used
by Murphy and developed earlier by
Lerner (see Murphy, 1956), relate to
frustration. Murphy also has developed
a set of Miniature Life Toys which,
similar to the World Test of Buhler
and Kelly (1941) and Bolgar and
Fischer (1947), are a standard set of
toys used for both diagnostic and thera-
peutic purposes.

This system does not subdivide all 
projective techniques useful with chil- 
dren into completely independent cate- 
gories. It serves, however, to call at- 
tention to the range and variety of 
these methods and to suggest certain 
general principles of procedure in- 
volved in them. In the sections that fol- 
low certain broader previews related 
to the development, definition, and re- 
search use of these methods are dis- 
cussed, and each of the methods briefly 
mentioned here is dealt with in greater 
detail. 


Conceptual Background of
Projective Techniques

The projective techniques of per-
sonality study have entered into psy-
chological and social sciences research
and appear destined to remain there as
integral parts of the body of techniques 
and concepts now characteristic of the 
field. Undoubtedly a wide variety of 
influences has been responsible for the 
rapid growth of the use of these tech- 
niques in the last twenty or more years. 
The greatest single influence perhaps 
has been the development and spread 
of psychoanalytic theory among per- 
sons concerned both with research and 
direct clinical work. Even though many 
persons dealing with research related 
to children have not adopted all aspects 
of psychoanalytic theory, the great 
vigor and apparent relevance of psy- 
choanalytic theses have influenced to 
some degree almost all elements of per- 
sonality theory. The projective tech- 
niques have seemed particularly relevant 
to an analytically oriented personality 
theory, in part because of their pur- 
ported relevance to the delineation of 
those elements of the person of which 
he was consciously unaware. The in- 
vestigator who was prepared to accept 
a logic of unconscious motivations 
found himself drawn to techniques that
seemed to have an affinity to just these
motivations. The data of the projective 
instrument also bear considerable simi- 
larity to the dreams, associations, and 
clinical interviews within the frame- 
work of which psychoanalysis itself 
developed. 

A second determinant of the in- 
creased concern with projective tech- 
niques has been the emphasis upon the 
concepts of holism. The holistic em- 
phasis has been upon study of the full 
context of behavior and on viewing the 
actions of individuals in terms of their 
essential unity and interrelatedness. The
investigator who has accepted some ele- 
ments of this viewpoint, and who feels 
the need to relate each item of behavior 
to other items, finds it increasingly 
difficult to limit himself to the tradi- 
tional objective instruments for per- 
sonality measurement. The projective 
techniques provide the investigator with 
a rationale for implementing an inter-
est in holistic theories. They also
adapt themselves to the description
and study of a wide variety of per- 
sonality characteristics and avoid the 
necessity of studying, in relative isola- 
tion, a small number of specific pre- 
determined dimensions. 

A general growth of interest in in- 
strumentation, stemming from the rise 
of logical positivism, with its attendant 
stress upon operationalism, has also en- 
couraged the development of tech- 
niques in all spheres, including the pro- 
jective. 

As researchers, as well as society at 
large, became aware of and receptive 
to the idea that human behavior is 
complexly determined, a wide range of 
new problem areas, or new approaches 
to old problems, came into being. 
Among these have been studies of the 
relation of the personality to the social 
environment, of personality commu- 
nalities in persons holding membership 
in similar social groups, in the process 
of socialization, in the relation of in- 
fant and child behavior to subsequent 
adult behavior, in the study of persons 
as consumers, and in the study of the 
personality correlates of social atti-
tudes, perception, learning, and occupa-
tional choices. In each of these areas,
and in others, the projective methods
have increasingly been used. 


The Nature of a Projective 
Instrument 


It has already been suggested that a
projective instrument is one whose
users find it adaptable to a particular
theoretical viewpoint. This viewpoint
is one which, on the theoretical side,
contains substantial portions of psycho-
dynamic concepts, even if not always
specifically psychoanalytic, and utilizes
a holistic approach to the study of be-
havior. On the methodological side, the
projective techniques have appealed to
persons who choose to implement their
psychodynamic and holistic interests by 
attention to problems of instrumenta- 
tion and the operations of data collec- 
tion. We can define the projective in- 
struments, however, somewhat more 
specifically than by this reference to 
their conceptual context. The term 
projective technique first appeared in 
a publication of Ruth Horowitz and 
Lois B. Murphy (1938). Here they give 
credit for the origination of the term 
to Lawrence K. Frank, whose article 
on these methods, appearing in 1939,
gave great impetus and definition to 
the field. In this article, Frank described 
his view of these techniques: 


We may approach the personality
and induce the individual to reveal his 
way of organizing experience by giving 
him a field (objects, materials, experi- 
ences) with relatively little structure and 
cultural patterning so that the personality 
can project upon that plastic field his way 
of seeing life, his meanings, significances,
patterns, and especially his feelings. Thus
we elicit a projection of the individual 
personality’s private world because he has 
to organize the field, interpret the material 
and react affectively to it. More specifi- 
cally, a projective method for the study 
of personality involves the presentation of 
a stimulus-situation designed or chosen be- 
cause it will mean to the subject, not what 
the experimenter has arbitrarily decided 
it should mean (as in most psychological 
experiments using standardized stimuli in 
order to be “objective”), but rather he 
imposes upon it, his private, idiosyncratic 
meaning and organization. The subject 
then will respond to his meaning of the 
presented stimulus-situation by some form 
of action and feeling that is expressive of 
his personality (1939, pp. 402-403).

Subsequent to this initial definition, 
other writers have defined the pro- 
jective instruments—always with over- 
lapping emphasis but sometimes choos- 
ing to stress one point more than 
another or noting additional character- 
istics in no way contradictory to 
Frank’s statement. Following Gardner 
Lindzey’s excellent summary and analy- 
sis (Lindzey, 1960) of these various 
definitions, we may point to several 
more crucial characteristics of this class 
of instrument. These features may lie 
in the characteristics of responses elic- 
ited from the subject, in the nature of 
the material presented to him for re- 
sponse, or in the attitude taken by the 
experimenter in his subsequent treat- 
ment of those responses. 

With respect to the characteristics of 
response by the subject, the projective 
instruments are notable for several fea- 
tures. 

Sensitivity to unconscious or latent 
aspects of personality. This refers, of 
course, to that realm of covert, latent,
unconscious components of personality 
that Frank identifies as his “private 
world.” This would appear to be the 
central purpose of the projective in- 
struments, regardless of the particular 
material presented to the subject or 
the mode of overt response called for 
from him. 

Multiplicity of response permitted 
the subject. In contrast to instruments 
requiring specific and limited responses 
from the subject (the questionnaire 
“yes,” “no,” or “?” or a simple choice 
from a list of four or five possible re- 
sponses), the subject in the projective 
experiment is permitted and encouraged 
to respond with a wide range of ideas 
and/or actions. It is of further impor- 
tance that this wide variety of response 
also stems from the subject’s own im- 
agination and ideas, in terms appropri- 
ate to his own “private, idiosyncratic 
meaning and organization.” 

The lack of awareness of the subject 
of the purpose of the test. Although 
the subject may be quite aware of the 
very general interest of the investiga- 
tor—that his “personality is being 
studied,” “his imagination tested”—in 
none of the projective techniques as 
ordinarily used is he directly aware of 
the details of this goal. Specifically, he 
is not informed of any expectation of 
the investigator of any particular mode 
of response, and, most importantly, he 
is not aware of the variables subse- 
quently to be used in the analysis of 
his responses. This is in contrast to the 
majority of objective scales and ques- 
tionnaires in which the dimension at 
issue is either clearly stated or may be 
easily guessed by the subject and in
which, by the exercise of a little im- 
agination, the subject may direct his 
response to influence subsequent inter- 
pretation. 

The amount of response is substan- 
tial in quantity and complex in quality. 
In contrast to the objective techniques, 
the response of the subject is usually 
voluminous, rich, varied. 

The response is of a fantasy character 
and structured by no formal rules of 
“right or wrong” set by the examiner. 
Most projective techniques call for a 
response stemming entirely from the 
subject—without a “guessing” frame- 
work as to the “real” answer, presum- 
ably known by the examiner. The sub- 
ject is, of course, given some instruc- 
tion: “Tell a story,” “Make up an end- 
ing,” “Tell me what you see,” and so 
forth, but emphasis is laid on the re- 
sponse being “just whatever occurs” to 
the subject. 

With respect to the analysis of the 
responses, there are several elements of 
importance.

Multiplicity of concepts of analysis. 
The very richness and elaborateness of 
responses to most projective techniques, 
and the fact that so few guidelines are 
provided to the subject in responding, 
tend to result in a kind of data to which 
a wide range of personality concepts 
may be applied. Thus the projective 
data may generally be analyzed in terms 
of many concepts applied to them sub- 
sequent to securing the response. A 
TAT story may be analyzed with re- 
spect to the degree of cognitive or- 
ganization and structure, with respect 
to the images of adult figures presented, 
or with respect to the degree of 
achievement and ambition displayed. A
single Rorschach response, containing, 
for example, both color and form de- 
terminants, may reflect upon the extro- 
vertive aspects of the person, upon the 
degree of his organizing qualities, as 
well as upon the degree of commonness 
or uniqueness of his thought processes. 

Holistic analysis. As suggested ear-
lier, investigators preferring a more
holistic view of man have been promi-
nent among those utilizing the projec-
tive instruments. The attraction for
them of these techniques lies, of course,
in the very fact that the completeness
of data and the multiplicity of concepts 
that can be applied to them permit and 
encourage the investigator to examine 
the complexities of the individual and 
to see the various interrelations of one 
element of the personality to others. 

With respect to the physical charac-
ters of the test-stimulus presented to
the subject, the projective techniques 
differ from many other techniques. 
The various forms of the projective 
instruments are portrayed in a subse- 
quent section. Here it is indicated 
merely that they range widely over the 
realm of “objects, materials, experi- 
ences” to which Frank referred. Among 
the objects and materials, mention may 
be made of such diverse stimuli as ink 
blots, jars of cold cream, incomplete 
sentences, miniature toys representative 
of people, places, and things, photo- 
graphs of persons from various noso- 
logical categories, cartoons represent- 
ing various frustrating circumstances, 
collections of paints, paper, and color- 
ing pencils, and so on. Among the “ex- 
periences” mentioned by Frank might 
be included such things as the instruc- 
tion to act out certain roles, as in the
psychodrama, and the presentation of 
two toy trains with the comment by 
the examiner that they are on the same 
track and cannot pass, “What will hap- 
pen?” Perhaps the balloon-situation de- 
scribed by Stone in Murphy (1956), in 
which it is possibly the license to 
“pop” them as much as the balloons 
themselves that constitutes the stimulus 
to projection, should also be in- 
cluded. 

Usually, the projective instrument is 
seen as one, as Frank comments, of 
“relatively little structure and cultural 
patterning.” This is indeed true of
many of the projective stimuli and 
perhaps is truer of them generally than 
of the objective personality instru- 
ments. This characteristic of “ambi- 
guity” or “lack of structure,” however 
appropriate it may be when comparing 
the projective instruments with other 
personality methods, does not lie so 
much in the stimuli themselves as in the 
context and setting in which they are 
used. Many of the stimuli used in vari- 
ous forms of miniature toys are in 
themselves perfectly well structured 
and even culturally patterned—the toys 
are familiar enough to most children; 
dogs are recognizable as such. Even the 
houses are culturally patterned, at least 
for American children, in that they are 
clearly not Swiss chalets nor Navaho 
hogans. The automobiles are not nec- 
essarily recognizable as Fords, Chevro- 
lets, etc., but they are familiar and are 
again clearly not Simcas, Volvos, or 
Fiats. Although the Rorschach ink blots 
are not culturally patterned in the sense 
that they represent reality-objects in 
common use, they are perfectly identi-
fiable as ink blots. The Szondi pictures are certainly “strange”
in that they appear “old-fashioned.” 
They are as stimuli hardly unstruc- 
tured and ambiguous—they are old 
photographs. 

It is true that some of the Rorschach 
cards, even though recognizable as ink 
blots, are more complex than others 
and harder to deal with and some of 
the Thematic Apperception Technique 
pictures are similarly stranger, more 
complicated, and harder to identify 
than others. 

It is when the subject attempts to 
implement the instructions given with 
the projective stimuli that the issue of 
ambiguity becomes most relevant. An 
ink blot is still an ink blot and remains 
perfectly unambiguous until the sub- 
ject accepts the condition, “Imagine 
what else it might be”; the boy with 
the violin in the TAT is perfectly 
straightforward and quite highly struc- 
tured until the subject concedes he will 
try to “tell a story” about it; even a pile 
of paints, paper, and crayons is unam- 
biguous until the subject applies his 
efforts to creating an imaginary situa- 
tion, as he does when he paints a pic- 
ture, tells a story, or “sees” a dancing 
bear. The ambiguity lies in the fact 
that there are no, or very few, guide- 
lines provided for him when he pro- 
ceeds to create this imaginary situation 
or product. Ambiguity may be only a 
relative term and not sufficiently de- 
scriptive of the projective stimuli per 
se, but it is a most appropriate charac- 
teristic when we consider that the re- 
sponse made stems from the total con- 
text of the technique itself and the task 
set for the subject. 


Psychological Theory and
Projective Techniques 


It has already been suggested that one 
of the notable characteristics of the pro- 
jective techniques is the wide range and 
variety of concepts to which they have 
relevance. In the next section, which 
deals with a number of specific tech- 
niques, attention is called to the con- 
cepts customarily used in connection 
with each instrument. As a general 
background, however, it may be useful 
to discuss briefly some ways in which 
psychological theory relates to the pro-
jective techniques.

The central question is, “In what 
ways do the projective techniques re- 
late to general psychological theory?” 
Here one is concerned with the ways 
in which general personality theories, 
or general theories of behavior, serve 
as explanatory systems for the behavior 
of the individual during the process of 
utilizing the stimuli of the technique 
and for the resultant responses. This 
concern reflects the fact that the pro- 
jective techniques and their interpre- 
tive systems appear to have been de- 
veloped with only minimal explicit re- 
lation to theories aimed at accounting 
for realms of behavior broader than, 
but presumably including, the projec- 
tive situation. This neglect stems not 
from the view that this area of theory 
is unimportant but rather from the fact 
that the efforts in this direction are few 
in number and only in their embryonic 
stage. 

It may be recalled that the projective 
techniques were developed within a 
clinic-oriented setting and by persons 
influenced, if not entirely absorbed, by
psychoanalytic theory. The Rorschach,
in particular, was presented initially by
Hermann Rorschach (1942), himself
greatly interested in psychoanalytic
theory. His monograph makes occa-
sional reference to this body of theory.
His posthumous publication of a single
case psychoanalyzed by Oberholzer
(Rorschach and Oberholzer, 1942)
deals more explicitly with the relation
of this theory to the Rorschach. In
more recent years, Schafer has con-
cerned himself with an explicit concern
for psychoanalytic theory in one book
(1954a) and one article (1954b). Simi-
larly Rapaport (1950, 1952) has dealt
with this topic in several influential
publications. The reader may also be
referred to Holt (1954) for a general 
discussion of the relation of personality 
theory to the Rorschach and to Klopfer 
(1954) for a similar consideration of its 
relation to Jungian psychoanalytic psy- 
chology. An excellent discussion and 
review of attempts to relate various 
psychological theories, including the 
psychoanalytic, to projective testing 
may also be found in Lindzey (1960). 

Lindzey’s summary statement pre- 
sents the present state of affairs in this 
area. 


In general, it appears that projective 
techniques and psychoanalytic theory pos- 
sess many congruent elements. A consid- 
erable number of psychoanalytic concepts 
display real utility in accounting for, or 
making rational, practices found among 
projective users. This, of course, is scarcely 
remarkable in view of the extent to which 
these practices have been established by 
individuals influenced by psychoanalytic 
theory. Less encouraging are the steps that 
have been taken toward a careful and pre-
cise specification of the detailed relation-
ship between psychoanalytic theory and
projective techniques. Several interesting
and encouraging attempts have been made 
to demonstrate the utility of some portion 
of psychoanalytic theory in the under- 
standing or interpretation of some seg- 
ment of projective techniques. However, 
it seems clear that a rational and carefully 
specified bridge from psychoanalytic 
theory to projective techniques remains
a hope for the future—it is definitely not
an accomplishment of the past (1960, p.
22 

Although the psychoanalytic body 
of concepts would appear, as a single 
theoretical stance, to have the most 
relevance for the projective techniques 
at present, at least two other bodies of 
theory have interested various investi- 
gators. These are perception theory and 
stimulus-response theory. In the in- 
stance of perception theory, the con- 
necting links have been the obvious 
fact that the response to the projective 
instrument involves some elements of 
perception and that body of research 
relating perception to personality and 
motivational variables. The effort here 
has been similar to that in the psycho- 
analytic area—to explore the extent to 
which projective data can be explained 
in perception theory terms and, via that 
mechanism, be related to the broad 
body of general psychological theory 
in a more integral way. Abt (1950) has 
made an effort in this direction, includ- 
ing attention to holistic theories but 
relying heavily upon perception. Erik- 
sen (1951, 1954) and Eriksen and 
Lazarus (1952) have undertaken in- 
vestigations aimed at demonstrating the 
pertinence of perception theory to the 
projectives. Bruner (1948) sees in per- 
ception theory a possible mechanism of
overcoming the primarily empirically 
based development of the Rorschach. 
All of these authors point to the rele- 
vance of various aspects of perception 
theory to projective interpretation and 
to a possible pathway for demonstrat- 
ing the continuity of these instruments 
with general theory. 

Similar efforts have been made to 
integrate projective instruments with 
stimulus-response theory. Auld (1954) 
has, with the TAT, shown certain cor- 
respondence between some concepts of 
stimulus-response theory and projective 
data. Goss and Brownell (1957) have 
applied S-R concepts to elements of the 
projective stimuli and the interpretive
process. Certain elements of the work
of McClelland, Atkinson, Clark, and 
Lowell (1953) stem from S-R theory. 

In each of these efforts to relate the 
projective field to one or another gen- 
eral theory—psychoanalytic, percep- 
tion, stimulus-response—the major con- 
tribution might be said to be to the 
theory itself rather than to the field of 
projective testing. Individual experi- 
menters with the projective must still 
rely upon some other branch of theory 
in the direct task of classifying or 
interpreting responses. However, by 
showing that certain concepts of these 
theories have relevance for the projec- 
tive stimuli or data, they do throw new 
light upon these phenomena. In so do- 
ing, they extend our knowledge of the 
meaning of the stimulus situation and 
of the elements of response in relation 
to their interpretation in some broader 
terms. Some of these efforts have pro- 
vided direct guides to the interpreta- 
tion of projective response. More have 
been limited to demonstrating that the
projectives may be seen as a subclass 
of behavior explainable in part by the 
theory in question. Although all of 
these are admittedly tentative first ef- 
forts, they show promise and serve as 
important steps to the end of exploring 
the relation of projective test behavior 
to broader concepts and theories. 

It is also possible to relate certain 
specific scores or interpretive concepts 
used in projectives to personality con- 
cepts, if not to such broad realms of 
theory as suggested above. These 
studies of smaller issues have an imme- 
diacy and direct relevance to the use 
researchers may make of the instru- 
ments. The majority of these studies 
have had the Rorschach as their focus. 
This may stem in part from the Ror- 
schach’s longer history. In part it may 
stem from the fact that, more than 
most projectives, the Rorschach has 
been developed with a limited range 
of specific response scores, each of 
which, alone or in specified combina- 
tions, has a stated meaning in person- 
ality terms. It becomes therefore con- 
siderably more manageable to inquire 
into the correlates of these scoring cate- 
gories and to examine the meaning 
given to them by a priori statement in 
clinical judgment. 

Examples of these efforts are found 
predominantly in the examination of 
the Rorschach movement response. This 
has been a logical choice in that most 
experts in this field agree on the cen- 
trality of this category of response to 
the Rorschach interpretive system. The 
initial statement of Rorschach (1942) 
about the M response dealt with per-
sonality and behavioral correlates. Vari-
ous workers in the Rorschach have ex-
tended or modified these notions, but
few have attempted a theoretical ra- 
tionale for such relationships as empiri- 
cal clinical notations seem to indicate. 
Schachtel (1950) has paid considerable 
attention to this issue, in the M and 
other responses, seeing the M’s as rep-
resenting a direct projection of kines-
thetic sensations that are intimately
connected with the deepest layers of 
personality. Piotrowski (1957) noted a 
greater direct relation between the M 
and certain overt tendencies. Thus he 
comments that the M response reflects 
“. . . a steering mechanism which di-
rects the individual to play certain
definite roles in those interhuman rela-
tionships that are vital to him.” The
M responses for Piotrowski are “acces- 
sible to direct observation in the exter- 
nal motor behavior of the subject . . .” 
Both Schachtel and Piotrowski have a 
great deal more to say about such re- 
sponses, but for our purpose here they 
are notable in their effort to provide a 
theoretical rationale as well as empirical 
correlates. Rapaport (1950, 1952) and 
Werner (1945) have also made specific 
statements of the theoretical meaning 
of these responses, and Singer and his 
associates (Singer and Spohn, 1954; 
Singer, Meltzoff, and Goldman, 1952; 
Meltzoff, Singer, and Korchin, 1953) 
have reported experiments based upon 
various theoretical formulations. 

In the case of the TAT, studies in 
this area have been largely in the gen- 
eral relation of fantasy to overt expres- 
sion, in part a concern with issues of 
interpretation and in part an effort to 
explain the relation of TAT responses
to other personality concepts. Sanford, 
et al. (Sanford, Adkins, Miller, and 
Cobb, 1943), have reported low posi- 
tive correlation between aggression as 
seen in TAT responses and measures 
of overt aggression. Mussen and Naylor 
(1954) have also studied this relation- 
ship, adding a more specific attention 
to the social-class background of sub- 
jects and to the anticipation of punish- 
ment for aggressive acts to be expected 
in this social-class group. Kagan (1956) 
continued this trend of thought, intro- 
ducing the issue of anxiety and subdi- 
viding aggression into types. Other per- 
sons have approached this issue in dif- 
ferent ways, in each instance, however, 
attempting to introduce experimental 
variables related theoretically to the 
proposed explanation of various rela- 
tions between fantasy and overt expres- 
sion. 

In including these brief statements of 
ways in which theory and projective 
data have been related, the object has 
been only to call attention to the fact 
that the relation of the projective tech- 
niques to systematic behavior theory is 
indeed a tenuous one. This does not 
diminish the very substantial research 
on the projectives in which various em- 
pirical relations have been observed, 
but it does emphasize the continuing 
need to undertake investigations that 
stem from a theoretical position. Such 
studies will serve the objective of 
strengthening the knowledge of the 
meaning of projective data as well as 
the goal of lending a more substantial 
rationale to observed empirical rela- 
tions. 


ASSOCIATIVE TECHNIQUES


The associative techniques present a 
stimulus to the subject and require him 
to respond with the first thought that 
comes to mind. The principal technique
here is the Rorschach, in which plates 
of inkblots are presented and the sub- 
ject is asked to report what he sees in 
them. Other forms present words to 
which associations are requested. 

The Rorschach Test consists of ten 
plates or cards upon which are repro-
duced symmetrical ink blots. These are
such as might be produced by fold-
ing a piece of paper over a drop of ink.
Five are gray and black on a white 
background, two others have some 
colored portions, and the remaining 
three are almost entirely colored. They 
are presented to all subjects in the same 
order, with instructions to report what 
they see in the blots. Emphasis is usu- 
ally laid upon the suggestion that there 
are no expected right answers, but that 
whatever the subject sees is what is 
wanted. The examiner intervenes in 
this process to a minimal degree, merely 
recording the responses verbatim and 
passing the subject each succeeding 
card as he appears to be finished with 
the preceding. The elapsed time be- 
tween the initial presentation of each 
card and the subject’s first response is 
also recorded. Once the subject has
given his responses to each of the ten
cards, which may range from about
15 responses for the total set of cards
in younger children (and subjects in
their 70’s) to 25 to 40 for adolescents and
adults, the subject is asked to re-examine
each of his responses and to tell the
examiner more about the factors con-
tributing to his response. In this phase, usually
called the inquiry, the examiner’s role 
becomes somewhat more elaborate. 
Here he must secure from the subject 
information permitting a scoring on 
location, determinant, and content. Lo- 
cation is scored in terms of the area of 
the blot used by the subject when he 
identifies what he sees. Commonly, a 
response is scored for location in terms 
of its reference to the whole blot (W) 
or a detail of it. The detail responses 
may be usual details (D), small details 
(Dd), or rare or unusual details (Dr). 
Use of the white spaces is scored sepa- 
rately (S). The location scoring is seen 
as answering the question, What part of 
the total stimulus did the subject use 
in his response? The determinant scor- 
ing is designed to answer the question, 
What element in the stimulus prompted 
the response? Here a common pattern
is to score F if the subject indicates it 
was primarily the form or shape of the 
particular blot area; M for movement 
if seen in human figures, FM for move-
ment in animal figures, and m for in-
animate figures. Responses may also be
determined by shading or texture in the 
grayish blot areas or by the use of 
color. Certain combinations of deter- 
minants are also common. Thus a re- 
sponse that appears to be based both 
on form and color may be scored FC 
if the form appears to come first or CF 
if the color predominates. Content is 
normally scored for its originality (O) 
or popularity (P) among responses and 
for some further breakdown of content 
such as the categories human (H), hu-
man detail (Hd), animal (A), animal
detail (Ad), nature (N), and so on. 
The total responses (R) and the reac- 
tion times to cards also figure in the 
scoring. 
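
The three-part scoring described above lends itself to a simple record
structure. What follows is only a minimal sketch in Python, not a published
scoring program. The class name ScoredResponse, the function tally, and the
three example responses are hypothetical and are introduced solely to show
how location, determinant, and content scores might be recorded and counted.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class ScoredResponse:
    """One response after the inquiry (hypothetical record layout)."""
    card: int                # plate number, 1 through 10
    location: str            # "W", "D", "Dd", "Dr", or "S"
    determinant: str         # e.g. "F", "M", "FM", "m", "FC", "CF"
    content: str             # e.g. "H", "Hd", "A", "Ad", "N"
    popular: bool = False    # scored P
    original: bool = False   # scored O


def tally(responses):
    """Count how often each scoring symbol occurs in one record."""
    return {
        "R": len(responses),  # total number of responses
        "location": Counter(r.location for r in responses),
        "determinant": Counter(r.determinant for r in responses),
        "content": Counter(r.content for r in responses),
        "P": sum(1 for r in responses if r.popular),
        "O": sum(1 for r in responses if r.original),
    }


# Illustrative record only; these are not data from any subject.
example = [
    ScoredResponse(card=1, location="W", determinant="F", content="A", popular=True),
    ScoredResponse(card=2, location="D", determinant="M", content="H"),
    ScoredResponse(card=2, location="D", determinant="FC", content="Hd"),
]
summary = tally(example)
print(summary["R"], dict(summary["location"]))
```

Keeping one record per response, rather than running totals alone, preserves
the order and sequence of responses, which some interpretive systems also
consider.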

This statement of scores is only a 
general one indicative of many of the 
basic issues thought relevant to the 
categorization of responses. Different 
specialists elaborate this system in vari- 
ous ways, which they consider to re- 
flect most appropriately the psycho- 
logical processes influential in produc- 
ing the responses in question. We may 
add to this scoring statement various 
ratios of responses—the proportion of 
M to C, the percentage of F, the ratio 
of human to animal response, etc. These
scores and ratios are then summated 
into a psychogram that serves as a de- 
vice for bringing the total pattern of 
scores together for review in graphic 
presentation. 
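
The ratios just mentioned are simple arithmetic on the tallied scores. The
sketch below, again in Python, is offered under stated assumptions rather
than as any specialist's procedure: the function name summary_ratios is
hypothetical, color responses are counted without weighting (FC, CF, and C
each count once, although some scoring systems weight them differently),
"human" is taken to include H and Hd, and "animal" to include A and Ad. The
example scores are invented for illustration.

```python
from collections import Counter

COLOR = ("FC", "CF", "C")    # assumption: unweighted color determinants
HUMAN = ("H", "Hd")          # assumption: human plus human detail
ANIMAL = ("A", "Ad")         # assumption: animal plus animal detail


def summary_ratios(determinants, contents):
    """Return R, the percentage of F, and the M:C and H:A ratios.

    Ratios are returned as (left, right) count pairs so that a zero
    on either side never causes a division error.
    """
    r = len(determinants)
    if r == 0 or len(contents) != r:
        raise ValueError("need one determinant and one content per response")
    det = Counter(determinants)
    con = Counter(contents)
    return {
        "R": r,
        "F%": 100.0 * det["F"] / r,
        "M:C": (det["M"], sum(det[d] for d in COLOR)),
        "H:A": (sum(con[c] for c in HUMAN), sum(con[c] for c in ANIMAL)),
    }


# Illustrative scores only; these are not data from any subject.
determinants = ["F", "F", "M", "FC", "CF", "F", "FM", "M"]
contents = ["A", "A", "H", "N", "N", "Ad", "A", "Hd"]
ratios = summary_ratios(determinants, contents)
print(ratios)
```

Values of this kind are what a psychogram would then display graphically.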

Once the scores are summated in this 
fashion, the interpretive task begins. 
This is based upon certain assumptions 
and empirical findings regarding the re- 
lation of the scoring patterns to per- 
sonality variables. The publications of 
specialists, which are suggested below, 
must be studied for a full picture of 
the quite complex process of interpre- 
tation. A very rough review may be 
given here, however, of the general 
order of concepts used in this process. 
As summarized by Beck (1951), the 
three central areas of personality dealt 
with by the Rorschach are intellectual 
activity, the externalized emotions, and 
the inner emotional life. In the area of 
the intellective activity the perception 
of form (F) is seen as a crucial indica- 
tion. The perception of certain amounts
of good form is seen as a measure of 
clearness of perception. At the same 
time, extreme amounts of form re- 
sponses in, for example, depressive 
states are representative of rigid, obses- 
sive caution, whereas very low form 
suggests unstable emotional conditions 
and clouded perception. In the area of 
the location scores (W, D, Dd, etc.) 
also lie indicators of the intellectual life, 
particularly in the perception of W 
responses, in the order and sequence of 
W and detail responses, and in the de- 
grees of organization, which Beck calls 
Z. The originality versus the com-
monness of response is also relevant, as
estimates of the originality of thought 
processes. Animal responses, very easily 
seen in the cards, suggest a lack of 
richness of mental process. 

In the area of the externalized emo- 
tional life, the use of color and of shad- 
ing in response is seen as crucial. Color 
represents a mechanism for projecting 
the exciting, usually positive, external- 
ized feeling. Within the varieties of 
color responses, the purer responses— 
those involving minimal form—suggest 
the more egocentric and self-gratifying 
orientations. Color well mixed with
form—the FC—suggests the socially 
tempered person able to integrate his
own desires with those of other per- 
sons. The shading responses, in which 
the light and darkness elements of the 
blot are used, suggest more painful 
emotion, anxiety, depressed feelings, 
the sense of inadequacy. Use of the 
white space appears to relate to stub- 
bornness, hostility. 

The inner life itself appears most 
centrally reflected in the movement re- 
sponses seen as portrayals of strongly
felt wish references. The M is thus the 
internalized element of emotion as the 
C is the externalized. 

Studies of the stability and validity 
of the Rorschach have resulted in en- 
couraging if not definitive findings. 
The approaches to these problems have 
been varied. They may be divided, fol- 
lowing Rabin (1951a), into molar, mo- 
lecular, and experimental studies. The 
molar approach is essentially the one in 
which the complete interpretation of 
the individual record is compared with 
some comparable independent data, the 
psychiatrist's report, therapy records, 
and life history. These studies show 
positive correlation, suggesting that, in 
the hands of skilled interpreters, diag- 
nostic validity can indeed be demon- 
strated. They normally utilize forms of 
comparable global criteria, such as psy- 
chiatric diagnosis, which are themselves 
open to question. It seems unquestion-
able that the Rorschach is a useful tool
in clinical diagnosis, even though its 
validity regarding the prediction of 
personality structuring and dynamics is 
less clearly demonstrated. 

Molecular studies attempt to investi- 
gate the validity of specific Rorschach 
variables, or combinations of them, nor- 
mally against some hypothetically re- 
lated behavior in another domain. Thus 
color response has been shown to cor- 
relate with the use of color in drawings. 

Experimental studies attempt to cre- 
ate a situation of some specified effect 
upon the subject in terms of which 
particular Rorschach responses would 
be expected to change. Thus hypnot- 
ically induced mood changes produce 
certain patterns of scores hypothesized
in relation to those moods. Sodium
amytal, normally producing cooperativeness
and friendliness, does seem to
reduce the number of card rejections.
Other forms of experimental studies
alter the stimulus itself (achromatic
cards) or change the method of
administration (reversed order of
presentation). Studies of the correlates
of Rorschach patterns in somatic and
psychological spheres
have also been undertaken. All of these
experimental studies add considerably 
to knowledge of the validity of re- 
sponses and interpretive possibilities. In 
total, however, they are by no means 
definitive. They suggest that much 
work is yet to be done to establish the 
general usefulness of the instrument 
and, more importantly, perhaps, to de- 
termine the extent to which specific 
scores and indices are indeed reflective 
of the psychological processes proposed 
by various investigators. 

The Rorschach has been used princi- 
pally with disturbed adults, and it is to 
this setting that the majority of the 
comments above apply. The use with 
children has been less extensive but still 
follows the same basic principles. The 
age of 3 seems frequently referred to 
as the lower age limit, though use has 
been made profitably with younger 
children of advanced intellectual de- 
velopment. The central issue regarding 
age appears to be the question of the 
stage of intellectual development or, 
more centrally, the development of 
perception. The use of the instrument 
with adolescents and adults assumes 
fully developed intellectual and perceptual
equipment. It is upon this premise
that the meaning of scores is based.
If, however, children have less well-
developed perceptual processes, as seems
clear, then to what extent may we
assume the same interpretive principles?
This dilemma is accentuated by the
fact that it is often difficult to secure 
from children full inquiry information 
vital to the assigning of scores in the 
usual fashion. It is for these two reasons,
limited inquiry and incomplete
development, that Rorschachs of children
below school age should be interpreted
with caution and their administration
undertaken with special attention
to the communicativeness of the
instructions. A central problem of 
scoring, and subsequent interpretation, 
stems from the inadequacy of inquiry 
information, particularly in children 
under 4. The issue is, should one score 
exclusively on the basis of information 
received or should one guess? The 
“guess” is, of course, based on the usual 
determinants of comparable responses 
when given by older subjects. Beyond 
a general adaptation of the language 
of the examiner to the age of the child, 
examiners differ somewhat as to how 
much variation there should be from the
method of administration used with
older subjects. A common method
seems to be to modify the procedure 
only with an eye to the establishment 
of adequate rapport—appropriate, of 
course, for all subjects—and to assume 
that reduced inquiry specificity repre- 
sents not a failure of administration but 
the true state of the mental processes 
being studied. 

These issues are excellently reviewed 
by Klopfer (1956) in the second volume
of Developments in the Rorschach
Technique. In particular, the reader
should examine Chapter 1 on problems 
of administration, scoring, and method 
by Klopfer, Fox, and Troup; Chapter 
2 on interpretation by Klopfer, Spiegel- 
man, and Fox; Chapter 4 on the sig- 
nificance of age patterns by Fox; and 
Chapter 5 on the study, with Rorschach 
and other methods, of the development 
of perception in children by Meili- 
Dworetzki. Additional references on 
these topics will be found following 
each of these chapters. A new report 
of norms for children 6 through 11 will 
be found in Ledwith (1959). 

The child’s reaction to the Rorschach 
situation is discussed and illustrated by 
Anna Hartoch (1956). In her discussion 
particular stress is laid upon the inter- 
action of tester and child during the 
process of securing the record and upon 
the utilization of this full range of data 
in the interpretation of the resulting 
record.

Since the use of the Rorschach with 
children is a special case of the use of 
this test, the reader should also examine 
the more general Rorschach literature. 
The original report on this instrument
is Hermann Rorschach’s Psychodiagnostics:
A Diagnostic Test Based
on Perception (1942). Beck’s three volumes
(1944, 1945, 1952), Klopfer and
Kelley (1942), and Klopfer, Ainsworth,
Klopfer, and Holt (1954) describe the 
central principles and practices. Rabin’s 
chapter (1951a) in Anderson and An- 
derson’s Introduction to Projective 
Techniques provides a statement and 
additional references on the issue of 
validity. A review of other uses of the
Rorschach less related to children
(alternative forms used to study specific
variables and special indices for limited
purposes) may be found in Gardner
Lindzey (1960).


OTHER ASSOCIATIVE 
TECHNIQUES 


The most useful of the associative 
techniques, and the one in this group 
about which the greatest information 
can be found in research and accumu- 
lated experience, is indeed the Ror- 
schach. Nonetheless, the basic model 
for the associative methods is perhaps 
the Word Association Test. In this test 
a list of stimulus words is presented, 
usually verbally and one at a time, to 
which the subject is asked to respond 
with the first word that occurs to him. 
The time elapsing between hearing the 
stimulus word and the response is normally
recorded. In some uses the subject
may also be asked to give other
words that occur to him, a chain of 
associations initiated by the original 
stimulus word. Various lists of stand- 
ard words exist, particularly the early 
list of 100 words of Kent and Rosanoff 
(1910) and the more recent list of 60 
by Rapaport, Gill, and Schafer (1946). 
In Rapaport et al. a statement of the 
principles of analysis and interpretation 
of these responses and an important 
new statement of a theoretical rationale 
for the instrument will also be found. 

In general, the assumption of analysis 
is based upon the logic that particular 
patterns of response-words and either 
very long or slow reaction times are 
indicative of psychological disturbance. 
The variable of neurotic disturbance is
the principal one investigated, although
the choice of a different set of words 
would permit study of other, and less 
general, concepts. The development of 
a special set of words—appropriate to 
the vocabulary and life circumstance of 
children—would probably be necessary 
for its use with other than adult groups. 
It would be necessary, even with such 
a new list, to administer this list to 
substantial groups of children of vari- 
ous characteristics to gain a background 
of expected responses and to experi- 
ment with administration appropriate 
to the verbal skills of children. Such a 
list would have to be composed to in- 
vestigate some specific predetermined 
variables for maximal usefulness. In 
nearly all forms, however, the instru- 
ment would provide only the judgment 
of “disturbed or not-disturbed,” or per- 
haps “usual or not-usual,” with respect 
to the variables reflected in the stimulus 
words. Such a limitation is a very con- 
siderable one and places this technique 
very low on the list of projective in- 
struments useful with children. 

Woodrow and Lowell (1916) did
compile a set of norms based upon
1000 children, 9 to 12 years of age, 
using the Kent-Rosanoff list, less ten 
words. Other studies with children in- 
clude one by McGehee (1944) on age 
norms, one by McDowell (1928) on 
stutterers, and one by Mitchell, Rosan- 
off, and Rosanoff (1919). The study 
by Sanford (1936) on school-age chil- 
dren illustrates the use of this technique 
not for general clinical analysis but for 
the investigation of a specific variable. 
Anthony (1943) has used the instrument
on 11- and 12-year-old children
with a modified method of administration.
Useful treatment of the method
will also be found in Bell (1948) and 
Rotter (1951). 

Another associative technique of 
some interest but of very limited use 
has been the Cloud Pictures of Struve 
(1932) and Stern (1938). This instru- 
ment consists of a set of three pictures 
of clouds, suggested by the game of 
seeing things in passing clouds. The 
objective was to design a set of in- 
distinct, visual stimuli appropriate for 
the study of normal children. Follow- 
ing some elements of the Rorschach 
procedures, the subject is to outline 
each object he sees and to name it. 
Stern also proposed a 
phase—roughly analogous to the “test- 
ing the limits” of the Rorschach. In 
spite of a certain superficial appropri- 
ateness to ordinary childhood activities, 
this method has appealed very little to 
psychologists and, in the presence of 
techniques of much greater scope and 
power, seems unlikely to do so in the 
future. 

The same general conclusion might 
apply to associative techniques based 
upon auditory stimuli. The Verbal
Summator of Skinner (1936) and a 
modification by Shakow and Rosen- 
zweig (1940) present various indistinct 
sounds to which subjects are asked to 
give meaning. Although certain posi- 
tive results are found when compar- 
ing groups, there is always a very 
limited value in it for studies with chil- 
dren or for studies of individual dy- 
namics. More material on this approach 
may be found in Bell (1948) and 
Lindzey (1960). One modification, by 
Davids and Murray (1955), shows
somewhat greater promise, though
work on it has been limited to the initial
report. Its advantage appears to lie
in the fact that the sounds used are 
organized according to eight different 
dispositions: pessimism, distrust, anx- 
iety, resentment, and so on. The sub- 
ject is asked to identify the ideas he 
heard and to say which were major and 
which minor ideas. Although no work 
with children is reported, independent 
clinical judgments and other data on 
adults appear to relate meaningfully to 
ratings derived from the test.

The construction techniques involve
the subject in the development of some 
form of an organized plot, told in re- 
sponse to pictorial stimuli of varying 
degrees of structured content. These 
methods require a more elaborate and 
integrated verbal response than the as- 
sociations required in the associative 
techniques. 

Thematic Apperception Techniques. 
The TAT is the principal and most 
widely used construction technique. It 
is the model upon which other related 
techniques are based. The instrument 
was devised by Morgan and Murray 
(1935). As presented in its first 
published form by Murray (1943), the 
test consists of 30 pictures of varying 
content and degrees of ambiguity. One 
of these is a blank card. These 30 cards 
consist of ten so-called neutral cards, 
which are administered to males and 
females, ten for females alone, and ten 
for males alone. Of the ten cards for 
male subjects, certain are intended for 
both adults and children, and some are 
for only one or the other. The same
applies to the female cards. Thus a given
subject will be given the ten neutral 
cards plus the ten cards appropriate to 
his age and sex. 

There are various sets of specific in- 
structions for the administration of this 
test. They will be found in the several 
general manuals indicated below. All 
are modifications of the basic and still 
commonly used instructions of Murray. 
In essence, the subject is presented each 
card individually and asked to tell a 
story about it. He is to tell what is 
going on in the scene, what the people 
are thinking and feeling, what led up 
to the scene, and how it will come out. 
Murray (1943) prefers that the set of 
20 be administered in two separate 
sessions, followed by an inquiry into 
the origin of the stories and some as- 
sociation to certain elements. Rapaport, 
Gill, and Schafer (1946) suggest an in- 
quiry, either after each card or the 
entire series, into any elements of un- 
clarity—of a perceptual, verbal, or 
meaning sort. Rotter (1946) and Henry 
(1956) prefer delaying inquiries until 
all stories are completed. Regardless of 
the particular location of the inquiry, 
most examiners use only a few general 
questions of no set character, aiming 
principally at clearing up points that 
appear ambiguous in the story as told 
or about points that appear to be of 
unusual interest. 

The analysis of the resulting record 
is by no means standardized at present. 
Murray’s initial proposal first identified 
the hero, the central figure of each 
story, and then described the motives 
of the other characters and the pressures 
upon the hero as they were portrayed 
in the story. These were described in
terms of the set of needs and pressures
earlier presented by Murray and others 
in Explorations in Personality (1938). 
Although this approach is frequently 
used by many investigators, various 
other methods have been suggested 
[Aron (1949), Tomkins (1947), Bellak 
(1947), Fry (1953), Fine (1955), Stein 
(1955), Henry (1956)]. A number of 
these systems may be examined as they 
are compared in application to a single 
case in Shneidman (1951). Many of the 
basic principles of interpretation have 
been summarized by Lindzey (1952). 

The central assumption of the TAT 
method, as well as most of its modifica- 
tions, is that the subject reveals basic 
elements of his personality in constructing
his stories. These elements may be
seen in the particular content of the 
stories he tells as well as in the form 
in which he tells them. Most of the 
comprehensive systems of interpreta- 
tion are based upon these two elements 
of stories. The form elements of stories, 
partially independent of the content, 
include such variables as length of 
stories, degree of organization, re- 
sponses to details of cards, inclusive 
whole responses, preponderance of at- 
tention to past, present, or future ref- 
erences, and variability in such factors 
as these from one story to another. 
Content variables include such molar 
units as the thema or central elements 
of plot in each story or somewhat less 
comprehensive elements as the kinds of 
action attributed to various figures— 
male, female, old, young—the outcomes 
of stories—happy, sad, hostile, affiliative 
—in relation to particular people in 
stories or to kinds of thema, the sequences
of action, and the kinds of
people or issues introduced for which
there are no direct representatives in 
the picture stimulus. In addition, most 
interpreters utilize the concept of sym- 
bolic content, in reference to some form 
elements of cards seen as having special 
symbolic implications, or total story 
situations reflective of various crucial 
life crises—oedipal drama, authority re- 
sentment, sexual anxiety, and so forth. 

Plans of analysis also normally in- 
volve interpretive inference as a basic 
element. Such schemes differ in the 
stress upon the amount of inference required
and in the attention that they
call to describable elements of the
responses themselves used as documentation
of their inferences. In comparison
with the Rorschach, it is often
said that the TAT provides data
relevant to the content of the personality
rather than its structure. There
is a certain justification for this notion
in the fact that the TAT stories provide 
an obvious richness of content related 
to specific attitudes, viewpoints, role 
relationships, conflicts, and personal 
dispositions. However, both instru- 
ments have been used to develop in- 
ferences in both structural and content 
areas. The distinction stems more from 
the fact that the customary focus of 
analysis of the Rorschach has been upon 
form variables, whereas that of the 
TAT has been upon content variables. 

The elements of personality with 
which the TAT deals have been excellently
stated by Lindzey:

The richness of the response data elicited
provides a relatively satisfactory basis
for a wide variety of different kinds of 
analysis. The thematic material contained 
in the stories permits the interpreter to
make inferences concerning many different
dispositions and conflicts and, in addition,
a great deal can be learned concerning
the subject’s psychological world and
his relation with others, as well as with
his physical world. Furthermore, the widespread
use of this technique, in addition
to the special forms that have been developed,
make clear that a good deal is
known about its operation in a number of 
special settings. The test appears to be 
sensitive to depth or unconscious factors 
but at the same time clearly reflects con- 
scious factors and situational determinants 
so that under appropriate circumstances it 
can be used to assess any of these kinds of 
variables. The test activity is a relatively 
natural one for almost all subjects and thus 
it is usually easy to explain to subjects and 
to elicit cooperation. The absence of a 
single widely accepted objective scoring 
system has led to a healthy approach to 
the instrument in which the method of 
analysis employed is not accepted as God- 
given and beyond criticism and further- 
more is usually constructed in such a 
manner as to maximize the interests of the 
particular investigator. 

On the other hand the data collected by 
the instrument are ponderous and difficult 
to subject to analysis. Furthermore, the 
absence of a single objective scoring sys- 
tem means that the individual employing 
the instrument often spends dozens of 
hours in analysis for every hour spent in 
the collection of data. The fact that the 
instrument is responsive to dispositional, 
situational and fleeting personal determi- 
nants means that it is difficult for the in- 
terpreter to be certain whether a given 
test characteristic reflects an enduring per- 
sonal characteristic or a transient state. 
The entire test procedure rests upon complex
linguistic skills so that for subjects
who because of education or intelligence 
find it difficult to manipulate verbal sym- 
bols the test is relatively inappropriate. 
Furthermore, when individuals from a 
different culture are studied it is difficult 
to interpret their stories with confidence 
because of the differences in the language 
which they employ (1960).

The TAT has been shown to be useful
with nursery-school children by
Schafer and Leitch (1948), for some- 
what older children by Sanford (1943), 
and for psychotic children by Leitch 
and Schafer (1947). Rautman and 
Brower (1945) have shown that stories 
written by third-to-sixth-grade children 
provide useful data. There is a variety
of modifications of the principle of
the TAT with specific reference to
children. Horowitz and Murphy (1938)
describe several of these. These mod- 
ifications generally involve the selection 
of a new set of pictures and aim either 
at the general description of personality 
or the investigation of some specific 
facet of it. Balken and Vander Veer 
(1942, 1944) utilized a special set in 
the study of neurotic children. Their 
set consisted of 12 pictures chosen from 
a number of sources, including maga- 
zine illustrations, children’s scrapbooks, 
and the TAT. Henry (1947) has re- 
ported a study of Hopi and Navaho 
children based on a special set of line 
drawings intended to reflect certain 
issues contained in the TAT. 

The Children’s Apperception Test 
(Bellak, 1954) consists of ten pictures,
each showing animal figures only.
The authors propose that children can
identify themselves more easily with
animals than with persons. The scenes reflect
common problems of childhood—feed- 
ing conflicts, sibling rivalry, toilet train- 
ing, Oedipal problems. Published ex- 
perience with this instrument has not 
been extensive to date. Profitable use 
has been made of the instrument in 
clinical settings, though at least two 
researches (Biersdorf and Marcuse,
1953; Light, 1953) suggest that the
productivity is not greater in comparison
with the TAT. An unpublished
research of Budoff (1955) developed 
a set of pictures of humans paralleling 
each of the CAT pictures. In a study 
with normal children the human set
produced more, and more psychologically
relevant, material. Those children
who were by other means judged
less well adjusted appeared to be among 
those with whom the animal pictures of 
the CAT were more productive. 

The Michigan Picture Test was de- 
veloped by Hartwell, Hutt, Andrew 
and Walton (1951) and consists of 20 
pictures, only one set of 12 having been 
published to date. They are aimed at 
grade-school children and are scenes, 
largely school-dominated, of events 
common to children of these ages. 

The instrument is aimed principally 
at distinguishing well-adjusted from 
poorly adjusted children in school situa- 
tions. Preliminary well-documented 
work by the authors suggests it is suc- 
cessful at that task. Subsequent work 
in process may demonstrate its useful- 
ness beyond this area. 

Other techniques closely related to 
the TAT are now available, although 
the work on them has been too limited 
to evaluate their usefulness with chil- 
dren. 

The Make-A-Picture-Story Test of 
Shneidman (1948, 1949, 1952) has cer- 
tain properties that might make adapta- 
tion of it very useful with children. 
The test consists of a number of back- 
ground scenes presented individually 
to the subject along with a large num- 
ber of movable human and animal 
figures from which the subject selects 
to put on the picture and about which
he then tells a story. The opportunity
to select, reject, and manipulate the
figures, and the attendant opportunity
for process observation and interview,
suggest its potential appropriateness
to children.

van Lennep’s Four Picture Test
(1948, 1951) contains four colored pictures
representing two persons interacting,
a group setting, and two pictures of
a single person. The subject is to put 
these four in any order he chooses and 
tell a story incorporating them. Only 
one story is available, although the 
order of choice of pictures is an added 
variable. 

The Object Relations Test of Phillip- 
son (1955) consists of three series of 
four pictures and a blank card. The 
pictures represent a one-person, two- 
person, three-person, and a group 
situation. The series differ in respect
to the reality content and context, each, 
however, being vague and ambiguous 
as to age and sex of persons. Details of 
the figures, in particular the faces, are 
omitted or are very indefinite. The set 
is administered in a fixed order, the 
blank card being given last. Phillipson 
stresses the development of a positive 
subject-examiner relation prior to
administering, somewhat more than
many examiners do. Beyond this precaution,
the instructions of administration
are highly similar to those proposed
for the TAT. Phillipson’s cards present 
an impression rather of the order of the 
shading portions of the Rorschach. 
Although they contain representations 
of human figures, they are, nonethe- 
less, vague and abstract. Series A and B 
are in light and darker charcoal shading, 
respectively, whereas Series C is somewhat
more lifelike, though still ambiguous.
In C some color is introduced.

The logic of this set is bound to the 
psychology of object relations devel- 
oped by various British psychoanalysts, 
notably Melanie Klein (1948) and 
W. R. D. Fairbairn (1952). The inten- 
tion in the three series is to parallel an 
hypothesized development of object re- 
lations. Series A is designed “to stim- 
ulate primitive dependent needs and 
the consequent anxieties while the in- 
definiteness of the situation and the 
absence of other reality content will 
help to emphasize the subject’s way of 
dealing with these early tension sys- 
tems.” Series B, somewhat more definite, 
in darker shading, is intended to stress 
fantasy about threatening and uncom- 
promising objects. The darkness is seen 
as evoking expression of anxieties about 
the control of forces inside the self 
and in the outer world. Series C, still 
more lifelike, is seen as presenting a 
rich and varied environment. The 
lighter level of reality representation 
and the color is to intensify the threats 
and supports of the stimulus in terms 
of emotional involvement. The blank 
card is seen as a point of “maximal ten- 
sion” and as an opportunity for the 
“dominant unconscious fantasy” to be 
expressed and the transference relation 
to the tester to be seen. 

Although it remains to be seen how 
successful Phillipson has been in his 
intentions, it is apparent that the design 
of the cards parallels the British object- 
relations theory and that a basic definition
of the latent implications of the
stimuli is in these terms. The resulting
stories, being plots as in most TAT
stories, lend themselves to interpretation
in almost any system, however.
Phillipson’s interpretation is, of course,
along the lines of examining the variety 
and character of the object relations. 

The test has been used fairly widely 
by Phillipson and his associates in 
clinical and other settings, but sufficient 
material has not yet been published to 
evaluate its use, especially with pre- 
adolescent subjects. However, the test 
is important, if only because of its 
systematic relationship to a specific 
theory of some elements of personality. 
The Blacky Pictures, it will be seen 
below, have been developed in the same 
spirit; they hold a similar relationship 
to classical psychosexual stage develop- 
ment as Phillipson’s instrument to ob- 
ject relations theory. 

Sets of pictures have also been suc- 
cessfully used, not as an alternative 
TAT, but with respect to some special 
interest. Horowitz (1939) used a set
of pictures of children of different races
to investigate racial attitudes. Amen (1941)
used a special set of 15 pictures, with 
certain movable cut-out figures, in an 
investigation of the development of 
perceptual responses. Vernon (1940a,b) 
and Grotjahn (1941) have also studied 
perceptual processes in the reaction of 
children to pictures and the relation of 
these elements to personality. 

A single card representing a group of 
four persons, a younger couple and 
an older couple, has been used by 
Neugarten and Gutmann (1958) to 
study age and sex grading in adults. 
This approach, bearing similarity to 
Horowitz’s earlier study of social atti- 
tudes, has considerable promise for 
work with younger subjects. Hess 
(1959), using a modified TAT set and
an administration procedure proposed
by Henry and Guetzkow (1951) for 
studying interacting groups, has in- 
vestigated the emotional climate and 
organization of family groups with 
adolescent children. 

New pictures, or a modified use of the
TAT set, have also been used in a
somewhat more specific way to investigate
particular variables. These include study of
achievement (McClelland, Atkinson,
Clark, and Lowell, 1953), affiliation
(Atkinson, Heyns, and Veroff, 1954),
dominance (Veroff, 1957), and group
cohesiveness (Libo, 1953). Although used
more with adults than children, this 
approach holds promise for the study of 
children. 

The Blacky Pictures (Blum, 1949, 
1950; Blum and Hunt, 1952) combine 
several of the principles of modification
of the TAT suggested earlier.
It is a set of 12 pictures. The pictures 
are of animals in cartoon style and 
purport to be various scenes in the life 
of a dog called Blacky. These scenes 
also deal with Mama, Papa, and Tippy, 
a sibling. The sex of both Blacky and 
Tippy is unspecified, permitting sub- 
jects to attribute either maleness or 
femaleness to either or both dogs. This 
cast of characters recurs throughout but 
in differing circumstances. Although 
the scenes are designed in part to re- 
flect various childhood problems, more 
importance is to be placed in their 
design around certain psychological 
variables stemming from psychoanalytic 
theory of sexual development. This 
instrument thus combines the advan- 
tages of being designed to investigate 
certain specific variables while also 
providing data appropriate to broader,
more holistic interpretation. Moreover,
its childlike facade appears not to limit 
its usefulness to that age group. 

Each card is presented with a brief 
identification, e.g., “Here is Blacky
with Mama,” and the subject is asked to 
tell a story in the regular TAT style. 
A planned and often multiple-choice 
inquiry follows. Then choices of best- 
and least-liked cards are elicited. Analysis
is in terms of the 11 psychoanalytic
variables around which the test was
designed. These include such variables
as oral eroticism, Oedipal intensity,
masturbation guilt, castration anxiety
(penis envy), and positive identification.
An additional modification (Blum,
1956) secures information on the tendency
of subjects to employ various of
the analytically defined mechanisms of 
defense. 

Dreams are not generally considered 
a projective technique, presumably be- 
cause they are entirely spontaneous 
products of a subject, are seldom under 
any control by an experimenter, and in- 
volve no specific stimulus materials. 
However, the end product, the dream, 
partakes of the most crucial of the 
qualities of a projective response, the 
reflection in an outward form of the 
inner preoccupations of the subject. 
The analysis of this end product has 
seldom been undertaken in a systematic 
way, though the problem presented 
differs little from that of analyzing a 
TAT or other forms of verbal produc- 
tions, including thematic elements. 

The biggest drawback to any planned 
use of dreams as a projective instrument 
is, of course, that the experimenter must 
wait for the subject to report his dreams 
rather than stimulate that production
under circumstances convenient to the
examiner. The subject can also readily
alter or censor his dream in the reporting
of it, though a comparable degree
of censoring probably applies to
most other projective performances.
Dreams have been produced experimentally
under hypnosis by several investigators
(Fahrer and Fisher, 1943;
Klein, 1930; Sweetland and Quay,
1952), but there is some question of
the relation of such dreams to the
dreams of regular sleep (Brenman, 1949;
Kanzer, 1953) and of the applicability
of this procedure to children. Unfortunately,
not all persons dream, or at
least many report that they do not.


Dreams have been considered most 
sensitive to the deepest and most in- 
tensely personal aspects of personality. 
As productions, they are entirely the 
product of the dreamer, uninfluenced, 
except in the reporting, by any set or 
preferences or suggestion of an ex- 
aminer. Although certain transforma- 
tions undoubtedly occur in reporting, 
such distortions at least stem principally 
from the dreamer himself. The back- 
ground of the dream, and, in part, the 
mechanisms of transformation, may be 
studied by the traditional psycho- 
analytic technique of securing free 
associations to the dream. Some stand- 
ardization of the conditions of report- 
ing the dream may be arranged (Hall, 
1956). 

The intimate relation of the dream to 
the inner world of the subject recom- 
mends further study of the dream, and 
especially sequences of dreams, in spite 
of the various difficulties involved in 
systematizing the collection of them. 
The basic procedures of analysis might 
readily start from procedures now in 
use with other projective instruments. 
The resemblance of certain projective 
instruments to dreams has frequently 
been pointed out (Holt, 1951). The 
most extensive use of the dream has been 
in therapeutic settings, often through 
the technique of free association. The 
aim here has been related to some ele- 
ment of the therapy process. The basic 
logic for this process should lend itself 
to analysis in more diagnostic circum- 
stances. This logic stems largely from 
Freud’s The Interpretation of Dreams 
(1953). Relevant books on dream lan- 
guage by Gutheil (1939) and by 
Fromm (1951) would be useful. Hall’s 
works (1953, 1956) relate dreams to 
more traditional psychological and re- 
search concerns. 


COMPLETION TECHNIQUES 


The completion techniques require 
the subject to examine some situation 
devised by the experimenter and to re- 
spond to some stated question regard- 
ing the outcome or completion of that 
situation. The situation may be short, 
as with incomplete sentences, or it may 
be more elaborate, as with the cartoons 
of the Picture-Frustration Test and the 
semiplots of the Insight Test. In com- 
parison with the construction tech- 
niques, the cues provided by the stimuli 
usually form a more integral part of 
the subject’s response and thus limit to 
some extent the nature of the reply. In 
comparison with the association tech- 
niques, the stimulus is usually more de- 
fined and, excluding possibly the Ror- 
schach, more complex. 

The Sentence-Completion Method. 
The sentence-completion technique is 
based upon the concept of the word 
association test. It differs importantly, 
however, in that the stimulus is nor- 
mally longer and more complex, and 
the response required must result in a 
full sentence. A typical form of this test 
may contain 25 to 100 stems, and they 
may be very short (I . . . My mother 
. . .) or more elaborate and aimed at 
some more specifically defined situa- 
tion (When the boss bawled him out, 
he . . .). The usual instruction asks for 
the completion of the stem in any way 
the subject chooses. Speed is not a 
crucial element, though subjects are 
usually requested to report fairly 
quickly whatever occurs to them with- 
out too much thought. Items may be 
phrased in the first person or the third 
person. Hanfmann and Getzels (1953) 
have demonstrated that the first-person 
format tends to encourage responses 
more related to conscious and manifest 
aspects of the person, whereas the 
third-person form taps less overt material. 

Some forms of this instrument take 
on the character of a more standardized 
test to estimate some single variable. 
The Rotter Incomplete Sentences Blank 
(Rotter and Rafferty, 1950; Rotter and 
Willerman, 1947) is designed to tap 
the variable of personal adjustment. The 
test includes 40 items and a fairly ob- 
jective scoring system that demonstrates 
satisfactory scorer reliability and cor- 
relation with independent measures of 
the same variable. Other forms are de- 
signed to test an array of personal 
variables—attitude towards heterosexual 
relations, friends, superiors, school, 
fears, guilt feelings, abilities, and goals. 
The Sacks Sentence Completion Test 
(Sacks and Levy, 1950) is of this order. 
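
What "satisfactory scorer reliability" amounts to in practice can be illustrated with a small computation. The following is a minimal sketch in Python and is not part of the Rotter materials: the function name and the two lists of total scores are invented for the example, and the product-moment correlation is used here only as one common index of agreement between two scorers.

```python
import math


def pearson_r(xs, ys):
    """Product-moment correlation between two equally long score lists."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two lists of equal length with at least 2 scores")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    dev_x = [x - mean_x for x in xs]
    dev_y = [y - mean_y for y in ys]
    cross = sum(dx * dy for dx, dy in zip(dev_x, dev_y))
    ss_x = sum(dx * dx for dx in dev_x)
    ss_y = sum(dy * dy for dy in dev_y)
    if ss_x == 0 or ss_y == 0:
        raise ValueError("scores must vary for the correlation to be defined")
    return cross / math.sqrt(ss_x * ss_y)


# Invented total scores given by two scorers to the same five protocols.
scorer_a = [120, 135, 128, 150, 142]
scorer_b = [118, 138, 130, 149, 140]
reliability = pearson_r(scorer_a, scorer_b)
```

A value near 1 would indicate that the two scorers rank and space the protocols in nearly the same way; what counts as satisfactory is a judgment the investigator must make for his own purposes.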

In general, this instrument is seen as 
providing information on attitudes, 
motives, and conflicts at a more con- 
scious level rather than on the structure 
and organization of personality. One 
advantage of this fact is that a sentence- 
completion instrument may be designed 
by each investigator to study attitudes 
in a wide variety of contexts. One dis- 
advantage of this very flexibility is, of 
course, that normative data tend to be 
limited, and each study must rely upon 
logics of internal consistency or the 
face validity of the items. It can readily 
be used with any subjects able to read 
and write, though it can also be read 
to subjects and their verbal responses 
recorded. 

Productive use of special groups of 
incomplete sentences has been made 
in a number of researches. Cameron 
(1938a,b) has studied both children and 
adults. His interest was not personality 
but rather the nature of language and 
thinking processes. Sanford et al. 
(1943) designed an instrument of 30 
items, with the Murray need-press 
system as a guide in the design of stems, 
and used it with third- through eighth- 
grade school children. Rohde (1946) 
used the need-press system not for the 
original design of stems but for cate- 
gorization of responses in a study of 
high-school children. She found the test 
valuable for personality analysis in these 
ages. The test has also had extensive use 
in a variety of clinical situations with 
adults (Rotter, 1951). Its application in 
the study of attitude structure and 
socialization has been shown by Getzels 
and Walsh (1958). 

Summaries of work with this instru- 
ment are provided in chapters by Rotter 
(1951) and Bell (1948) and in a recent 
comprehensive book by Rohde (1957). 

Storytelling and Completion Meth- 
ods. Storytelling and completions repre- 
sent a useful variation of the TAT and 
sentence-completion methods. Despert 
and Potter (1936) and Despert (1940) 
have used a story-telling method. They 
report on the retelling of such stories 
as The Big Bad Wolf and Goldilocks 
by children 4 to 14. In addition, they 
asked these children to make up stories 
on topics given by the experimenter. 
Wright (1941, 1945) in a more formal 
circumstance used stories and other 
methods to evaluate the effect of moral 
conflict upon children. She also used 
toys and at one stage asked the children 
to make up stories about the toys. 
Another variant of the story method is 
seen in the work of Haggard (1942). 
This form was based on comic-strip 
characters. Children chose their favorite 
characters, told what was happening in 
the newspaper version of these comic 
strips, and then created their own plot 
around these characters. 

Various other efforts in this direc- 
tion utilize the completion aspect more 
centrally. The Story Elaboration Test 
of Murray (1937, 1938), used with 
college students, consists of 32 different 
dramatic situations. Each is to be de- 
veloped by the subject into a full plot. 
Zucker (1943) studied the emotional 
attachment of children to their parents 
by the use of a story-completion 
method. He presented stories essentially 
complete, except for the ending, and 
asked his subject to state the outcome. 
A multiple-choice format has also been 
developed by Kelly and Bishop (1942) 
and by Roody (1943). 

Sargent’s Insight Test (1944, 1955) 
and its adaptation for children by Engel, 
the Children’s Insight Test (1958), rep- 
resent a very useful combination of the 
advantages of the paper-and-pencil-test 
format and some principles of the pro- 
jective tests. The two tests present a 
series of armatures describing in skeletal 
form several plot situations to which the 
subject is to respond by answering the 
questions: 1. What did the person de- 
scribed in the armature do? 2. How did 
he feel? The following is a typical 
armature from the Children’s Insight 
Test: A boy came home from school 
and found his dog had run away. What 
did he do? How did he feel? Other 
armatures deal with the child’s mode 
of problem solving in relation to his 
parents, unrelated adults, other children, 
school failure, loss of love object, dis- 
tance, time, and illness. A boy’s and a 
girl’s form, of 13 items each, are avail- 
able. The test is appropriate for chil- 
dren between 6 and 10. 

A system of scoring and analysis has 
been worked out, which focuses upon 
various elements of the expression of 
affect, defensiveness against affect, and 
thought processes indicative of malad- 
justment. 

The Picture-Frustration Study of 
Rosenzweig (1945) provides a series of 
cartoon situations, each dealing with 
some kind of frustrating circumstance. 
The subject is asked to provide a verbal 
response for the person who has been 
frustrated. The cue to the frustrating 
circumstance is contained in a state- 
ment made by one of the persons and 
is illustrated by the cartoon. A chil- 
dren’s form (Rosenzweig, Fleming, and 
Rosenzweig, 1948), as well as an adult 
form, is available, as is a modification 
aimed at attitudes toward minority 
group members (Brown, 1947; Sommer, 
1954). 

The focus in analysis is upon the 
direction of the verbal response—extra- 
punitive, intrapunitive, impunitive— 
and upon the mode of response— 
obstacle dominant, ego defensive, need 
persistent. Combinations of scores in- 
dicate the tendency of the subject to 
employ various of these manners of re- 
sponse. They are also an indication of 
how common this response pattern is. 
Scoring is aided by sample responses and 
by various norms (Rosenzweig et al., 
1948; Rosenzweig, Fleming, and Clark, 
1947). 
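
The tallying implied by this two-way scheme can be sketched as follows. This is a minimal illustration in Python, not Rosenzweig's published scoring procedure: the function name, the data layout, and the sample protocol are assumptions made for the example, and the assignment of each response to a direction and a mode is taken as already done by a trained scorer.

```python
from collections import Counter

DIRECTIONS = ("extrapunitive", "intrapunitive", "impunitive")
MODES = ("obstacle dominant", "ego defensive", "need persistent")


def tally_pf(scored_responses):
    """Summarize scored responses by direction, by mode, and by combination.

    scored_responses is a list of (direction, mode) pairs, one per cartoon,
    as assigned by a human scorer. Values returned are percentages of all
    responses, rounded to one decimal place.
    """
    for direction, mode in scored_responses:
        if direction not in DIRECTIONS or mode not in MODES:
            raise ValueError(f"unknown category: {direction!r}, {mode!r}")
    n = len(scored_responses)
    if n == 0:
        raise ValueError("no responses to tally")
    by_direction = Counter(d for d, _ in scored_responses)
    by_mode = Counter(m for _, m in scored_responses)
    by_cell = Counter(scored_responses)

    def pct(counter, keys):
        return {k: round(100.0 * counter[k] / n, 1) for k in keys}

    cells = [(d, m) for d in DIRECTIONS for m in MODES]
    return {
        "direction": pct(by_direction, DIRECTIONS),
        "mode": pct(by_mode, MODES),
        "cell": pct(by_cell, cells),
    }


# Hypothetical protocol of four scored responses (illustrative only).
protocol = [
    ("extrapunitive", "ego defensive"),
    ("extrapunitive", "need persistent"),
    ("intrapunitive", "ego defensive"),
    ("impunitive", "obstacle dominant"),
]
profile = tally_pf(protocol)
```

Percentages of this kind are what would then be set against the published norms; the comparison itself is not attempted here.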

The P-F Study has been used fairly 
widely in research and in clinical situa- 
tions (Lindzey and Goldwyn, 1954; 
Mirmow, 1952). The data of the P-F 
are limited to the specific variables used 
by Rosenzweig and would appear to re- 
flect more generally the manifest and 
overt levels of behavior than they do 
deeper levels. At the same time these 
variables are clearly defined and well 
measured by this instrument. The 
theoretical conception of frustration 
used is stated in Rosenzweig’s volume 
Psychodiagnosis (Rosenzweig, 1949). 


EXPRESSIVE TECHNIQUES 


The expressive techniques are those 
that tend to combine diagnostic and 
therapeutic features. Although not all 
of the expressive techniques dealt with 
here need be utilized therapeutically, 
most of them are so employed by some 
investigators. They present to the sub- 
ject a flexible, plastic material or situa- 
tion to which he is to respond by in- 
corporating the stimuli into some kind 
of personal, novel production. Unlike 
the construction techniques, the pro- 
duction is often not exclusively verbal, 
and the process of producing the end 
result is frequently as important as the 
final production. When the process is 
of central importance and the investiga- 
tor places stress upon it, the technique 
has a greater tendency to be used 
therapeutically. 


Play Techniques. The contemporary 
uses of various forms of play techniques 
for personality research are the deriva- 
tive of some of their uses in the 
therapeutic realm. Freud’s report of 
Little Hans (Freud, 1925) and the child 
analysis through play reported by Hug- 
Hellmuth (1921) represent the begin- 
nings in this area. Since those days, 
the work of Anna Freud (1928) and of 
Melanie Klein (1932) has considerably 
extended this therapeutic use. Reviews 
of the therapeutic work with play 
may be found in Kanner (1940), Roger- 
son (1939), Bell (1948), and Solomon 
(1951). The use of play in therapy 
with modifications of technique and a 
different theoretical background is 
represented in the work of Taft (1933), 
Allen (1942), and Lowenfeld (1935, 
1938). A bridge to the investigation of 
specific personality elements occurs in 
such normative studies as those of 
Bonham and Sargent (1928) and Horne and 
Philleo (1942), and in the use of play to 
study social adjustment of children, as 
in Anderson (1937), Isaacs (1933), and 
Wright (1942). Erikson, in addition to 
his classic monographs (1937, 1940) on 
the interpretation of play constructions, 
has extended this descriptive and norma- 
tive concern in his studies of the Sioux 
(1939) and the Yurok (1938), as has 
Roheim (1941, 1943) in his explorations 
of the Normanby Islanders and Henry 
and Henry (1944) in their study in 
Pilaga. 


These studies and others make it 
amply clear that the play methods have 
great usefulness in the diagnosis and 
measurement of personality attributes. 
The work of Levy (1933, 1936, 1937) 
has been instrumental in showing the 
potentialities of play in a relatively 
standardized way for the study of 
specific elements of personality— 
hostility and sibling rivalry. The work 
of Sears and his associates (Phillips, 
1945; Pintler, 1945; Sears, 1947; Sears, 
Whiting, Nowlis, and Sears, 1953) has 
advanced the uses of this instrument 
by study of various issues of technique 
and the experimental study of such 
variables as aggression and dependency. 

The materials of the play techniques 
vary with the investigator but normally 
include an array of objects that the 
subject is to use, play with as he chooses, 
or arrange in accordance with some in- 
structions. In some instances, the objects 
are quite varied and numerous. The 
Miniature Life Toys of Lois Murphy 
(1956), for example, include the follow- 
ing: 

. . . dolls, which might be representative 
of family members, furniture for the main 
areas of home activity; planes, trains, cars, 
boats, animals, and soldiers for activity 
outside of home; farm animals and a 
fence; wild animals; cowboys, Indians, 
soldiers; and Lincoln blocks and larger 
blocks, which can be adapted for any 
purpose. The material is of different col- 
ors. There are some broken pieces, some 
obviously old, some new in each box. 

The instructions and testing circum- 
stances are very unstructured and em- 
phasize the child’s freedom to play with 
the toys in any manner he chooses. 
The examiner records verbalization and 
play sequences, interfering very little 
beyond maintaining some involvement 
by following conversational cues pro- 
vided by the child. 

The World Test is not dissimilar in 
nature and use. Developed initially by 
Lowenfeld (1939), its use and diag- 
nostic relevance are described by 
Buhler and Kelly (1941) and by 
Bolgar and Fischer (1947). 

More structured forms, generally in- 
cluding fewer toys related directly to 
more limited objectives, are those of 
Sears and Levy referred to above and 
others made by Lynn (Lynn, 1955, 1957; 
Lynn and Lynn, 1959). Despert (1937), 
in a more clinical setting, has also 
introduced a specific element of struc- 
ture in a different way. Entrusting a 
child with a sharp knife and the in- 
struction to scrape cardboard, Despert 
notes a useful method of encouraging 
latent aggression trends. The shavings 
were then mixed with glue and various 
forms were modeled. Although 
Despert’s method has a strong thera- 
peutic intent, it illustrates an early ap- 
proach to the use of play circumstances 
for the study of specific psychological 
elements. 

A related series of play situations with 
more limited goals than those of the 
Miniature Life Toys or the World Test 
includes tests developed for research 
use by Murphy, Lerner, Stone, and 
others (Murphy, 1956). These cover a 
wide range of test situations related to 
the study of various personality at- 
tributes. Some are aimed at the expres- 
sion of latent feelings and are compara- 
tively limited in nature. These are the 
Sensory Toys and the Dough and Cold 
Cream Situations, involving exploration 
of sensory and tactile modalities. 
Various active play methods involve 
the presentation of specific situations 
aimed at the study of aggression and 
frustration. Group techniques are an 
interesting modification, since they are 
used with small groups of children and 
partake of psychodrama elements. They 
include semistructured situations in 
which children participate under adult 
guidance in a Potential Leadership 
Game, an Imposed Leadership Game, 
and a Family Game. Each of these meth- 
ods is presented in fuller detail in 
Murphy (1956). Bell (1948) and Wolt- 
mann (1952) also provide general dis- 
cussions of these methods and of work 
with them. 


DRAWING AND PAINTING 


Drawing and painting are two modes 
of securing projective data that lend 
themselves readily to the study of 
children. Developments in this field 
might be said to have had three general 
sources. One is undoubtedly the inter- 
pretation of the free drawings and other 
art products in the psychiatric study of 
disturbed children. The reports of 
Bender (1937), Bender and Rapaport 
(1944), Bender and Woltmann (1937), 
and Despert (1938) illustrate the use 
of drawing in the clinical setting. 
Montague (1951) presents a discussion 
of the use of such drawings in the 
psychoanalytic study of schizophrenic 
children. The second approach is that 
normally called the study of expressive 
movement and includes drawing among 
a variety of other expressive personal 
movements. Here the work of Wolff 
and Precker (1951) and of Harms 
(1946) is noteworthy. The well-il- 
lustrated volumes of Alschuler and 
Hattwick (1947) provide excellent 
documentation of the nature and variety 
of children’s drawings and paintings. 
The third approach to drawing has 
stemmed from its early use as a 
measure of intelligence. The work of 
Goodenough (1926) called attention to 
drawing as an indicator of cognitive de- 
velopment in children as well as to the 
various features of these drawings that 
appeared to relate more to personality 
variables than to cognition. 

Out of these combined beginnings 
have come several contemporary uses of 
both drawing and painting as projective 
devices. One of the best known is the 
technique described by Machover 
(1948, 1951). In her usage, the subject 
is asked to draw a person and, after this 
has been drawn, to draw a person of 
the opposite sex. Standard equipment 
is used each time, and any verbal com- 
ments or unusual approaches to the task 
are recorded. A closely related form is 
that of Buck (1948a,b, 1949), who asks 
for a drawing of a house, a tree, and a 
person. Buck has proposed a series of 
hypotheses related to the areas of per- 
sonality sampled by his method. These 
include intelligence, interpersonal re- 
lations, family relations, major per- 
sonality needs, affect state, and psycho- 
sexual stage. Although many of these 
require experimentation, the procedure 
has been thoughtfully developed and 
holds promise. 

Painting is highly similar to drawing 
and was brought to public attention in 
the form of finger painting by Ruth 
Shaw (1934). Elkisch (1945) and par- 
ticularly Napoli (1946, 1947, 1951) have 
developed this procedure as a projec- 
tive technique. 

In both the Machover draw-a-person 
approach and the Napoli development 
of finger painting, the interpretive rules 
stem from attributing various meanings 
in personality terms to the end product 
and the behavior of the subject 
during the process. In the Machover 
format such elements as the following 
are seen as crucial: thickness and thin- 
ness of line, location of drawing on the 
page, size of drawing, symmetry, and 
content. In addition, various sectors of 
the figures are seen as conveying data 
regarding particular personality areas. 
Thus the head and facial features are 
seen as expressive of social needs and 
responsiveness as well as a reflection 
of intellectual ambitions and the extent 
of drive for a rational control of im- 
pulse. Within these head features 
further subdivision also occurs. The 
hair, for example, is seen as having 
sexual symbolic meaning, as has the 
nose. The interpretive process involves 
the application of general clinical logic 
to the various modes of response and the 
gradual building up of a personality 
portrait. 

In the finger-painting format par- 
ticular attention is given to the process 
of creating the final product. Thus the 
administrator must be sensitive to such 
factors as the way in which the hands 
are used, the kinds of motions—smear- 
ing, scrubbing, patting, scribbling— 
rhythm, order, and other related ele- 
ments. More than one painting is 
generally desired, and interrelations 
within a series are important. Inter- 
pretation is made within the three basic 
areas of data: the performance observa- 
tion, the painting itself, and the 
verbalizations. Principles of analysis are 
not dissimilar to those of Machover, 
though they are aided by various em- 
pirical studies. A distinctive feature of 
finger painting is color, which is given 
special significance beyond its role in 
the analysis of other parts of the total 
production. Thus blue is found to be a 
color dominantly used by males and is 
thought to reflect masculinity, security, 
drive, sincerity. Extreme amounts sug- 
gest sadism, violence. Black with blue 
in a male indicates despondency. Green 
indicates a controlled but highly de- 
veloped emotionality and reflects a 
creative potential. 

An extensive summary of these em- 
pirical rules and their personality cor- 
relates is provided in Bell (1948). 
Anastasi and Foley (1940, 1941a,b,c) 
have summarized a great deal of the 
literature on art production in the ab- 
normal. Swensen (1957) provides a 
summary of experiments designed to 
test certain of the interpretive proce- 
dures of the drawings of the human 
figure. 


PSYCHODRAMA 
AND ROLE PLAYING 


The psychodrama is the unique con- 
tribution of J. L. Moreno to what Haas 
and Moreno (1951) call the “action” 
approach to the study of human in- 
terrelations. Some of the basic ideas of 
the psychodrama are presented in 
Moreno (1946). 

The psychodrama requires the in- 
dividual to act out, by himself or with 
others, some kind of human drama or 
personality-relevant situation. It has 
been used in various forms in industrial- 
training situations, in survey research, 
in studies of military adjustment, and in 
the study and treatment of a wide 
variety of personal maladjustments. 
These have included studies of the 
behavior disorders of children (Green- 
hill, 1945; Horwitz, 1945; Shoobs, 1944; 
Sullivan, 1945) and a proposal regarding 
its use in the social psychological study 
of the family by Foote and Cottrell 
(1955). Del Torto and Cornyetz (1944) 
have developed a series of nine psycho- 
dramatic test situations, which they see 
as a projective instrument to provide 
data on the subject’s social inter- 
actions. 

Although this method has been used 
less frequently in the study of per- 
sonality attributes, it appears to hold 
considerable potentiality if brought into 
the context of more limited personality 
variables. The Activity and Situation 
Tests and Leadership Games of Murphy 
(1956) have elements closely related 
to psychodrama methods, adapted to 
the study of more specific attributes in 
children. 


EXPRESSIVE MOVEMENTS 


The realm of phenomena usually 
called expressive movements is closely 
intertwined with the study of per- 
sonality and the projective techniques. 
This area is, however, more a field of 
study of certain elements of behavior 
and their unconscious correlates than 
a projective technique. Certain expres- 
sive movements are, of course, involved 
in some projective techniques. In a 
sense, certain of the procedures for 
analysis of play and painting, of the 
sensory toys of Murphy, and of the 
movements of acting out in psycho- 
drama concern themselves with expres- 
sive movements. One area of expressive 
movement, handwriting, may in some 
senses be considered a projective tech- 
nique. 

The distinction made by Bellak 
(1944) and utilized by Wolff and 
Precker (1951) calls attention to three 
aspects of behavior. There is that be- 
havior that is adaptive and involves 
centrally realistic adjustment or re- 
sponse to tasks demanded of us. Al- 
though certain responses may be in- 
appropriate, hence maladaptive, the 
focus is still upon the effort to make a 
realistic adjustment to some external 
world stimulus. Certain behavior is 
projective and concerns behavior in 
which one ascribes or reveals one’s own 
wishes and needs without necessary 
conscious awareness of doing so. This 
is, of course, one of the crucial features 
of the behavior elicited by a projective 
technique. Certain behavior is expres- 
sive and involves one’s style of response. 
The emphasis in the expressive element 
of behavior is upon the individual 
manner of approaching a task and may 
apply to behavior that is also adaptive 
and/or projective. Thus the field of 
finger painting involves certain body 
movements, ways of making line and 
figures, and a similar style of making 
the resulting painting. In this sense, 
expressive behavior is not the centrally 
projective element, but study of those 
aspects of the total painting process will 
aid in the analysis of that projective 
medium. 

Studies of expressive movement in- 
clude such areas as motor behavior— 
characteristic movements or bodily 
postures—handwriting, facial expres- 
sions, proportion and sizes involved in 
drawings and doodles, voice qualities, 
speech attributes, and various others. 


Many of these movement attributes are 
assumed to have some characteristic 
individual consistency, hence to bear 
upon the study of personality. As such, 
they bear more relevance to those pro- 
jective behaviors that include bodily 
gestures or movement than upon those 
that emphasize verbal response or some 
resulting finished product. 

Additional material on studies of ex- 
pressive movements and the ways in 
which they may be utilized in the 
analysis of certain projective techniques 
may be found in Wolff and Precker 
(1951), in Wolff (1946), and in Bell 
(1948). Material dealing directly with 
handwriting analysis may be found in 
a chapter by Wolfson (1951) and in 
Bell (1948). 


Choice or Ordering Techniques 


The choice techniques present the 
subject with a set of stimuli, which he 
is asked to place in a certain order or 
among which he is to indicate pref- 
erence. In the two methods briefly 
discussed here, the Szondi and the Pic- 
ture Arrangement Test, the stimuli 
are pictures. 

The Szondi test presents 48 photo- 
graphs of adult psychiatric patients 
drawn from eight diagnostic groupings. 
The pictures are shown to the subject 
in sets of eight, each set containing a 
picture from each of the diagnostic 
groups. In each set the subject is to 
select the two he likes the best and 
the two he likes the least. Upon re- 
viewing all sets in this fashion, he is 
asked to select again from those he 
indicated as “liked” the four he likes 
the most and similarly the least liked. 
Susan Deri (1949) reports that the re- 
sults are not meaningful unless the test 
is administered in this fashion at least 
six different times. 

The pattern of choices is then arrayed 
by each of the diagnostic factors repre- 
sented by the pictures, indicating the 
number of likes and dislikes for each 
factor. Interpretation is based upon 
the logic that the choices, positive or 
negative, for a given factor reflect the 
psychic state of the subject with respect 
to the factor. The factors reflect the 
diagnostic groups and include a sexual- 
vector, reflecting homosexual and 
sadistic factors, a paroxysmal-vector, 
reflecting epileptic and hysterical fac- 
tors, a contact-vector, reflecting depres- 
sive and manic factors, and a self-vector, 
including catatonic and paranoid fac- 
tors. 
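
The arraying of choices by factor can be sketched as follows. This is a minimal illustration in Python, assuming that each picture is identified simply by the diagnostic group it represents (possible because every set contains one picture from each group). The function name, the data layout, and the sample choices are invented for the example; the second stage of selection among the liked and disliked pictures is omitted, and no interpretation of the resulting counts is attempted.

```python
from collections import Counter

# Vector and factor names follow the text; the data layout is an assumption.
VECTORS = {
    "sexual": ("homosexual", "sadistic"),
    "paroxysmal": ("epileptic", "hysterical"),
    "contact": ("depressive", "manic"),
    "self": ("catatonic", "paranoid"),
}
FACTORS = tuple(f for pair in VECTORS.values() for f in pair)


def tally_administration(sets):
    """Count likes and dislikes per factor for one administration.

    sets is a list of (liked, disliked) pairs, one per set of eight
    pictures; liked and disliked each name two different factors.
    Returns {factor: (number_liked, number_disliked)}.
    """
    likes, dislikes = Counter(), Counter()
    for liked, disliked in sets:
        chosen = tuple(liked) + tuple(disliked)
        if len(liked) != 2 or len(disliked) != 2 or len(set(chosen)) != 4:
            raise ValueError("each set needs two liked and two disliked pictures")
        unknown = [f for f in chosen if f not in FACTORS]
        if unknown:
            raise ValueError(f"unknown factor(s): {unknown}")
        likes.update(liked)
        dislikes.update(disliked)
    return {f: (likes[f], dislikes[f]) for f in FACTORS}


# Hypothetical choices for the six sets of one administration.
administration = [
    (("homosexual", "manic"), ("sadistic", "paranoid")),
    (("homosexual", "depressive"), ("sadistic", "epileptic")),
    (("homosexual", "manic"), ("paranoid", "catatonic")),
    (("hysterical", "manic"), ("sadistic", "paranoid")),
    (("homosexual", "catatonic"), ("epileptic", "paranoid")),
    (("homosexual", "manic"), ("sadistic", "depressive")),
]
factor_profile = tally_administration(administration)
```

Since repeated administrations are recommended, the same function would be applied to each administration in turn and the resulting profiles compared.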

In introducing this test, Dr. Szondi, 
a Hungarian psychiatrist, also presented 
a fairly elaborate background theory 
regarding the effect of recessive genes 
upon behavior. This theory has never 
been taken seriously in this country. 
In any event, it is not crucial to the 
interpretation of the instrument. In- 
terpretation is also largely restricted to 
the Szondi factors; hence it has limited 
utility beyond this frame of reference. 

Because of its limited use with chil- 
dren, further information is not given 
here. The test, however, has been used 
quite often with adult clinical groups 
and should not be underestimated in 
that circumstance. Further material may 
be found in Deri’s book (1949), in a 
chapter by Rabin (1951b), and in a 
review of the Szondi literature by 
Borstelmann and Klopfer (1953) and by 
David (1954). These references in- 
clude the uses with children and 
adolescents reported up to 1954. In 
addition, Deri reports on the use with 
delinquents (Deri, 1954), and Fancher 
and Weinstein (1956) on the responses 
of 7-year-olds. 


THE PICTURE-ARRANGEMENT 
TEST 


This instrument consists of 25 car- 
toonlike pictures depicting the same 
figure in three different but related 
activities. The subject is to indicate the 
order in which these activities took 
place and to describe what is going on 
in each picture with a sentence. The 
sentence may be interpreted clinically, 
though comparatively little work has 
been reported on this part of the re- 
sponse data. The central effort has 
gone into development of objective 
methods of describing the arrangement 
of the pictures chosen by the subjects. 
A representative normal sample of 1500 
provides the base for describing the 
frequency of occurrence of various 
arrangements. Each of the patterns of 
arrangements found may be seen 
potentially as an indication of some spe- 
cific personality issue. Easy to admin- 
ister and to score, this instrument 
combines projective and objective fea- 
tures and holds promise for future 
development with children. In the 
major presentation of this work by 
Tomkins and Miner (1957), norms, 
as well as rationale and interpretive sug- 
gestions, are provided, which include 
some limited normative material on 
children. 
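
The objective side of this instrument, describing a subject's arrangement by how often it occurs in a normative sample, can be sketched as follows. This is a minimal illustration in Python and not the Tomkins and Miner scoring procedure: the function name, the lettering of the three pictures as A, B, and C, the 5 per cent cutoff for calling an arrangement rare, and all of the counts are invented for the example.

```python
def arrangement_rarity(norm_counts, subject_orders, rare_below=0.05):
    """Report how common each of a subject's arrangements is in the norms.

    norm_counts maps an item number to a dict of arrangement -> count in
    the normative sample; subject_orders maps an item number to the
    arrangement the subject chose. Returns, per item, the normative
    proportion (rounded to three places) and whether it falls below the
    assumed cutoff.
    """
    report = {}
    for item, order in subject_orders.items():
        counts = norm_counts[item]
        total = sum(counts.values())
        if total == 0:
            raise ValueError(f"no normative cases for item {item}")
        share = counts.get(order, 0) / total
        report[item] = (round(share, 3), share < rare_below)
    return report


# Invented counts for two items in a normative sample of 1500.
norm_counts = {
    1: {"ABC": 900, "ACB": 300, "BAC": 150, "BCA": 90, "CAB": 45, "CBA": 15},
    2: {"ABC": 400, "ACB": 350, "BAC": 300, "BCA": 250, "CAB": 150, "CBA": 50},
}
subject_orders = {1: "CBA", 2: "ACB"}
rarity = arrangement_rarity(norm_counts, subject_orders)
```

In this invented case the arrangement chosen for the first item is given by 1 per cent of the normative sample and would be flagged as rare, whereas the arrangement chosen for the second item is a common one.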

Summary 

The projective techniques have been 
used principally within the framework 
of interest in the study of individuality. 
Whether case by case in a clinical 
setting or by larger numbers of cases 
representing some group of special 
interest, the utility of the projective 
method has been its sensitivity to idio- 
syncratic features of personality. 

At the same time, other uses of the 
data of the projective instruments have 
arisen, which concern themselves with 
the exploration of specific variables of 
personality. Here the stress is not upon 
the full range of inferences derivable by 
skilled clinical study of the projective 
protocol. It is, rather, upon the con- 
tribution that limited and previously 
defined elements of the projective re- 
sponse can make to the study of a 
specific variable.

Work with the projective instruments 
in the future will undoubtedly follow 
both of these productive routes. In 
the first instance, the framework of 
individuality, the problems will con-
tinue to be those of the systematization
of interpretative procedures, the study
of the constructs in terms of which
interpretations are made, and the analy-
sis of the circumstances of use and
interpretation under which the validity
of inferences may be maximized. In the
second instance, the framework of spe- 
cific variables, the central problem, in 
addition to those above that the two 
approaches share, stems from the isola- 
tion of the variables studied from their 
context of other personality variables. 

Considerable inventiveness has been 
exercised, and is still possible, in the 
development of new projective instru- 
ments or in novel ways of employing 
more familiar ones. The more successful 
ventures, with such new instruments or 
novel methods of analysis, appear to be 
those wherein the choice of instru- 
ment and of its mode of analysis has 
been closely considered in terms of the 
personality variables it was desired to 
study. This implies, in part, that the 
choice of a presently available instru- 
ment should be made by prior study of 
its potentialities of revealing the desired 
data under the circumstances of subject 
and setting proposed. The various ap- 
proaches to the examination of specific 
variables also suggest that we need not 
presume that the concepts of analysis 
originally attributed to the instrument 
are necessarily the only terms in which 
it can be properly analyzed. New terms 
of analysis, however, more easily ap- 
plied to some techniques than to others, 
require the same scrutiny being given 
increasingly to the more traditional 
systems of analysis. Entirely new meth- 
ods, regardless of the objectives of their 
analysis, should be devised principally
with regard to their relevance to some 
body of theory or some personality con- 
structs possible of definition and study. 
This does not mean that new techniques 
should not be devised. It implies only 
that their value in the field will lie less 
in the inventiveness of their stimulus 
qualities than in their demonstrable 
usefulness in the study of personality 
concepts. 

The 12-year-old Democrat or Repub-
lican is often quite as fervent and sure
in his attitudes as the politician. An
8-year-old defending his religion or his 
father shows no doubts. As he feeds 
the stray puppy, houses the fallen bird, 
or protects the caterpillar on the side- 
walk from careless steps, the 6-year- 
old’s humane values are apparent. A 
child’s guilty feelings over telling a lie 
attest to his sense of wrongdoing. 

The pro’s and anti’s, should’s and 
should not’s of children are often very 
strong and unyielding. Side by side with 
these attitudes are surprising areas of 
amorality, undisturbed unawarenesses 
of adult prohibitions, values, and affects.
There are, also, vague, uncrystallized
attitudes, and there are uneasy and 
unsettled values as the child struggles 
with incompatible pressures and re- 
wards in a social environment of mixed 
values. 

The formation and inculcation of at- 
titudes and values constitute a critical 
dimension of child development and one 
closely linked with many other aspects 
of development—personality, intel- 
lectual and physical development, social 
behavior, etc. Neither theories nor 
data nor methods of investigation have 
succeeded fully in giving understanding 
of this field, although it is rich in 
thoughtful and fruitful research. 

Attitudes and values appear under 
many names and in varied contexts in 
research. Norms, sanctions, and mores 
of communities and cultures are in- 
vestigated in anthropological and 
sociological research. In the psycho- 
analytic framework values may be 
dealt with in terms of superego func- 
tioning. Learning theorists may absorb 
attitude into behavior theory as af- 
ferent-habit strength, efferent-habit 
strength, drive strength, etc. The educa- 
tor (secular or religious) is concerned 
with character education or conscience 
or human relations. The particular 
framework in which one views the con- 
cepts of attitudes and values influences 
to some degree the research questions 
one designs to answer as well as the 
methodological paths one chooses for 
investigation. There is great variety in 
techniques, representing fundamentally 
different approaches and allowing the 
investigator to exercise preference for 
“precision” or “depth,” statistical or 
clinical, or cultural or individual ap- 
proaches. 


PLAN OF CHAPTER


The methodological issues and the 
literature on attitudes and values are 
very extensive and specialized. In at- 
tempting to bring methodological 
knowledge into service of research on 
children, we have tried to avoid 
skimming widely and duplicating in 
dilute form the existing treatises on 
measurement methods. We have se- 
lected representative techniques in 
this field, evaluating them as to their 
particular applicability or disadvantages 
in work with children. When detailed 
discussions of given techniques exist in 
the literature or in other chapters of this 
book, reference is made to these sources. 

First, however, we wish to see re- 
search techniques in some perspective 
by considering the following: 


1. What are the concepts of attitudes 
and values with which we are dealing? 

2. What are the potential contribu- 
tions of research on children’s attitudes 
and values to basic developmental 
theory and to an empirical descriptive 
science of childhood? 

3. What are the ethical requirements 
in this research field? This is often an 
afterthought, a last paragraph of re- 
search design. That it deserves more 
than postscriptive recognition arises 
from the fact that one cannot divorce 
research approaches to children from 
fundamental assumptions about chil- 
dren, and, therefore, one cannot divorce 
research design from conscious or un- 
conscious research ethics. 


The power of various measurement 
techniques is then discussed in relation 
to the requirements of the research 
problem and research subject. We have
dealt with characteristics or dimensions
of measurement (such as verbal or
behavioral indices and disguised or
undisguised approaches) with illustra-
tion from specific techniques embody-
ing these characteristics.


ATTITUDES AND VALUES— 
CONCEPTS 


Some working concepts of attitudes 
and values are necessary if we are to 
discuss their measurement. Both are 
hypothetical constructs. In most general
terms, they are concepts concerning the 
individual's orientation toward aspects 
of his personal and impersonal environ- 
ment and toward himself. In many 
writings attitudes and values are used 
interchangeably; in many other writ- 
ings they are dealt with as distinct 
concepts. The concept of attitude is 
most frequently formulated in terms of 
a “state of readiness for motive arousal,” 
or a “readiness to act” in a given con- 
sistent manner toward a specified class 
of stimuli. Attitudes are further elab- 
orated as having cognitive, affective 
and conative components. Krech and 
Crutchfield (1948, p. 152) define at- 
titude as “an enduring organization of 
motivational, emotional, perceptual and 
cognitive processes with respect to some 
aspect of the individual’s world.” A 
great many discussions and reviews of 
this concept have appeared in the litera- 
ture (see, for example, Allport, 1935; 
Campbell, 1950; Green, 1954). 

The concept of values has been less 
discussed in psychology, but in its 
formulation there is reasonable con-
vergence from psychological, psycho-
analytic, and sociological points of view.
Lewin (1944, p. 14) speaks of values
as influencing and guiding behavior, de-
termining which types of behavior have
a positive or a negative valence, but
not having the character of a goal.
Similarly, psychoanalytic theory speaks
of values as internalized parents. From
the sociological point of view, Williams
(1958) describes a value as “any aspect
of a situation, event, or object that is
invested with a preferential interest as
being ‘good,’ ‘bad,’ ‘desirable,’” and the
like. “Values are not concrete goals of
action, but rather the criteria by which
goals are chosen.”

Perhaps it is useful to maintain a
distinction between the concepts of atti-
tude and value, with value involving a
dimension of acceptability or unac- 
ceptability, of should or should not, in 
the subject's or society’s frames of ref- 
erence. Like attitudes, values must be 
inferred from a consistency of responses 
to a given class of object or events. The 
assumption of consistency in the defini- 
tion of attitudes and values is in itself 
a complicated assumption. There may 
be manifestly different responses that 
represent a single consistent orientation 
toward a class of objects. There may 
be consistently similar responses to 
manifestly different objects or events 
that become unified by virtue of the 
attitude or value. As an example of the
first kind of consistency, a child’s at- 
titudes toward pets may be inferred 
from a variety of responses—the 
amount of time he spends caring for his 
pet, his choice of playmates who own 
pets, the route he follows in walking 
home from school, the books he reads, 
etc.; each represents an approach re-
sponse to a class of objects. As an
example of the second kind of con- 
sistency, a child’s attitudes regarding 
his family’s social status may lead him 
consistently to refuse gestures of friend- 
ship or help in many different cir- 
cumstances from many different chil- 
dren, all of whom he has classified as 
“outsiders.” 

The core characteristics of attitudes 
and values are then an involvement in 
the object (or principle or event), a 
consistency in orientation toward the 
object, and a potential effect on be- 
havior relating to that object. Within 
these definitions there is considerable 
latitude for differential conceptualiza- 
tion and differential emphasis in meas- 
urement. 

The conscientious investigator of 
children’s attitudes and values may not 
be satisfied with this brief discussion of 
concepts. He will want to know where 
and how these concepts fit into general 
psychological theory, how different 
theoretical positions (such as learning 
theory and cognitive and field theory) 
deal with attitudes and values, and what 
contributions have been made within 
these different frameworks. For in- 
stance, a treatment of attitude from a 
field theoretical point of view is to be 
found in Krech and Crutchfield (1948, 
p. 152). Doob (1947) puts forth a be-
havior theorist’s view in which he
predicts and hopes for the early demise
of the concept of attitudes. He takes the
position that “attitude” can be incor-
porated into general behavior theory,
“any ‘principles’ pertaining to attitude
are only special cases of psychological
theory” (p. 155). Green (1954), in
reviewing quantitative attitude measure-
ment, takes basically a statistical point
of view. Attitude is defined as “a 
latent variable whose meaning is derived 
from the covariance of a set of re- 
sponses.” 
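
The statistical definition of attitude as a latent variable inferred
from the covariance of responses can be made concrete with a small
simulation. The following Python sketch is an illustration only,
not Green's procedure: it assumes four hypothetical attitude items
that each equal one shared latent score plus independent noise, and
it summarizes their covariance with Cronbach's alpha, a standard
internal-consistency coefficient chosen here for convenience.

```python
import random
from statistics import mean


def covariance(x, y):
    """Sample covariance of two equal-length response lists."""
    mx, my = mean(x), mean(y)
    return sum((a - mx) * (b - my) for a, b in zip(x, y)) / (len(x) - 1)


def covariance_matrix(items):
    """Covariance of every item with every other item."""
    return [[covariance(a, b) for b in items] for a in items]


def cronbach_alpha(items):
    """Internal consistency computed from the item covariance matrix."""
    k = len(items)
    cov = covariance_matrix(items)
    item_variance = sum(cov[i][i] for i in range(k))
    total_variance = sum(sum(row) for row in cov)  # variance of the sum score
    return (k / (k - 1)) * (1 - item_variance / total_variance)


def simulate_items(n_children=500, n_items=4, noise=1.0, seed=0):
    """Hypothetical data: each item = shared latent attitude + noise."""
    rng = random.Random(seed)
    latent = [rng.gauss(0, 1) for _ in range(n_children)]
    return [[t + rng.gauss(0, noise) for t in latent]
            for _ in range(n_items)]


items = simulate_items()
cov = covariance_matrix(items)
alpha = cronbach_alpha(items)
```

In this simulated case the off-diagonal covariances are positive
only because the items share the latent score; the latent variable
is never observed directly and is known only through that
covariation.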


CONTRIBUTIONS OF STUDIES OF 
ATTITUDES AND VALUES 


To enable the reader to judge for 
himself the significance of attitude and 
value studies in child development, 
it may be helpful to delineate types 
of scientific objectives in the study 
of children and to localize ques- 
tions of attitudes and values within 
them. Grossly these objectives are 
three: (1) descriptive knowledge of 
children, (2) understanding of change 
that is developmental in the life cycle, 
and (3) prediction of response from 
knowledge of given stimulus conditions 
and conditions of the organism. De- 
scriptive research produces profiles of 
abstractions—what children of a given 
age, sex, or cultural group are like with 
respect to one characteristic or another. 
More truly developmental research is 
concerned with the detailed study of 
the processes of change in given func- 
tions, such as the development of a 
love response in the child, of cognitive 
processes, or of causal thinking. Re- 
search on prediction is concerned with 
the prediction of responses under 
current specified stimulus conditions 
or prediction of antecedent-consequent 
relations in a longitudinal, develop- 
mental sense, such as conditions in early 
childhood in relation to adolescent per- 
sonality. 

Where do questions of attitudes and 
values enter into these objectives? 
Clearly, attitudes and values are part of
the first objective, the normative and
descriptive data on children’s orienta-
tions toward their human and physical
environments and toward themselves.
Here the cognitive side of attitudes (es-
pecially for young children) becomes
very important. To what extent is the
child aware of various qualities of ex-
perience, and what kind of cognitive
structure exists for him to which he
responds affectively?

The weight of evidence points to the 
child as a very sensitive perceiver and 
at a very early age. These early sensi- 
tivities and beginnings of crystallized 
attitudes and values have been docu- 
mented most impressively in studies of 
young children’s awareness of social 
group mores and social prejudice.
Goodman's (1952) study of kinder-
garteners’ awareness of racial dif-
ferences and their preferences associated
with these differences demonstrates 
this point systematically, as do similar 
findings in other studies, such as Clark 
and Clark (1939), Horowitz (1936), 
and Radke, Trager, and Davis (1949). 

Attitude studies of children have 
flourished in this area of social aware- 
ness with considerable experimentation 
in techniques of measurement. In other 
areas of the child’s life similar data 
on attitudes and values are not so likely 
to be found. We know much less about 
children’s attitudes toward their par- 
ents’ authority and affection, toward 
socially defined children’s roles, and 
about children’s value systems with re- 
gard to morality, justice, altruism, etc. 

The descriptive and normative in- 
vestigations of children’s attitudes and 
values, when simple enumerations of age 
changes, may be relatively drab and
contribute little to a theory of devel-
opment or behavior. On the other hand,
such descriptions in the company of a
theory or such descriptive data as the
raw materials for the development of
concepts and theory are essential data.
Some investigators have taken the posi-
tion that the concept of developmental
stages in perceptual, cognitive, and mo-
tivational aspects of behavior is rela-
tively useless, that experiential factors
account for most of the variance. Fail-
ures to replicate Piaget’s findings on de-
velopmental changes in thought char-
acteristics have been cited as supportive
evidence (Sears, 1958). Present infor-
mation leaves us without a full answer,
and it is well worth investigating how
far in the life cycle the maturational
level is significant in accounting for
variance in attitudes and values.

Research on antecedent-consequent
relations may involve the study of at- 
titudes and values as responses and as 
intervening states. They are dealt with 
as responses in the following kinds of 
questions: Variations in attitudes and
values are resultants of what kinds of 
experience and what kinds of person- 
ality structure? What are the condi- 
tions under which a particular attitude 
is acquired? How does it change under 
varied training conditions? In deliber- 
ate cultivation of an attitude or value, 
how does one proceed? Still regarding 
attitude as a response, one may be in- 
terested in questions about the “envi- 
ronment” of the attitude within the 
individual. In what kind of dynamic 
relation to other attitudes and variables 
does a given attitude exist? What is the 
degree of situationality versus gener-
ality or consistency of the attitude, of
stability versus changeability? What is 
necessary to sustain the attitude or 
value? How close to behavior is the 
attitude and value? Many of the goals 
of child rearing and education are di- 
rected toward developing desired atti- 
tudes about authority, altruism, science, 
morality, etc. 

Attitudes and values enter research 
problems also as intervening states. The 
child is exposed to given experiences, 
let us say, to certain specified rearing 
practices of the parent; certain behav-
ioral characteristics of the child are
observed. These outcomes are not uni- 
form, however, from child to child. 
Couplings of parent practices and be- 
havioral outcomes are only in the 
vicinity of +.25 to +.35 correlations. 
Attention to intervening conditions, 
among them the child’s attitudes and 
values, may refine these relationships. 
What is experienced by the child? 
What is the nature of the cognitive 
framework in which he interprets the 
parent’s actions, the nature of the at- 
titudes and values brought into play 
by the parent’s behavior? 
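
The size of the couplings mentioned above can be put in
perspective by squaring the correlation, which gives the proportion
of variance the two measures share. The short Python sketch below
is offered only as an arithmetic illustration; the function name is
hypothetical. Correlations of .25 and .35 correspond to roughly 6
and 12 percent of shared variance, which leaves most of the
variation in behavioral outcomes to other conditions, intervening
states among them.

```python
def shared_variance(r):
    """Proportion of variance two measures share, given correlation r."""
    return r ** 2


# Range of parent-practice / child-behavior correlations cited in the text.
for r in (0.25, 0.35):
    share = shared_variance(r)
    print(f"r = {r:+.2f}  shared = {share:.4f}  unshared = {1 - share:.4f}")
```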

How may the intervening states mod- 
ify and complicate the antecedent-con- 
sequent couplings? For example, inter- 
nalized peer group values regarding de- 
fiance of adult authority may affect the 
adolescent’s response to his mother’s 
efforts at cultivating dependent be- 
havior in him. Children’s reactions to 
new persons and conditions can be pre- 
dicted only in part from knowledge of 
the new stimulus situation. The set of 
values regarding personality, status, etc., 
which the child brings into the new 
situation, can be strongly influential in 
determining his response. Generally,
“warm, democratic” adult leadership 
tends to be coupled with confident and 
secure relationships between child and 
adult. However, children whose cul- 
tural training devalues democratic pro- 
cedures, disdaining such signs of adult 
“weakness,” are likely to respond quite 
differently to democratic adults. Thus 
predictions or explanations of children’s 
behavior without regard to intervening 
attitudinal or value states suffer many 
errors. Incorporating attitudes and be- 
havior into explanatory schemes would 
seem to be a needed elaboration in 
research design and theory, even though 
resulting formulations will be less tidy. 

By having identified some of the
problems and objectives of research in 
which attitude and value measurement 
has a role, the breadth of measurement 
problems may perhaps have become 
more apparent and the issues of meas- 
urement more multisided than at first 
quick glance. With the goals of re- 
search partly codified, our next step is 
to consider the actual conduct of atti- 
tude and value research with children, 
as it affects children. 


RESEARCH WISDOM 
AND RESEARCH ETHICS 


Children are now much more in the 
“public” domain of researchers of many 
training backgrounds and interests than 
was the case in the first half of the 
century, when research on children 
came almost exclusively from depart- 
ments of child welfare and education. 
Also, children are now involved in a 
much greater volume of research. The 
need and obligation of investigators to 
acquire some understanding of children
before using them as research subjects 
assume, therefore, rather considerable 
importance. The child cannot be used 
naively as a substitute for the college 
sophomore or for the rat, for reasons 
methodological as well as ethical. The 
reasons we wish to discuss here are 
those that concern the cognitive and 
affective sides of the child’s research 
participation, the involvements most 
directly affected in research on atti-
tudes and values.

Usually a close relation exists be-
tween research wisdom and research
ethics in studying human behavior. This
is nowhere more evident than in work
with children. The essence of research
ethics is consideration for the research 
participant. Maintaining confidentiality 
of data and exercising precautions to 
prevent harmful effects of data gather- 
ing and reporting are the essential ele- 
ments. Beyond this core of agreement, 
opinions and ethical practices vary. 

By definition, attitudes and values
are affect-laden, and to them may be 
attached rewards and punishments of 
“conscience” and society. They are 
part of the “private” world of the in-
dividual, to reveal or conceal as he sees
fit; therefore, by probing into them in 
research, one is immediately confronted 
with something of an ethical dilemma. 
However, on many issues subjects of 
our research-wise culture are not hesi- 
tant about laying bare their attitudes. 
The subject’s permission is usually 
recognized as his to give or withhold 
before research is done. With children, 
this permission-giving power is usually 
vested in the parent or parent-surro- 
gate, leaving to the child a much smaller 
margin of power, that of being coop-
erative or not, once he is in the re-
search. One can readily feel a legitimacy
in this procedure while at the same
time also feeling that the adult’s right
to decide must not be the sole guide in
deciding a child’s participation. The
obligation remains of granting some
degree of self-determination on the
part of the child. His developmental
level and the problem under investiga-
tion may be guides in weighing research
hazards to the subject and deciding re-
search policy in this regard.

The child’s willingness to participate
and his understanding of what he is
asked to participate in should be looked
at together in considering ethics and 
procedures. Concealment of research 
purpose—by stating a different research 
objective from the real one, by partly 
disguising the objective, by giving no 
explanation, by completely concealing 
the fact that research is going on—has 
become very nearly conventional in ex- 
perimental approaches, testing, inter- 
viewing, and observations. The pri- 
mary defense for this practice is, of 
course, that without camouflage many 
problems could not be subjected to 
systematic scrutiny. For example, the 
subject’s self-conscious awareness of the 
research purpose would seriously affect 
studies of “influence” factors in atti- 
tude change, studies of “negative” at- 
titudes and values, studies of attitudes 
toward power figures, and so on. With 
older children, our explanations of ob- 
jectives (true or camouflaged) approxi- 
mate those used with adults. In one 
sense or another, the subjects are in-
formed that it is research. In many in-
stances they accept the fact that they
do not know the specific purpose, but
they are willing to “go along” or to 
wait until after it is over to be told its 
significance. With younger children, 
we have been quite uniform in disguis- 
ing research as “play.” 

When the older child and adult are 
taken into confidence to the degree 
that “this is research on such and such,” 
they have, most likely, some concept 
of research. Its prestige and its stereo-
types will vary. Whether the research
worker is believed to be “psychoana- 
lyzing us,” is referred to somewhat ap- 
prehensively, somewhat hostilely as a 
“head-shrinker,” or is regarded as a 
“scientist” is probably of some impor- 
tance in the subject’s response. These 
several meanings of research filter down 
the age scale, so that it is not surprising 
to have a 10-year-old observe: “You're 
doing research on us. I know because 
back at school a man came around 
getting us to tell stories about some 
pictures he had. Do you know what 
research he was doing? What is your 
research about?” Often simple, frank, 
and satisfying explanations can be 
given. Thus in a study of children’s 
attitudes and behavior in an interracial 
camp this 10-year-old was told: “We 
are interested in learning more about 
children’s camps, what children do to- 
gether that they like or don’t like, how 
they learn to know other children, and 
so on.” 

But below some age or intellectual 
level there is no concept of “research.” 
Here one should consider most care- 
fully the child’s reactions to the re- 
search. One should ask oneself whether 
the content of the research task is con- 
gruent with a “play” explanation. When 
interview questions or projective play
devices tap attitudes regarding ego-in- 
volving, possibly painful experiences, 
the child expecting to play with the 
investigator may feel “sold” or tricked. 

The child’s “seduction” into partici- 
pation in research may take something 
of the following pattern. An adult, of- 
ten a stranger, arrives at the nursery 
school. The teacher may invite the
child to leave his play to join the adult 
for “more” games. The two leave the 
room and go to a room where they are 
alone. A brief “rapport” period follows 
in which the adult may converse with 
the child in a friendly way or may give 
him toys with which to play. Then, 
sometime later, other gadgets (the re- 
search toys, pictures, dolls) are brought 
out, and the child is requested to shift 
to them. Through the toys or purely 
in verbal terms, the investigator por- 
trays various human relationships and 
interactions, to which the child is asked 
to respond. (For example, a picture or
story may depict a child being disci- 
plined by a teacher, a parent giving his 
child a present, several children fight- 
ing.) Just how the child feels about the 
revelations he is asked to make or what 
meanings their communication has for 
him often go unnoticed by the re- 
searcher interested in some specific 
variable of response. 

A responsibility for knowing the im- 
pact of the research experience on the 
child should be part of any investiga- 
tor’s task. Although we know that this 
is not fully possible, it is possible to 
build into one’s procedures some pro- 
visions for evaluating the impact. 
Planned alertness to the spontaneous 
comments, gestures, and postures of the 
child can furnish information. Such
cues are illustrated in a kindergartener’s
reactions to telling stories about am-
biguously structured pictures. She drew
a deep breath at the presentation of a
picture of white and Negro children
together on a playground, and com-
mented gravely, “This is getting seri-
ous.” From another child came a ques-
tion after some story telling, “How
did Jeanie do on this test?” And from
another, “Did I do it right?” (An ex-
planation of “play” in this instance had
not changed the expectation of “test.”)

Despite the limitations of the “play”
approach, it must also be said that “it 
works” in many situations. Its con- 
gruence with task is perhaps the best 
guide to the reasonableness of its use. 
There are other alternatives that can 
better be used when content is not
playlike. An investigator’s explanation 
to children that he is writing a book on 
what children think about such and 
such or that he is studying how chil-
dren solve problems are examples of
plausible purposes in line with the re- 
search objectives, yet do not necessarily 
interfere with the process under study. 
The feasibility and value of taking child 
subjects into full confidence should not 
be completely overlooked. This was 
done in a study in which preadolescent 
children were seen in small friendship 
groups by group workers in a neigh- 
borhood house. The children were in- 
vited to come together to talk about 
intergroup problems, as they experi- 
enced them in their school and neigh- 
borhood, to help the worker who was 
interested in studying these problems 
and writing about them in a book on 
children. 


Hand in hand with the meanings that
the researcher tries explicitly to give to
the child are the subjective meanings to
the child. We know that the investiga-
tor of attitudes and values may bring
the child uncomfortably close to dis-
tressing areas of his life (conflicts with
authority figures, guilts, painful prob-
lems of awareness). The investiga-
tor often faces the hard reality that to
some degree and for some children his
research may contribute to a child’s
problems. This may come about as a
result of having brought latent feelings
to the surface through the research by
direct questions or by indirect tech-
niques. (The possibility exists, too, that
the child’s verbalization and playing out
of feelings in the research may have a
therapeutic effect.) Also, research may
create conflicts where none existed be-
fore. For example, when investigating
values regarding honesty, the researcher
may devise situations in which the child
is motivated to receive certain rewards
or to avoid failures. In order to do so, 
dishonest behavior may be involved. 
The investigator may have a hard time 
deciding how seriously the child will 
regard his own transgressions. Are 
the experimental techniques justified? 
Should there be some built-in “repair” 
procedures as part of the research to 
handle possible traumata? 

The child’s welfare sometimes enters 
research in another form. The investi- 
gator may suddenly find himself with 
a personal responsibility, which comes 
from privileged knowledge of the child 
or adolescent. Through the child’s rev- 
elations in the research, clinically and 
socially serious attitudes are revealed. 
To divulge them to the parent, teacher, 
or therapist, from whom help can come,
is to violate the confidentiality prom- 
ised the child. Not to reveal them may 
leave remediable conditions unreme- 
died. When data are gathered in child- 
hood and adolescence for follow-up 
study in later years of childhood or 
adulthood, the carry-over of the child’s 
expressed attitudes into the future may 
have consequences for the individual in 
cases in which the attitude has particu- 
lar social or personal relevance. 

These methodological-ethical issues 
pose difficult questions. To what extent 
should the researcher restrict himself to 
the use of methods that keep to a mini- 
mum disturbing responses in children, 
in spite of the fact that other methods 
might be more effective and produc- 
tive? To what extent is the investigator 
morally responsible for helping a child 
to restore equilibrium after eliciting 
from him emotional reactions associ- 
ated with some of his attitudes and 
values? The specific instrument used, 
the context in which the research is 
done, the investigator characteristics, 
and the child characteristics together 
determine the impact of a given pro- 
cedure and help to determine answers 
to these questions. No research is justi- 
fiably undertaken on children’s attitudes 
and values without consideration of its 
effects. In part, the answer to what is 
ethical in research lies in what the in- 
vestigator is equipped to understand 
about children and what he is equipped 
to do by way of “treatment” in the 
event that some is necessary. 

Although we have stressed the problematic
aspects of attitude research with children,
the investigator should not be frightened
away from it. The moral is not to 
blunder into children’s feelings and 
conflicts and private worlds. Knowl- 
edge of children, although it appears 
self-evident, is critically important in 
designing and conducting research. Re- 
search errors of misinterpretation and 
misjudgment of the child often arise 
because of a lack of this knowledge, 
thereby rendering research less efficient 
and mildly or seriously trampling on 
the rights of the child. Errors tend to 
be of two sorts—either making the 
same assumptions for the child as for 
the adult or far underestimating the 
child’s sensitivities and capabilities. 
Areas of error involve the relationship 
of investigator to child, the thought 
processes and verbal facilities of chil- 
dren, and the research-induced moti- 
vations used for children. We return 
to these issues in the discussion of re- 
search techniques. 


THE CHOICE 
OF MEASUREMENT TECHNIQUES 


How does one decide on a technique 
for a given problem of attitude meas- 
urement with children? How does one 
adapt or develop a technique for par- 
ticular research subjects? At least part 
of the answer is suggested in the ques- 
tions. Choice must be in terms of the 
specific problems and research subjects. 
Techniques cannot be rank-ordered for 
effectiveness or validity in vacuo. What 
is a very good approach in one prob- 
lem may be a blundering approach in 
another. We shall see some examples 
later on. To bring research problems 
and techniques into happy union, we 
must know what the problem asks or 
requires of measurement and what the 
techniques can yield: what assumptions 
are made in a given technique, what 
modifying or limiting factors are 
introduced by the particular child 
subjects and the particular research 
content. 


Both attitudes and values involve the 
idea of a continuum, from positive to 
negative, right to wrong, or valued to 
devalued. Measurement in terms of a 
quantitative scale along such a continuum 
is, therefore, a reasonable expectation. 
But this information is not 
always possible or sufficient. Often, 
measures sensitive to the “process” and 
“organization” of attitudes and values 
are desired, as opposed to the net product 
of affect or to a point on a continuum. 
Therefore, many of the procedures 
for studying attitudes and values 
are not measurement in the strict quantitative 
sense of the word. 

A bird’s-eye view of the varieties of 
approaches can be achieved by organizing 
stimulus and response characteristics 
of measurement procedures into 
a conventional classificatory scheme, 
such as that in Table 16-1. In selecting 
or developing a technique, the investigator 
will choose between verbal or 
behavioral indices, undisguised or disguised 
approaches, single or multiple 
indices, manifest or latent meaning 
analyses, contrived or natural situations, 
and clinical or statistical approaches. 
These procedural properties 
are obviously not exclusive to attitude 
measurement, and many of the pros 
and cons for their use with children, 
which have been discussed in other 
chapters (Chapters 14 and 15), apply 
to attitudes and values as well. 

TABLE 16-1. APPROACHES TO THE MEASUREMENT OF ATTITUDES AND VALUES 


Verbal Stimuli 

Variations in Stimulus Situations 

Undisguised purpose or disguised purpose questions 
    What do you like about school? 
    Would you describe a typical school day? 

Open ended or structured-alternative questions 
    If a teacher finds that a pupil has stolen money from her 
    desk, what should she do with the pupil? 
    If a teacher . . . , which of the following do you think she 
    should do? Talk to him and find out why he did it? Scold him 
    in front of the class? Send him home with a note 

Single questions, series of questions, attitude tests, attitude 
scales or interviews 

Written forms or oral forms 

Personal terms or impersonal terms 
    What do you think you should do? 
    What should a child do? 

Standardized questions or clinically, individually oriented 
questions 

Hypothetical or story situations or real-life situations 
    Which of the following would you be willing to share with a 
    Russian child . . . a German child . . . etc.? Your home, your 
    playground, your school room, etc.? 
    Which of the children in your class do you like the very 
    best? Which next best? Next best? Which least of all? 

Incomplete stimulus situations: sentences or stories 
    A mother is a person who . . . 

Variations in Response Situations 

Responses may be oral or written. Alternative responses are 
presented; subject indicates his choice. Subject responds freely 
in his own terms. Free responses may be guided along given lines 
to elicit certain aspects of attitude or value, such as 
emphasizing subject’s feelings, the cognitive content of his 
attitudes, or his mode of resolving conflicting value 
alternatives. 


Verbal Stimuli Plus “Props” 

Variations in Stimulus Situations 

Pictorial material accompanied by question probes 
    What are the boys doing in this picture? 

Doll material accompanied by question probes 
    What is this mommy doll going to do? 

Variations in Response Situations 

Verbal responses as above. Behavioral responses may require 
manipulation of pictorial and play materials. Behavioral 
responses may be confined to alternative choice or may be freely 
interpretive. 


Behavioral Settings 

Variations in Stimulus Situations 

Contrived social situations and problem-solving situations 
    Mother and children brought to experimental playroom to wait 
    for “detained” experimenter 
    Children’s honesty tested in classroom situations in which 
    cheating is possible 

Variations in Response Situations 

Subject may be permitted to behave only in structured alternative 
terms. Subject behaves “naturally” in the situations. Subject is 
asked to report retrospectively on his behavior. 


VERBAL INDICES IN ATTITUDE 
AND VALUE MEASUREMENT 


Most research indices of attitudes 
and values are verbal. The verbal ex- 
pression of one’s attitudes (and values) 
calls for introspection on the part of 
the subject, requiring him to call up 
within himself a cognitive organization 
of a given object or class of objects 
or set of events and to express some 
preference, inclination, or interpreta- 
tion with regard to it. A comparison 
of objects (ranking, expressing degrees 
of affective intensity, etc.) may be in- 
volved. How well able the child is to 
do this will depend upon his age, men- 
tal level, and experience, the nature of 
the task required, and the nature of 
the testing situation. Cognitive limitations, 
limitations in verbal comprehension 
and facility, and characteristics of 
thought in early childhood set the 
lower limits, developmentally, at which 
one can expect the awareness and the 
power of generalization necessary for 
attitudes or values. Characteristics that 
most directly affect the study of attitudes 
and values in young children are 
these: 


1. The young child has a narrow ex- 
perience range. Often the very com- 
mon content about which we want 
to study attitudes may not be a 
part of the child’s experienced world 
or may have very idiosyncratic mean- 
ing for him. Although he may oblige 
a friendly investigator by responding 
about things that are largely unknowns 
to him, it would be in error to ascribe 
attitudes to him. 

2. His ties to the concrete and the 
immediate in his experience and his 
difficulty in dealing with the abstract 
and remote confine the child, too, in 
handling verbal-test situations. Further 
difficulties derive from his inability to 
handle adequately part-whole relations, 
inclusiveness of membership in a class 
of objects or events, and self-other comparisons. 
Such abstractions, however, 
are often part of attitude measurement. 

3. We ordinarily assume a clear differentiation 
in the adult between the 
real and the fantasied. In the young 
child’s thinking, these boundaries are 
less distinct; the real, the imagined, the 
wished for, the wished away may not 
always be distinguishable. In measurement 
this fact is often lost sight of. In 
fact, in many of our indirect question 
approaches, hypothetical situations, or 
doll-play techniques, we toy with the 
irreal, interpreting it now as fantasy, 
now as real. That our intention may 
not always coincide with the plane on 
which the child is responding injects a 
source of difficulty into verbal measurements. 

4. Language skills, the child’s un- 
derstanding, and his ability to express 
himself are critical in attitude re- 
search. Language barriers as well as 
experience barriers may confound the 
child when the experimenter asks him 
for a choice, an interpretation, or an 
expression of feeling. It is not always 
easy to distinguish which it is. Errors 
or peculiarities in the meaning of words 
for young children have contributed 
often delightful, sometimes pathetic lit- 
tle dramas in child-adult interactions 
and misunderstandings. In verbal inter- 
actions for research purposes, unless one 
is tuned to recognizing these errors or 
peculiarities, one can be led to errone- 
ous conclusions. For example, in a study 
of kindergarteners’ attitudes toward 
different religious groups (Radke et 
al., 1949), some children, though fa- 
miliar with the labels, Catholic and 
Jewish, had not attached religious con- 
notations to them but rather specific 
meanings growing out of specific con- 
tacts and experiences. (Catholic meant 
another school in the neighborhood at 
which children got spanked if they 
were bad. Jewish meant pickles.) 
Clearly, the feelings expressed by these 
children toward the verbal labels have 
not the same meaning as feelings ex- 
pressed by children for whom “Catho- 
lic” represents a more abstract class of 
membership and belief. Whether early 
feelings attached to these labels will 
generalize as the labels come to stand 
for groups of people is a question need- 
ing further study. Research concerned 
with the life history of attitudes or 
values needs to be concerned with be- 
ginnings such as these, with the way in 
which early fragments of association 
affect subsequent incoming experiences 
and the ways in which they become 
organized into attitudes. 

Although language skills of young 
children are severely limiting for re- 
search, the child’s ability to communi- 
cate feelings through the limited lan- 
guage he possesses is also notable. With- 
out logical and coherent replies to re- 
search stimuli, the preschool child is 
often well able, with a few choice 
words, to convey the emotional sig- 
nificance of the stimulus situation. Also, 
when the content or context of the 
research is in the familiar world of the 
child, he may be remarkably perceptive 
in the variables being studied. 

The characteristics of language and 
thought that have been noted affect 
research most when one attempts atti- 
tude studies with children of ages be- 
low 6 or 7, but careful assessment of 
similar problems in later years of child- 
hood is not to be overlooked. There 
is a tendency to sigh with relief once 
early childhood is past and to assume 
that now the child can think and ex- 
press himself essentially like an adult. 
This may be true; but cognitive and 
thought processes of school-age chil- 
dren have not received the same care- 
ful scrutiny (except for a study by In- 
helder and Piaget, 1958) that has been 
given to the preschool child. 

A study by Meltzer (1926) demon- 
strates nicely how far from the under- 
standing of many children (10 years 
and older) are many social concepts 
(democracy, capitalism, etc.). How 
careful then must the investigator be 
when his investigations are concerned 
with abstractions! 

We know that thought and commu- 
nication are closely bound up with all 
aspects of development and experience, 
and what we elicit as attitudes or values 
at any age should be interpreted always 
in the light of what is known about 
intellectual and motivational character- 
istics or preoccupations at given child- 
hood periods and of children of given 
backgrounds. Several examples illus- 
trate this point. Ideas about and use 
of rules change over the childhood 
years. This can have a bearing on attitude 
studies. An absolutist of 8, when 
absolutisms tend to be frequent and 
widespread, is to be judged quite differently 
from the 15- or 16-year-old 
with similar absolutist attitudes. The 
diagnostic personality significance of 
such attitudes is probably much less at 
8 years of age than in later adolescence. 

In the changing nature of make-believe, 
imagination, and ideals through 
childhood we have another example of 
the influence of developmental features 
on research. Children of different ages 
may interpret the same research 
within very different frameworks. 
Make-believe themes play a prominent 
and public role in the language and play 
of the preschool age; later, fantasies and 
wishes are more likely to be covert and 
private and more separated from reality. 
Lewin (1935) observes that ideologies 
may be extremely forceful during adolescence 
and not readily distinguishable 
from what is “real.” These differences 
inject another shading into attitude re- 
search with children at different ages. 
Since inferences regarding attitudes are 
frequently drawn from the child’s re- 
sponses to questions couched in “what 
if,” “make believe” terms, the stimulus 
value of “what if” is of considerable 
importance. 


DIMENSIONS OF VARIATION 
IN VERBAL ATTITUDE 
TECHNIQUES 


A very long and hopelessly specific 
account of instruments of measurement 
would be required to cover all the va- 
riations of techniques that have been 
employed with children. Such an ap- 
proach is impractical as well as unlikely 
to come up with general principles that 
can be applied in further research. 
Techniques can be sorted, however, on 
a number of dimensions: degree of disguise 
or openness, degree of structuring 
given, whether the task framework is 
hypothetical or real, and whether it is 
personalized or impersonalized. By 
evaluating some of the virtues and hazards 
in child research along dimensions 
such as these, an investigator is in a 
better position to appraise a given tech- 
nique for a given research problem. 


DISGUISED 
OR UNDISGUISED APPROACHES 


Direct undisguised questions in daily 
interaction provide most of us with 
a sizable portion of our information 
about the attitudes and values of per- 
sons around us. The predictions we 
make from the responses we receive 
turn out quite well, and we learn to 
count on them in determining our be- 
havior. There are some areas of atti- 
tudes and values about which we would 
never think of asking a direct question 
of our daily associates, yet we make 
inferences, too, about them. Surpris- 
ingly, these frequently appear to be 
valid, particularly when we are draw- 
ing on a large and varied sample of the 
behavior of the individual in question. 
In research our sample of the individual 
is usually a very limited one and our 
relationship very specialized. Within 
these severe limits, how well do vari- 
ous approaches succeed? What are the 
relative merits of disguised and undis- 
guised approaches? 

One of the constants in research with 
children, which enters into the evaluation 
of disguised and undisguised methods, 
is the authority role of the adult. 
The kind of authority varies, it is true, 
but cultural prescriptions regarding the 
child’s role with respect to the adult 
(even the child’s defiance of the adult) 
are significant ingredients of all research 
situations. 

The decision to use an undisguised 
approach relies on the child’s readiness 
to reveal his attitudes to an adult. It 
assumes that he will not be inhibited 
in expression by an authority figure, 
and, also, that he will not be motivated 
to respond according to his conceptions 
of adult norms. For many attitudes, 
these assumptions are reasonably met, 
especially if the research task is set up 
so as not to be a matter of giving “cor- 
rect” responses and if the adult stands 
in no unique role with respect to the 
content in question (such as court 
officer interviewing a delinquent). At- 
titudes and values concerning foods, 
friends, competition, parental disciplin- 
ary techniques, politics, religion, etc., 
may be revealed freely by the child 
under good testing circumstances. He 
may welcome a permissive situation in 
which to voice attitudes troubling him 
or significant to him. 

On the other hand, the child’s keen 
cue seeking regarding the adult's ex- 
pectation, his greater suggestibility and, 
at times, his possessive privacy can introduce 
difficulties with direct questions. 
An episode from a play interview 
with a 5-year-old illustrates the 
difficulty with suggestibility. 
The child built an enclosure out 
of large blocks. The examiner asked, 
“Would you like to be in this all by 
yourself?” “Yes,” replied the child. 
“Would you like your daddy with 
you?” “Yes.” “Would you like your 
mommy in there with you?” “Yes.” 
“Would you like to be in this all 
alone?” “Yes,” this time with emphatic 
agreement. 

Directness and disguise with children 
are matters of degree and cannot really 
be discussed as two opposite alterna- 
tives. The most open approach might 
be illustrated by: “How do you feel 
about the kinds of punishments your 
parents use with you?” “Do you like 
to play with Negro children?” Here 
both the investigator’s objective and the 
question form are direct. This approach 
is characteristic of many attitude tests 
and questionnaires. The objective may 
be somewhat more concealed, though 
the questions may be direct and appar- 
ently undisguised. Thus, having knowl- 
edge of the parent's actual disciplinary 
practices, the investigator may ask: 
“Here are different kinds of punish- 
ments that parents use. Which do you 
think is the fairest?” “Which next?” 
And so on. Or, in studying sex prefer- 
ences the investigator may ask each 
member of a classroom of boys and 
girls: “Which children do you want 
as your friends? Which children do 
you not want as friends?” 

There are many studies in which the 
topic of the research (attitudes toward 
teachers, toward sex, toward ethnic 
groups) is not really concealed but is 
left “unspoken.” The respondent may 
only gradually acquire insight into the 
area being investigated, as his own re- 
sponses make it increasingly explicit. 
An example is a study by Biber and 
Lewis (1949), in which they investi- 
gated first- and second-grade children’s 
expectations of their teachers. Familiar 
schoolroom situations are presented in 
pictures. Each picture is explained by 
the interviewer, and questions are asked 
the child. 


Oh look at this little girl. She’s crying 
—see her tears? She’s telling the teacher 
that she has lost the new bracelet she got 
for her birthday. What do you think the 
teacher is saying to her? 


The children are told this is a “game 
of looking at pictures.” There are 13 
such presentations, each involving 
relationships with the teacher and life in 
school. Studies of attitudes of young 
children toward ethnic groups have a 
similar pattern. Thus a series of photographs 
or pictured interactions or both 
may be presented. Skin-color differences 
appear as part of each picture 
stimulus and sometimes in the investigators’ 
questions. (Horowitz, 1936; 
Clark and Clark, 1939; Goodman, 1952; 
Radke, Trager, and Davis, 1949; 
Ammons, 1950.) 

It seems likely that the respondent 
acquires some level of insight into the 
purposes of these “games,” “stories,” 
etc., as they touch repeatedly on given 
themes. The investigators, of course, do 
not necessarily assume complete or lasting 
disguise. This procedure may be 
used quite consciously as a step-by-step 
procedure for gaining access to certain 
areas of attitudes, less accessible through 
direct questioning. Whether the child, 
after recognizing the investigator’s purposes, 
goes along undisturbed or becomes 
upset or angry depends on many 
other characteristics of the testing situation 
(see pp. 668-670), especially on the 
relationship or rapport between child 
and investigator. If complete disguise is 
an important objective, techniques of 
real subtlety are required. Probably only 
careful try-outs with children can establish 
the completeness of disguise. 
Situations relying on behavioral indices 
are sometimes more successful than 
verbal techniques in achieving thorough 
concealment, for example, in using 
participant observational techniques 
(see pp. 678-682). 

Whether open and direct techniques 
can be used more successfully with 
children than with adults cannot be 
answered categorically. Suggestibility 
and orientation to adult standards are 
on the debit side, but fewer defenses 
and rationalizations in many areas are 
on the credit side. With adults, any 
probes into their fantasies, wishes, and 
dreams are likely to be regarded suspiciously. 
Children who have not 
learned of the latent significance in 
these responses are less reluctant to reveal 
them on direct request. 


STRUCTURED 
AND UNSTRUCTURED 
TECHNIQUES 


Degree of structure and degree of 
disguise in a technique often vary in- 
versely, the more highly structured, the 
less disguised. All combinations of these 
dimensions are possible, however. The 
highly structured technique presents an 
organized stimulus, one “complete” in- 
sofar as conveying a common meaning 
to subjects. The unstructured stimulus 
situation, in contrast, intentionally 
leaves much of the process of organi- 
zation, of giving of meaning, to the 
subject. It is “ambiguous,” vague, or 
incomplete. The response that is called 
for in either stimulus situation may be 
highly structured, a choice of given 
alternatives, or relatively unstructured, 
a “free” response to the stimulus. The 
tasks presented by these variations 
make quite different demands on the 
child. Both can capitalize on certain 
psychological characteristics of chil- 
dren, and both can err by the same 
token. Before discussing the possibili- 
ties in these alternative approaches, we 
shall look at some of the concrete vari- 
ations that have been employed with 
children. 

Illustrative of a highly structured 
technique is a study by Eberhart (1942) 
of children’s values regarding property 
rights. Pairs of statements are presented, 
and the child is asked to indicate which 
is the more serious of the two. (“To 
borrow your brother’s baseball without 
asking or to swipe flowers from a 
park?”) Boys from grades 1 to 12 were 
tested. With children through grade 3, 
the offenses were pictured and the test 
administered individually. 
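
The tallying implied by a paired-comparison task of this kind can be sketched in a few lines of Python. This is an illustration under stated assumptions only, not Eberhart's scoring procedure: the function name and the data layout are hypothetical, and every pair of items is assumed to have been presented exactly once.

```python
from collections import Counter
from itertools import combinations


def rank_by_paired_choices(items, choices):
    """Order items by how often each was judged the more serious of a pair.

    `choices` maps each unordered pair (a frozenset of two items) to the
    item the child picked as more serious. Hypothetical layout, chosen
    for illustration.
    """
    wins = Counter({item: 0 for item in items})
    for a, b in combinations(items, 2):
        picked = choices[frozenset((a, b))]
        if picked not in (a, b):
            raise ValueError(f"choice {picked!r} is not in pair ({a!r}, {b!r})")
        wins[picked] += 1
    # Most often chosen first; ties keep the order in which items were given.
    ranking = sorted(items, key=lambda item: -wins[item])
    return ranking, dict(wins)
```

Called with three offenses and the child's three choices, the function returns the offenses ordered from most to least often judged serious, together with the raw tallies. A tally of this sort says nothing about how firmly the child holds the judgment, which is the limitation of structured responses taken up later in this section.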

Any number of examples from ques- 
tionnaire and interview items illustrate 
this highly structured technique, which 
requires on the part of the child a yes- 
no response or an indication of degree 
of agreement or a choice of multiple 
alternatives. 


How do you feel about parents knowing 
what teenagers talk about? (Check 
one.) 

a. I think parents have every right to 
know. 

b. I think parents should feel included 
sometimes but not expect to be in on 
all the secrets. 

c. I think that what teenagers talk about 
should be their own business, if they 
want to keep it private. 


In another variation of the structured 
approach the child is given a list of 
words (cousin, roommate, neighbor, 
etc.) heading columns opposite a list of 
races and nationalities. The child is in- 
structed to write “yes” or “no” to show 
his willingness or unwillingness to have 
the relationship indicated with a mem- 
ber of the groups listed (Zeligs, 1948). 

A picture technique may be equally 
highly structured. Temple and Amen 
(1944) presented a series of pictures 
with a central child figure in each. The 
child is shown in circumstances such as 
eating alone and watching his father 
playing with a younger child. The child 
is given two paper faces, a happy one 
and a sad one, and asked to choose one 
to put on the central child in the pic- 
ture. Following his choice, the child 
is asked his reasons. This combination 
of structured stimulus with unstruc- 
tured response occurs frequently in 
studies of children, in interviews, in 
questionnaires, and in picture and doll- 
play techniques. Further illustrations of 
this method follow. In interviews with 
children (10 to 15 years of age) on 
attitudes toward different cultures 
(Frenkel-Brunswik and Havel, 1953) 
questions such as “What is an Ameri- 
can?” and “Why don’t people like 
Mexicans?” were used. The questions 
are direct, undisguised, structured. The 
child is expected to organize the re- 
sponse. In other cases an episode of be- 
havior is described in specific detail 
(verbally or pictorially), such as a child 
being scolded by his mother because 
he has failed in school. The child is 
called upon to give his reactions, usu- 
ally aided by probes: “How does he 
feel about the punishment, etc.?” “This 
little boy isn’t feeling well. He’s com- 
ing up to the teacher and telling her he 
has a stomachache. What do you think 
the teacher is saying to him?” 

From the highly structured stimulus, 
one can move gradually toward less 
and less structure, toward requiring the 
subject to impose more and more of his 
own organization and meaning upon 
the stimulus. Instead of a picture of 
mother spanking child, mother and 
child’s intentions may be suggested only 
by facial expressions or gestures. Even 
fewer clear cues may appear in the 
picture; the child’s face may now be 
turned away and the mother’s expression 
may tell unclearly of anger or 
worry or fear. The interpretation is 
up to the child. 

Many studies with children use approaches 
somewhere in the middle 
range on this continuum. Pictures used 
by Trager and Yarrow (1952) in studying 
attitudes of 5-to-8-year-old children 
portray children at play, with one 
child possibly seen as being on the sidelines. 
Racial differences are suggested 
in the drawings. The child is first asked 
an open-ended question, “Tell me about 
the picture.” Increasingly focused questions 
follow, “Tell me about this little 
boy (child on sidelines). . . . He’s not 
playing. Why isn’t he playing?” And 
so on. 

The free-association test used by 
Meltzer (1943) with children 9 years 
and older to get at their attitudes toward 
parents falls in this intermediate 
group. He introduces his procedure as 
a game in “loud thinking.” He presents 
a word and the child is asked to “shout 
out” the first ten things he thinks of. 
Names of people like Washington and 
Lincoln are given, followed by “your 
mother” and “your father.” 

With children in the upper grades, 
written tasks of this semistructured 
variety have been developed. One of 
the most revealing in the study of chil- 
dren’s values (Kalhorn, 1944) is a sim- 
ple procedure in which the child is 
asked, 


What could a child be doing at home 
that is a fine thing to do and someone is 
praising him? What could a child be doing 
at home that he should not do, and 
someone is scolding him? 


This is repeated with the addition of 
“a very fine thing and someone is prais- 
ing him very much” and, similarly, 
“something he should never do and 
someone is scolding him very hard.” 
Each time he is asked who is praising 
or scolding. With fourth to eighth 
graders this approach brought out 
value differences and differences in the 
sources of values between Mennonite 
children and non-Mennonite children. 

The incomplete sentence technique, 
which can be graded from the semi- 
structured (“I feel like cheating 
when .. .”) to the most incomplete 
provocation for response (“Most 
girls . . .” “My mother . . .”), also 
appears in attitude studies (Harris and 
Tseng, 1957). This form of the tech- 
nique and certain applications of doll 
play, in which the manipulation of the 
materials and the child’s verbalizations 
are relatively undirected, represent 
minimally structured approaches to at- 
titudes and values. They are less likely 
to be used in investigations of attitudes 
and values than in studies of personality 
and motivation, since a very large sam- 
pling of responses would be required 
to elicit data relevant to the specific at- 
titude or value. 

The virtues and failings of structured 
and unstructured approaches are discussed 
in the literature from varying 
points of view (Campbell, 1950; Deri 
et al., 1948; see also chapters in this 
book on interviewing and projective 
techniques). There are certain issues 
particularly pertinent to research with 
children. Attention to developmental 
characteristics of limitations in language 
and thought processes is particularly 
important in certain structured tech- 
niques. Some make minimal demands 
on the expressive verbal facility of the 
child and in this sense they are highly 
suitable for work with children. These 
are techniques requiring the choice of 
alternative responses, agree to disagree, 
like to dislike, yes or no, and similar 
categories. In reality, these techniques 
require, on the part of the children, 
very attentive responses and a sifting 
of associations and inclinations in order 
to reach a resolution in the form of one 
of the presented alternatives. This is a 
complex process for an easy-answer 
product. It should be remembered that 
the process as well as the product must 
be applicable to the child subject. When 
using such techniques, in which a sim- 
ple model of response is given, namely, 
choosing one of a standard set of alter- 
natives, one must be extremely careful 
to keep the child’s attention on the 
stimulus. It must be sufficiently simple 
and motivating if the child is not to slip 
into a routine and meaningless pattern 
of choices. The young child’s tendency 
to perseverate may manifest itself most 
readily in these techniques. A lack of 
comprehension of the questions, or a 
lack of motivation or actual resistance 
from children of any age, can be 
masked quite easily in apparently cooperative 
selections of alternative 
choices. To guard against this kind of 
“misuse” by the subject, the investiga- 
tor can employ various “checks.” Thus, 
by repeating questions in varied forms 
at several points in the measuring proc- 
ess, as a check on the consistency of 
reactions, he can arrive at a safer esti- 
mate of the meaningfulness of data. By 
following the structured responses with 
open-ended probes, such as, “Could you 
tell me why you feel that way?” or 
“What are your reasons for choosing 
X?”, he may gain assurance or dis- 
couragement about his technique. 
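
The two safeguards just described, watching for a routine pattern of choices and repeating questions in varied forms, can be sketched in a few lines of Python. This is only an illustrative sketch: the function names, the data layout, and the sample record are assumptions introduced here, and any cutoff for a "suspicious" result would have to be set by the investigator.

```python
from itertools import groupby


def longest_run(responses):
    """Length of the longest run of identical consecutive choices.

    A very long run (for example, "yes" to every item) may signal
    perseveration rather than considered answers.
    """
    return max((len(list(group)) for _, group in groupby(responses)), default=0)


def consistency_rate(responses, repeated_pairs):
    """Share of repeated-question pairs answered the same way.

    `repeated_pairs` lists (i, j) item positions that ask the same
    question in varied form and are keyed in the same direction.
    """
    if not repeated_pairs:
        raise ValueError("at least one repeated pair is required")
    agreements = sum(responses[i] == responses[j] for i, j in repeated_pairs)
    return agreements / len(repeated_pairs)


# Hypothetical record for one child: eight yes/no items, where items
# 0 and 5 repeat one question and items 2 and 7 repeat another.
child = ["yes", "yes", "no", "yes", "yes", "yes", "yes", "yes"]
pairs = [(0, 5), (2, 7)]

print(longest_run(child))              # 5
print(consistency_rate(child, pairs))  # 0.5
```

A low consistency rate or a long run does not by itself show that the data are meaningless; it only marks records that deserve the open-ended probing described above.
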
Structured alternative responses in 
attitude and value areas run several 
other risks with children. They may 
produce too-neat representations of 
what may be very unclear and uncer- 
tain processes. The structured response 
cannot readily differentiate between 
the child who values altruism or hon- 
esty very highly and the child who 
has barely formulated principles. Re- 
search techniques of multiple-choice 
responses can force or create the form 
of the attitude by the items and alter- 
natives presented and can impose a 
framework which, though not incor- 
rect, is partly out of step with the 
child’s reality. Both points need illus- 
tration. Suppose we are interested in 
children’s attitudes toward mentally ill 
behavior; we describe an episode of 
paranoid behavior and then ask whether 
the person acting this way does so be- 
cause he is (1) “bad,” (2) “sick,” or 
(3) “unhappy about something.” The 
adult framework is automatically 
stamped on the child. The child may 
not have thought of any of these
explanations; he may not indeed have
seen the “paranoid” behavior. 

To illustrate the “miss” of reality in 
another way, suppose one is investigat- 
ing children’s attitudes toward parent's 
behavior or characteristics that have 
social connotations (being alcoholic, 
adhering to “foreign” customs or lan- 
guage, being in a given occupation, 
etc.). Expression of like or dislike, pride 
or shame, might be obtained with struc- 
tured questions and limited alternative 
responses, which would very reasonably
assess attitudes. On the other hand,
more significant dimensions might be
missed. We will suppose in this case
that the child lacks a general framework
with regard to each of the specific
parent characteristics. Instead his
feelings about his parent’s adherence 
to particular customs or forms of 
speech might be situationally deter- 
mined. (“I like them when we are alone 
in the family. I’m ashamed of them in 
front of my teacher.”) Feelings about 
the father’s occupation, although nega- 
tive in two children, may have very 
different qualities. (“It makes me want 
to cry.” “I wish I could run away.” 
“I hate my father.”) 

The limitations and risks of a struc- 
tured technique by no means rule it 
out as useful and appropriate for many 
attitude problems. An awareness of 
limitations should, however, influence 
instrument construction and most cer- 
tainly the interpretation of findings 
from a given instrument. 

The strengths of structured response 
techniques are apparent from a number 
of standpoints. When attitude areas 
disturbing to the individual (attitudes 
toward self, sex, parents, etc.) are investigated,
an approach that takes away
from the child the burden of verbalizing
the painful thoughts has advantages.
With highly specific, pointed questions
and with carefully differentiated alternative
responses, the structured approach
also has the advantage of being
able to explore systematically the subareas
of attitudes and values and to
sample widely along dimensions deemed
important. By this means the extent to
which a generalized response tendency
does exist and the limits or boundaries
of the attitude may be assessed. Structured
techniques, however, do not always
sample either widely or well but
rely on a few questions chosen rather 
arbitrarily to index the attitude. Ade- 
quacy of item sampling depends upon 
an explicit theory underlying item se- 
lection. 

By using relatively unstructured stim- 
uli and free responses it is assumed that 
artificialities of structured techniques 
are avoided and that the “natural” form 
of the attitude is obtained. To discover 
the structure of attitudes, freedom from 
adult-imposed organization is needed; 
to obtain the cognitive and motiva- 
tional aspects, the child must be able 
to contribute freely and uniquely in 
the testing situation. 

The question of relative validity and 
reliability of the two approaches has 
not been resolved in all-or-none fashion 
(again, see discussions in the literature 
cited above). This is certainly true for 
work with children; the specific tech- 
nique and application determine the 
answer. The unstructured approaches
have the potentiality but not the
assurance of greater depth. In ferreting
out test unreliabilities from attitude
changes that have occurred in the child
between measurement times, the open- 
ended responses (if they are sufficiently 
descriptive) have some advantage. Thus 
an attitude that appears changed on a 
score of negative-to-positive affect may 
contain unchanged cognitive and mo- 
tivational factors. On the other hand, an 
attitude score may be the same on two 
testing circumstances, yet considerable 
cognitive reorganization may have 
taken place. These test-retest trends are 
more readily discernible in clinical type 
data than in the entirely prestructured 
response categories. If the research prob- 
lem is one concerned with development 
and change in attitudes, the investigator 
will do well to choose his technique 
knowing its possibilities and limitations 
with respect to the amount of infor- 
mation he desires about the attitude 
change. 


THE HYPOTHETICAL 
AND IMAGINED 
VERSUS THE REAL 


Attitudes can be approached ob- 
liquely or frontally in another respect; 
namely, by putting the issue in hypo- 
thetical or imagined terms or in real 
terms. Consider the following alterna- 
tive forms that might be used in study- 
ing attitudes toward given social char- 
acteristics of peers: 


1. “Suppose you were going to 
choose three boys to go with you on 
a camping trip. Which would you 
choose?” (A series of descriptions of 
hypothetical companions is given the 
subject from which he will make his 
selection.) 

2. “We are teaming up next week
for field trips in botany. Which class-
mates would you prefer to have with 
you on your trip? Name three.” 


An investigation of children’s atti- 
tudes toward the father’s authority role 
in the family could proceed with ques- 
tions of either of the following forms: 


1. “Suppose you were dropped from
the high-school team because of poor 
grades and when your father heard 
about it he stopped your allowance. 
How would you feel about that?” 

2. “Think of the last time your father 
punished you. Why was the punish- 
ment given? What did your father do? 
How did you feel about that?” 


Investigators have been somewhat 
cavalier about choosing one or the 
other of these techniques, when per- 
haps it is important to be more con- 
cerned about the inferences that are 
made from responses to each kind of 
question. Does the child subject per- 
ceive and respond to the hypothetical 
and real differently? On what planes 
of reality and generality are the re- 
sponses to each approach? In the ex- 
ample on attitudes toward the father’s 
discipline, are the child’s reactions a 
reflection of his image of his own 
father, a digest of fathers he has ob- 
served, a highlighted experience with 
his own father, a stereotype of what 
fathers are like, a fantasied hoped-for 
father? Is one or another of these pos- 
sibilities encouraged more by “hypo- 
thetical” or “real” approaches? The 
familiar picture of the unbearably cruel 
teacher and punitive mother portrayed 
by children “playing school” or “playing
house” raises similar questions as to
the plane on which the child is
responding.

We have raised these questions of 
data interpretation but have not found 
methodological research that goes far 
in sorting out answers. However, the 
questions raised may leave the investigator
somewhat more reserved and
modest in his conclusions and generalizations
from attitude studies than he
might otherwise have been. 


PERSONALIZED 
OR IMPERSONALIZED 
TECHNIQUES 


In one approach to attitude measurements,
the child is asked to react in
terms of his own attitudes. In an alternative
approach, his attention is directed
away from himself to qualities of the
stimulus situation (how does the child
in the picture feel?). In the latter case
the child’s responses, manifestly, are not
about himself, but it is assumed that his
handling of the materials is determined 
by attributes within him. This “projec- 
tive” approach is popular in attitude 
research, for it avoids the direct ap- 
proach to sensitive areas, and, with 
some techniques, it avoids verbal re- 
sponses. As a research method with 
children it has a particular appeal, since 
play materials and situations can be 
readily transformed into projective de- 
vices (doll play, dramatic play, story 
telling, etc.). 

The theory and technology of pro- 
jective methods is the topic of Chap- 
ter 14 and is not discussed here. Uses 
of projective instruments for the meas- 
urement of attitudes and values involve 
no special assumptions or problems
other than those that apply in the use
of projective methods in the study of
child personality. However, because attitude
studies are often quite specific
as to content, i.e., attitudes toward parental
authority, siblings, sex identity,
or illness are investigated, projective 
techniques tend to be somewhat more 
focused in research on attitudes than in 
personality research. For example, when 
using doll play to study attitudes, the 
investigator is likely to identify the 
dolls as representing given persons or 
roles (parent, sibling, teacher, or social- 
group member) or even to set up quite 
specific situations or interactions for 
eliciting the attitudes. Thus in a study 
of young children’s attitudes toward sex 
differences Conn and Kanner (1947) 
supply a variety of dolls, to represent 
a new baby and the child’s siblings. 
Questions are asked in such a way that 
the child answers for the dolls. Point- 
ing to the doll, the interviewer asks, 
“What would he want to know about 
the baby?” “Which one of them wants 
to know where babies come from?” 
Attitudes of 4-year-old hospitalized 
children toward nurse, doctor, and 
medical procedures are approached 
similarly through doll play by Erickson 
(1958), again with the investigator 
carefully identifying each object for 
the subject.

Play media of many kinds are used, 
dolls and doll equipment more often 
than any other. Dolls, interestingly, are 
adaptable to ages beyond the doll- 
playing years if carefully staged. In the 
context of drama, for example, older
children can be made interested (Chein 
and Evans, 1948). Puppets similarly are 
suitable for older children. (Older boys,
however, may not always go along
wholeheartedly.) 

Pictures are used widely and are 
adaptable to many contents and to a 
wide age range. They have been ap- 
plied fruitfully in studying attitudes in 
many different areas, attitudes toward 
parents (Grace and Lohmann, 1952), 
toward teachers (Biber and Lewis, 
1949), toward self (Horowitz, 1943). 
The model in this kind of study is usu- 
ally a pictured interaction, varying in 
ambiguity, presented with a series of 
interview questions. General questions 
of the “tell-me-what-is-happening” va- 
riety begin the series and are followed 
by more specific and directed questions, 
such as “What does he expect the doc- 
tor to do?” This picture approach is 
fine for the preschool-age child, and 
it is equally adaptable to the adolescent.

Not all uses of pictures in the study 
of attitudes or values are “projective.” 
Pictures often serve as important inter- 
view props, but questions are directed 
toward the respondent: “Here is a pic- 
ture of a boy getting a prize in school. 
Has this ever happened to you? What 
did your mother say when it happened?
How do you feel about
school?” An example of this approach 
is found in a study by Morgan and 
Gaier (1957) of child and parent re- 
actions to punishment situations. 

Incomplete stories and sentences are 
still other projective devices to which 
children respond reasonably well, de- 
pending on the appeal and provocative 
quality of the story or sentence 
(Harris and Tseng, 1957; Seaton, 1949). 
Verbal facility becomes an important 
factor here. 

Although it is not difficult to devise
play approaches to which children will
respond with interest, it is difficult to 
design projective tests of attitudes or 
values in such a way as to be certain 
that one has sampled the attitude proc- 
ess adequately. Have essential and en- 
during aspects been sampled in the par- 
ticular limited doll play or story by the 
child? How far can one safely gener- 
alize about a child’s “conscience” or 
social values from his play with dolls 
in a given structured circumstance or 
from his completions or interpretations 
of given story stimuli? At present, re- 
search is at the stage of quite arbitrary 
selection of projective situations, with 
not enough concern about problems of 
generalization. What is needed is more 
careful consideration on the part of the 
research worker of the specific assump- 
tions he is making in devising a par- 
ticular form of projective stimulus and 
in using the response to it as repre- 
sentative of a generalized attitude or 
value system. 

In the preceding pages we have 
looked at major dimensions in which 
attitude techniques differ, whether the 
technique uses interview, test, scale, 
or behavior. We turn now to “techniques”
in the specific sense. In coming
to representative methods, it should be 
pointed out that the following para- 
graphs are not complete in themselves 
as analyses of each technique, but they 
bring up additional special points, 
unique to the given approach, which 
have not been discussed.


INTERVIEW 
AND QUESTIONNAIRE METHODS 


The interview, in one or another of 
its various forms, is the most frequently
chosen approach to children’s attitudes
and values. Its adaptability to developmental
differences of interviewees and
to psychological requirements of vari- 
ous contents and problems makes the 
interview an extremely valuable re- 
search tool. Nonetheless, as in most 
research techniques, its efficiency de- 
pends on understanding the conditions 
that are created for the respondent by 
the technique, per se, and the conditions
that are optimal for given research
objectives. Some of the conditions
of interviewer-child relationships
and of question form, of particular
relevance in the measurement of attitudes
and values, are discussed here.
Interviewing is treated more fully in 
Chapter 14. 

One of the most critical factors in
using the interview in the study of attitudes
is the symbolic value of the
interviewer for the child and its influ- 
ence upon the research process. In ex- 
periments in learning and conditioning, 
characteristics of the examiner have 
been found to influence the course of 
learning, though the task is objective 
and often simple and the investigator 
is assumed to be “neutral.” The inter- 
viewer’s influence on the respondent in 
an investigation of attitudes and values 
is probably many times greater and 
more direct. One such influence, as 
pointed out earlier, stems from the in- 
terviewer’s relationship to the child as 
a particular kind of authority. In his 
authority role, the investigator may be 
respected, rebelled against, distrusted, 
or admired by the child. His role may 
be very specific, such as teacher, doc- 
tor, law officer, or parent. Whatever 
the child’s image of this authority and
the codes governing his relation to the
authority, it is likely that these factors 
will influence his response to the re- 
search task. We know that authority 
in general as well as specific authority 
roles carry different meanings for chil- 
dren depending on their age, social and 
cultural background, and specific ex- 
periences. Systematic biases can, there- 
fore, become part of attitude findings 
by virtue of the interviewer-child rela- 
tionship. When the interview is used 
to compare attitudes and values of 
children of different ages or of differ- 
ent social and cultural groups, the find- 
ings may include a spurious age-linked 
or social group-linked relationship that 
derives from the interviewer-child relationship.
A hypothetical study illustrates
this point. We wish to study
values of honesty and altruism in chil- 
dren from two cultural groups. In one 
of these cultural groups parental con- 
trol and dominance are strong and de- 
manding; in the other, parent-child 
status differences are less marked and 
relationships are more informal. The 
interview questions explore circum- 
stances and considerations under which 
selfish and unselfish motives, honest and 
dishonest behavior are accepted by the 
child. Our findings show considerably 
greater adherence to moral principles 
by children in the first than in the sec- 
ond group; but, before we conclude 
cultural differences and theorize as to 
inculcation processes in each, we shall 
want to consider a possible research er- 
ror deriving from the relation between 
interviewer and child. How free do the 
children of the two backgrounds feel 
to express their own motives when 
these motives conflict with the presumed
adult sanctions? In this instance,
children governed by the more con- 
trolling parents may not feel free to 
express selfish motives to the adult or 
to compromise on honesty for fear of 
adult disapproval. Children brought up 
with the more informal relationship 
with parental authority, on the other 
hand, may be freer to reveal their own 
feelings. Is it these differences or is it 
differences in values that are reflected 
in the research findings? An adequate 
research design must take these several 
possibilities into account.

In addition to the interviewer’s au- 
thority roles, there are other symbolic 
properties that may affect attitude 
measurements. The interviewer may 
symbolize a group, an ideology, or a 
convention with a direct bearing on 
the content of the interview. Thus one 
might expect a systematic biasing in- 
fluence from the race of the interviewer 
in studies of racial attitudes, the sex of 
the interviewer in interviews about at- 
titudes toward physical characteristics 
or sex, the age of interviewer in ado- 
lescents’ discussions of dating patterns, 
of parent practices, etc. The more subtle 
personality characteristics of the inter- 
viewer are probably also significant. In 
a careful research design many sources 
of bias can be avoided; if they cannot, 
an attempt should be made to measure 
the nature of bias that exists. 

These liabilities in the interviewer do 
not minimize the over-all importance of 
a personal relationship in obtaining 
valid data from the child subject. It is 
only when a supportive relationship has 
been established between interviewer 
and child that the delicate and ego- 
involving contents common in attitude
and value studies can be studied with
maximum effectiveness and with some 
understanding of the defenses, rebellion, 
reluctance, puzzlement, fears, etc., that 
are evoked by the research. We know 
from research experience that children 
do become distressed in investigations 
of attitudes (Clark, 1939; Goodman, 
1953; Radke et al., 1949) that touch on 
highly emotional areas in their personal 
lives. Ethical considerations, therefore, 
place importance on the personal re- 
sponsibility of the investigator for un- 
derstanding and handling the distresses 
occasioned by him. 

In contrast to the interview, with its 
personal relationship, is the written 
questionnaire. The two approaches 
create very different subjective settings 
for the child, although identical ques- 
tions may appear in both instruments. A 
questionnaire distributed to groups of 
children at a time involves a minimal 
personal relationship between subject 
and investigator. Therefore, motiva- 
tions for doing the task, cushions for 
possible traumata occurring as a result 
of the content, guards against idiosyn- 
cratic interpretations of the task, and 
handling of reverberations that con- 
tinue within the child afterward—these 
factors must either be ignored or built 
into written instructions and group ad- 
ministrative procedures. The latter is 
not possible except in a general way. 

Some investigators have proposed that 
anonymity, which is possible on a 
questionnaire, is an advantage in the 
study of affect-laden areas; they argue 
that the child will be freest under 
these circumstances, freer perhaps than 
with an interviewer, and will be less
traumatized by revealing personal attitudes.
There is some validity in this
argument, but it is limited by another
factor; namely, anonymity is feasible
primarily in written, group-administered
questionnaires with structured,
check-list responses. As soon as free
responses are called for, the subject's
fear of revealing his identity and the
children’s difficulties in writing elaborate
responses arise. Therefore, any
advantages from anonymity are re- 
stricted by the limits of a highly struc- 
tured measuring technique. 

If one may hazard a formula for deciding
between a written group questionnaire
and a personal interview for
investigating the attitudes of children,
it might be as follows: faith in a written
questionnaire for children should vary
inversely with emotional content, expectation
of resistance, likelihood of
culturally stereotyped responses, and 
degree of deviance in the child group. 
Whenever it is used, the attitude ques- 
tionnaire should be tested as an inter- 
view, to establish a “feel” for children’s 
reactions to it, before applying it in the 
field. 

Interviews and questionnaires are the 
research creations of professional per- 
sons who, as a class, have the “peculi- 
arity” of being very verbal. It is well 
to remind oneself of this fact in devising
questions for children. Many
children, especially children whose 
home and community cultures do not 
foster verbalizing one’s feelings and 
values and ideas, may find intensive 
questioning difficult and foreign to their 
ways. 

The technique of asking questions,
which at first blush is a simple, common-sense
procedure, is not such if it
is to be used fruitfully with children. 
It requires a combination of various 
kinds of knowledge and skills. 


1. A theoretical sophistication is necessary
in the selection of questions.
Specific questions only sample a general
attitude or value. How adequate is the
sample chosen in a given interview?
What assumptions are made about the
specific questions in relation to the
attitude? These are questions of
validity (see pp. 682-684); they are
questions, too, that, if considered even
speculatively, will help to avoid the
more carelessly chosen stimulus items.


2. A technical knowledge of the in- 
fluences of question form on response 
is important. The form of the question 
must not itself bias the response. Prob- 
lems of “leading” questions, ambiguity, 
double-barreledness, and similar prob- 
lems of item construction, which have 
been explored in the adult testing field, 
are applicable downward in the age 
scale, with added attention to the child’s 
tendencies toward suggestibility and 
perseveration, discussed earlier.

3. Knowledge of the psychology of 
the respondent is a necessary guide in 
formulating questions. In research with 
children this is knowledge of develop- 
mental and social environmental factors 
(see earlier discussion, pp. 656-658,
669). In formulating questions for chil- 
dren, the adult’s frame of reference in- 
evitably enters in, variously sensing, 
variously distorting the child’s perspec- 
tive. On occasion, the child and adult 
may not be “speaking the same language”
at all. An example of this is
found in a study by Landreth and 
Johnson (1953) on the meaning of skin- 
color differences to preschool-age chil- 
dren. Social interactions were pictured. 
One face was omitted, and the child 
was asked to fill in the appropriate face 
by choosing from insets differing in 
“skin” color. Even though the children
responded thoughtfully to the research 
task, the judgments of a portion of the 
subjects were made out of considera- 
tions quite foreign to the purposes of 
the investigators. White upper-class 
children in the sample perceived the 
task as a perceptual color-matching 
problem, whereas the other children, as 
ascertained by their remarks, perceived 
it as a racial question. The investigators 
were alert to this fact. 


“Questions” in attitude interviews 
with children quite often are only part 
of the testing stimulus. Props of pic- 
tures, dolls, and story settings are used 
liberally as “provocations,” for research 
experience has demonstrated the value 
of concretizing the child’s task in these 
ways. The attitude questions can be 
phrased in terms of tangible objects or 
events furnished by the props. Not only 
are the ideas and concepts made easier 
by the tangible referents, but the child 
can respond in the language of actions 
and deeds, which is easier for him than 
summaries of his feelings, motives, or 
ideology. Although he cannot readily 
verbalize his ideologies regarding fair- 
ness, altruism, or morality, he can ex- 
press specific acts or decisions indicative 
of his ideology. For example, when 
presented with the story of the two
children who have broken their
mothers’ dishes, one who has broken 
ten dishes accidentally and the other 
who has broken one dish intentionally, 
children from preschool to high school 
are able to deal with this stimulus 
productively at their various levels of 
sophistication. 

Another method of increasing the 
concreteness of questioning in explor- 
ing the attitudes of children is that of 
exposing them to a given limited ex- 
perience (seeing a movie on warfare, 
playing or working with children of 
other national origin, or having the 
teacher carry out specified authority 
patterns) and questioning them after 
the experience. A funneling of specific 
to general questions (the reverse of the 
usual procedure) might be suited to 
this technique, i.e., starting with con- 
crete events in the experience and mov- 
ing toward generalizations beyond 
these specifics.

A suggestion by Merton et al. 
(1956), in a discussion of the focused 
interview with adults, seems very ap- 
propriate for children. When studying 
attitudes about events in the past or 
about complex relationships in the 
present, the investigator might stimulate 
the child to bring to mind the details 
of the critical events by describing 
them, “reliving” them. After this con- 
cretizing has occurred, questions of at- 
titude are asked. This would mean a 
procedure something like the follow- 
ing: the investigator exploring attitudes 
toward siblings or toward illness would 
ask the child to describe what he and 
his brother had done together the preceding
day or week or to describe when
he had last been sick or when someone
in his family had been sick. This would
serve to focus the child’s attention and
to leave less doubt concerning his
referents in responding to attitude
questions.


SPECIAL INTERVIEW 
OR QUESTIONNAIRE 
APPROACHES 


Sociometric Techniques. An inter- 
view and questionnaire technique that 
has proved highly useful in attitude 
research is the sociometric technique 
(see Chapter 19 for a more general dis- 
cussion of sociometric techniques). It 
has many variations, but its most typical 
form is to have each member of a group 
respond about every other group member
in terms of some specified reaction
or interaction. One may attempt to tap
very general attitude sets or preferences 
with questions such as, “Who are the 
children that you like most? Least?” 
One may also focus on more narrowly 
specified dimensions, “Whose manners, 
appearance, ideas, etc., do you like 
best?” Or interactions, “Whom would 
you like to sit with, eat with, have at 
your birthday party?” (There are many 
discussions of this methodology; 
Lindzey and Borgatta, 1954.) 
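
As a minimal sketch of how such responses might be tabulated, the Python below records each child's choices and counts the choices each group member receives, along with mutual choices. The names, the sample data, and the functions are hypothetical; the tabulation is one simple possibility, not a method prescribed in the text or in the cited source.

```python
from collections import Counter


def choices_received(choices):
    """Count how often each child is named by the others."""
    tally = Counter({child: 0 for child in choices})
    for chooser, chosen in choices.items():
        for other in chosen:
            if other != chooser:  # ignore self-choices
                tally[other] += 1
    return dict(tally)


def mutual_pairs(choices):
    """Pairs of children who name each other."""
    pairs = set()
    for chooser, chosen in choices.items():
        for other in chosen:
            if other != chooser and chooser in choices.get(other, ()):
                pairs.add(tuple(sorted((chooser, other))))
    return sorted(pairs)


# Hypothetical answers to "Whom would you like to sit with?"
choices = {
    "Ann": ["Ben", "Cal"],
    "Ben": ["Ann"],
    "Cal": ["Ann", "Dee"],
    "Dee": ["Cal"],
}

print(choices_received(choices))  # {'Ann': 2, 'Ben': 1, 'Cal': 2, 'Dee': 1}
print(mutual_pairs(choices))      # [('Ann', 'Ben'), ('Ann', 'Cal'), ('Cal', 'Dee')]
```
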

As a technique for children, this has 
the advantage of asking simple ques- 
tions. When a child is asked to choose 
the children he likes best, the ones he 
wants for friends, or does not like, one 
is not setting up a particularly dif- 
ficult task, at least, not one entirely 
outside his usual experience. He has 
been making these decisions in the 
course of his everyday associations. 
Whether he is 4 or 14, the choice in
some sense involves similar processes.
Hence the technique can be useful for 
studies directed to age changes, in which
the equivalence of verbal stimuli at 
successive ages is often a difficult ques- 
tion. 

The emphasis in sociometric attitude 
studies may be on individual children 
in the stimulus group, in terms of the 
attitudes expressed toward them by the 
respondents, or on the attitudes of the 
respondents as they are revealed in 
choices and rejections of stimulus chil- 
dren. For example, the stimulus chil- 
dren may be subgrouped by the 
investigator (into boys and girls, whites 
and Negroes, preadolescents and ado- 
lescents, upper class and lower class, 
etc.). Choices on sociometric questions 
are interpreted in terms of these sub- 
groups. Thus, what are the respondents’ 
attitudes regarding sex, maturity, class, 
etc.? Overchoosing or underchoosing 
of members of a given subgroup be- 
yond a chance expectancy is taken as an 
indication of a positive or negative atti- 
tude. (See Lindzey and Borgatta, 1954, 
for statistical methods.) 
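
The comparison with chance expectancy can be illustrated with a small Python sketch. The chance model used here is an assumption made for illustration only: each child's choices are treated as spread evenly over the other group members, whatever their subgroup. The function name and the sample data are hypothetical, and no significance test is attempted; for that, the statistical methods in the cited source should be consulted.

```python
from fractions import Fraction


def in_group_choice_summary(choices, group_of):
    """Compare observed own-subgroup choices with a chance expectation.

    Chance model (an assumption): each child's choices fall evenly on
    the other group members, regardless of subgroup.
    """
    members = list(group_of)
    observed = 0
    expected = Fraction(0)
    for chooser, chosen in choices.items():
        others = [m for m in members if m != chooser]
        same = [m for m in others if group_of[m] == group_of[chooser]]
        observed += sum(group_of[c] == group_of[chooser] for c in chosen)
        expected += Fraction(len(chosen) * len(same), len(others))
    return observed, expected


# Hypothetical group of three boys and three girls, two choices each.
group_of = {"Al": "boy", "Bo": "boy", "Cy": "boy",
            "Di": "girl", "Eve": "girl", "Fay": "girl"}
choices = {
    "Al": ["Bo", "Cy"],
    "Bo": ["Al", "Di"],
    "Cy": ["Al", "Bo"],
    "Di": ["Eve", "Fay"],
    "Eve": ["Di", "Fay"],
    "Fay": ["Di", "Cy"],
}

observed, expected = in_group_choice_summary(choices, group_of)
print(observed, float(expected))             # 10 4.8
print(round(float(observed / expected), 2))  # 2.08
```

In this invented example own-sex choices occur about twice as often as the chance model predicts. Whether such an excess reflects an attitude or some correlated factor is a separate question of interpretation.
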

Several assumptions are made in
studies of this kind: (1) the subgroupings
in the mind of the investigator are
also in the awareness of the children,
and (2) when a selective bias appears,
the subgroup characteristics, and not
some other unknown correlated factor,
are the basis of choice. Let us suppose
that we found children’s choices on a
playground predominantly within their
own sex group. We might conclude
that attitudes of preference for own
sex are very strong. However, when
we decide to investigate further, we
find that these children have been attending
sex-segregated schools. Thus
prior acquaintance enters as a factor to
be reckoned with. The problem of correlated
variables, known and unknown,
complicates interpretations of sociometric
data. Whatever the specific purpose
of using sociometric techniques
(to assess group cleavage, personal 
preference, etc.), an assumption is al- 
ways being made that the variable that 
is real to the investigator is real to the 
subject and is not confounded by other 
correlated variables. 

The adaptability of the sociometric 
technique, in form and purpose, has 
added to its popularity as an approach 
with children. With it one may elicit 
and compare the wished for, the ex- 
pected, the accustomed, the sanctioned, 
[“With whom would you like to be 
friends? Who will choose you as a 
friend? With whom have you played 
most? Whom should you (not) choose 
as a friend?”]. Questions may be di- 
rected to affect (“Who is liked or dis- 
liked?”), to intentions regarding be- 
havior (“Whom will you choose as a 
partner?”), or to cognitive elements in 
perception and choice. In the guess- 
who technique, an example of this ap- 
proach, the child is asked to look among 
his associates and to tell, “Who likes to 
boss others around,” “Who is someone 
that everybody likes,” “Who is afraid,” 
etc. 

The sociometric method may be 
directed with equal ease to the child’s 
attitudes concerning his face-to-face 
groups (such as his class or club) or 
to attitudes concerning hypothetical 
groups that can be defined simply 
verbally or pictorially. A favored ap- 
proach with children is to present 
pictures, individual photographs, such
as the Horowitz “Faces” test (1938), 
or group scenes in which readily iden- 
tifiable characteristics are represented 
(age, sex, race, etc.). The subject is 
asked to indicate which of the pic- 
tured persons he would choose as ful- 
filling given descriptions. Sometimes 
the child’s immediate interaction group 
is pictured, in this way making certain 
that each group member is brought to 
the attention of the respondent. Yar- 
row et al. (1958) followed this pro- 
cedure in studying the formation of 
interpersonal preferences and percep- 
tions in children’s groups in the early 
stages of getting acquainted. By placing 
before the respondent the pictures of 
all the children in the group, difficulties 
of not remembering names and the pos- 
sibility of overlooking certain children 
were bypassed, and comparisons and 
contrasts among children were probably 
made easier. 

Sociometric adaptations blend into 
other assessment procedures that re- 
quire the child to sort individuals along 
prescribed dimensions. Thus rank or- 
derings, social distance tests, and cer- 
tain projective tests are akin to socio- 
metric methods. Of the social distance 
test, a bit more should be said, for it 
could be used, more than it has been, 
for assessing the social or personal “dis- 
tances” that the child feels. In its 
original form (Bogardus, 1925), this test, 
of course, dealt with the willingness of 
the individual to admit members of 
different national groups into various 
social relationships (entrance to country,
residence in neighborhood, acceptability
as a marriage partner). The
relationships were roughly “scaled” in
terms of the distance or intimacy of 
the relationship. One need not dwell on 
the inappropriateness of these particular 
items for children (below adolescence), 
but it should be apparent that items 
more critical in the child’s life space 
might be devised and presented in this 
form. The form of the test is appeal- 
ingly simple and can be made meaning- 
ful to the child. 

Not only in studying social attitudes
toward different groups of people but 
as a technique for exploring attitudes 
of “personal distance,” this approach 
appears to have potentialities that have 
not been used with children. For ex- 
ample, the “distance” between the child 
and various significant adult figures,
between child and different family 
members, or between child and other 
children might be explored in this 
manner. 

The common characteristic in the 
techniques discussed in this section is 
that of obtaining the child’s placement 
of persons relative to one another on a 
variety of yardsticks. The data furnish 
meaningful “summary” judgments or 
decisions of the respondents, but there 
are limitations in such decisions in atti- 
tude assessment. Sociometric choice 
cannot be automatically equated with 
attitudes, for situational and personality 
factors (e.g., unfamiliarity with stimulus 
persons or conscious distortion) can 
enter into choice. Sociometric and 
related techniques are best used, there- 
fore, in the company of other ap- 
proaches. It is important to measure 
in this way the correspondence 
between sociometric data and independent
criteria of attitudes as estimates
of validity, but it is equally
important to use several techniques
planfully, with the objective of obtaining
data from them on different
qualities or aspects of the attitude, thus
to arrive at a better understanding of
the nature of the attitude.

Attitude Scales. If groups of in- 
dividuals are to be compared or if the 
effects of particular experiences are to 
be measured in “before” and “after” 
assessments, an attitude “score” may 
be desired. This requires the develop- 
ment of a scale. With adults, the scal- 
ing of attitudes and opinions is a highly 
developed field and constitutes a large 
body of literature in psychology and 
sociology. The standard techniques of 
scale construction are amply reported 
(see Thurstone and Chave, 1929; Likert,
1932; Stouffer et al., 1950; Green, 1954, 
as well as many textbook discussions). 
Here we are concerned not with the 
technicalities of scale construction but 
with the question of how well the 
capacities of children make them suit- 
able subjects for the use of scales. 

The construction of an attitude scale 
requires the selection of a set of items 
to which the individual can respond 
(indicating his agreement, preference, 
approval, and the like) and from which 
he can be assigned a score on a quantitative
continuum. The response model for
most scales is a simple one: a yes-no 
type of decision or a graded yes-no. 
The items follow one upon the other, 
calling for rapid shifts of problem and 
focus by the respondent. In some scales 
the items are manifestly quite distinct. 
In other scales the items may vary only 
slightly on the same theme. The subjects’
comprehension, concentration,
and openness are essential for valid
scale responses. 
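
How a set of graded yes-no items yields a single score may be illustrated by a simple summated scoring rule. The sketch that follows is hypothetical throughout: the four-point coding (1 = strong "no" to 4 = strong "yes"), the item names, and the reverse keying of negatively worded items are assumptions made for illustration and do not reproduce any published scale.

```python
def summated_score(responses, reverse_keyed=(), points=4):
    """Sum graded yes-no answers into one attitude score.

    responses     -- dict mapping item name to an answer coded 1..points
    reverse_keyed -- items worded so that "yes" indicates the opposite attitude
    points        -- number of response grades (assumed to be 4 here)
    """
    total = 0
    for item, answer in responses.items():
        if not 1 <= answer <= points:
            raise ValueError(f"{item}: answer {answer} is outside 1..{points}")
        # Reverse-keyed items are flipped so that a high value always
        # points in the same attitudinal direction.
        total += (points + 1 - answer) if item in reverse_keyed else answer
    return total


# Invented items and answers for one child.
example = {"likes_group_games": 4, "prefers_to_play_alone": 1, "shares_toys": 3}
score = summated_score(example, reverse_keyed={"prefers_to_play_alone"})
print(score)
```

A total of this kind is interpretable only to the extent that the child has understood every item and that the items reflect a single dimension.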

The main considerations when using 
scales with children are the following: 


1. One cannot use scales with chil- 
dren unless one has evidence that the 
children have reasonably organized or 
crystallized attitudes on the issue 
studied (see earlier discussions of de- 
velopmental and experiential factors 
in language and thought of the child and 
problems of meaningless and persevera- 
tive responses on structured questions). 

2. The requirements of responding to 
a battery of questions would not seem 
to be especially trying to school-age 
children accustomed to tests at school. 
However, length and difficulty level 
of a scale should be considered in terms 
of the age and other characteristics of 
the children being studied. 

3. One cannot assume that because
questions are consistently and meaningfully
ordered for adults they will be
similarly ordered for children. It is 
necessary to test scalability on children 
of the particular developmental age one 
is studying. Just as scales do not readily 
transfer from one culture to another, so, 
too, they cannot be assumed to transfer 
from adult to child culture. For ex- 
ample, attitudes regarding property, 
components of prejudice, attitudes 
toward illness, age, and sex could be 
expected to differ for children and 
adults. Children must themselves be used 
in scale construction, much as the taste
panel for flavored drugs consists of a
group of children. Comparisons of adult
and child scale-criterion groups may
indeed be interesting developmental 
data in themselves. 

In general, scales are useful in the 
measurement of children’s attitudes 
when it makes sense to deal with a 
single dimension; too often investigators 
attempt to force multidimensional 
problems into the Procrustean bed of 
one dimension because scales seem 
“scientific.” It may be well to look at 
some of the kinds of research problems 
on attitudes and values that do not lend 
themselves to scale measurement. If an 
investigator is interested in the forma- 
tion of attitudes or values, in the 
processes of change over time or as the 
result of identifiable experiences, in the 
constellation of forces maintaining an 
attitude, or resisting or creating change 
in an attitude, more “qualitative” meas- 
ures would seem necessary. 

Through a detailed report of data on 
a preschool child’s values regarding the 
use of aggression, we can perhaps see 
more concretely what one could and 
could not measure with a scale ap- 
proach. Murphy, Murphy, and New- 
comb (1937, pp. 407-409) cite a study 
by Fite of children’s values regarding 
aggression. The following is part of the 
description of one child just under 4 
years of age. 


When Nancy was asked what she would 
do if another child hit her, she said she 
would tell the teacher. Would she hit him 
back? “No, because I want to be good 
because my mother and father say I should, 
so I’m trying my best.” And later she gen- 
eralizes still further; “You have to tell the 
parents. You must tell everything to the 
parents ‘cause they’re bigger. The child 
asks the parents what to do and then they 
help the child.” . . . In a repetition of the 
situation two weeks later with the second
series of pictures, her strongly filial attitudes
held up pretty well. No, she would
not hit back, and if her aggressor kept at
her, she would call her mother to make
him stop. “Mother would say, ‘This boy
hurts, and ’cause it’s not nice and would 
he please take his hands off ’cause it’s not 
nice.’” The experimenter tried to make it 
harder: “But suppose you were at school, 
and you called the teacher, and the teacher 
didn’t hear you.” Nancy admitted in that
case that she’d have to “push his hand . . .”
Then for a moment her feelings slipped
through: “Then I would show him how
it feels.” But as if in contrition over that
admission, she replied to the next question,
“Would you hit him?” with, “I
would be nice and good. I would give 
him a lollypop.” 

Several months later, at the beginning, 
when the experimenter asked if she should 
make her own doll hit Nancy’s, Nancy 
responded, “Yes,” rather casually. But at 
the actual hitting, her face grew serious, 
and at the suggestion that she make her 
doll hit the experimenter’s, she drew her 
doll back from the experimenter’s and, 
with a very anxious expression said, “No, 
I don’t want to.” She seemed quite near 
the point of tears, and to each question 
about hitting replied vehemently that hit- 
ting was not all right: “I know because 
my father and mother say so, I know it.” 
Both the experimenter and the assisting 
research worker were struck by the un- 
happiness of her tone and expression, and 
the vehemence with which she pulled her 
doll back when asked to make it hit. Might 
there be a reason to suspect that in some 
way her confidence in what her parents 
said was being undermined, and that the 
degree of emotionality shown was a sign 
of some conflict? 

Six months later, she replied with com- 
plete casualness that she didn’t think it 
right to hit people. “’Cause it hurts. 
’Cause I felt it one time.” (Note that “it 
hurts” seems to be the first true “reason” 
to develop.) “I think it’s not so good— 
neither does my mother.” 


This time, when asked, “If some other
child hits you first, then what’s the thing
to do?” Nancy’s immediate response is,
“Hit them back.” Moreover, she’s not
above romancing a little to back this up:
“’Cause that’s what my mother tells me
every night before I go to bed.” (The
pure fiction of this statement was checked
with the parents.) Also, another reason
was mentioned: “’Cause that’s what sometimes
other children do to me.” Now it
seems that in fantasy she has loaded her
parents on her side and in actuality she
has her social experience to back her up,
and thus perfect accord is made.


A scale score could register this 
child’s resistance to the use of aggres- 
sion, but it could not reflect the emotion 
with which the child responded nor 
her competing impulses. It might not 
have marked the change at the sixth- 
month questioning, when the child still 
thought it “not right” to hit people,
but had resolved much of the con- 
flict that she had portrayed in her 
earlier responses. 

Before leaving the topic of scaling 
attitudes, it should be noted that it is 
possible to make use of the principles 
of scaling, yet not obtain data through 
a scale composed of structured ques- 
tions with fixed answer categories. Chil- 
dren may be asked questions about their 
attitudes toward aggression or adult 
supervision and may be allowed to 
give free responses. The investigator 
can assign responses to scale categories. 
Kohn and Carroll (1960) used this 
procedure in an interview study with 
10-year-olds concerning their social re- 
lationships with peers and parents. For 
example, a number of questions in the 
interview schedule pertain to the child’s 
attitudes toward his relationships with 
his father. Responses to these items were 
assigned categories by the interviewer.
A Guttman-type scale was constructed 
for these items. 
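
Whether coded responses of this kind form a Guttman-type scale is usually judged by a coefficient of reproducibility. The following sketch shows one common way of counting scale errors (deviations of each respondent's pattern from the ideal pattern implied by his total score). It is offered as an illustration under stated assumptions, not as the procedure used by Kohn and Carroll (1960); the function name and the data are invented.

```python
def reproducibility(responses):
    """Count scale errors and compute a coefficient of reproducibility.

    responses -- list of rows, one per respondent; each row holds 0/1
                 codes, one per item (1 = response coded as favorable).
    Returns (errors, coefficient).
    """
    n_items = len(responses[0])
    # Order items from most to least frequently endorsed.
    totals = [sum(row[j] for row in responses) for j in range(n_items)]
    order = sorted(range(n_items), key=lambda j: -totals[j])

    errors = 0
    for row in responses:
        score = sum(row)
        for rank, j in enumerate(order):
            # In a perfect scale a respondent with total score s endorses
            # exactly the s most frequently endorsed items.
            ideal = 1 if rank < score else 0
            errors += int(row[j] != ideal)

    coefficient = 1 - errors / (len(responses) * n_items)
    return errors, coefficient


# Invented data: six children, three coded interview items.
example_rows = [
    [1, 1, 1],
    [1, 1, 0],
    [1, 0, 0],
    [1, 0, 0],
    [0, 1, 0],  # endorses the second item but not the first
    [0, 0, 0],
]
errors, coefficient = reproducibility(example_rows)
print(errors, round(coefficient, 3))
```

Because the item ordering is derived from the sample itself, the same analysis must be repeated on children of the developmental level under study before scalability can be claimed for them.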

Behavioral measures of attitudes and 
values might be handled similarly for 
some research purposes. Data obtained 
in observational records of behavior in 
experimental or natural situations could 
be coded and scaled (see pp. 679-682). 
Investigators working with children 
have not generally made use of these 
procedures, but they would appear to 
be useful and worth exploring. 


PERSONAL DOCUMENTS 
AND SELF-RATINGS 


This approach has serious limitations 
for systematic study of attitudes and 
values with children as with adults 
(Allport, 1942). The occasional produc- 
tive use that has been made of children’s 
themes, artistic products, diaries, and 
the like, deserves passing reference. 
The casual and often accidental ac- 
cumulation of such materials is an 
important limiting factor in their use- 
fulness in research. In general, these 
documents can be revealing as data 
used with more systematically gathered 
data. In achieving clinical understand- 
ing of a given child, they may also 
contribute substantially. 

Paintings and drawings, especially, 
have interested investigators working 
with children at all ages (Wolff, 1943), 
though more often in the interests of 
personality diagnosis than in attitude 
measurement. Pupils’ themes have been 
put into service in various studies as 
sources of data on attitudes and values, 
although the extraneous and uncon- 
trollable factors that enter into such 
productions (such as literary ability
and secondary motivations as in a class- 
room theme) raise major reservations. 
They may, however, be sufficiently 
diagnostic to reflect group differences 
or to serve as relatively gross screening 
techniques. 

In the company of a more “solid” 
technique the child’s day-to-day prod- 
ucts can act as catalysts for verbal 
expression of attitudes. Yarrow (see 
Chapter 14, p. 570) reports a highly 
successful technique for getting at cer- 
tain personal data on 5- and 6-year-olds. 
The child brings his scrapbook of recent 
experiences and collections to the inter- 
view. Semistandard questions are ex- 
plored in the context of these materials, 
with the child’s involvement greatly 
enhanced. As a technique for approach- 
ing attitudes toward self, this approach 
appears to have interesting possibilities. 

Self-ratings by the child have rarely 
been tried. However, sociometric ques- 
tions are sometimes turned on the child 
himself; that is, he is asked by whom he
would be chosen or for which of a series
of guess-who items he would pick
himself. From his choices, inferences
are possible about his self regard. More 
direct self-ratings by children would 
probably suffer from the same kinds of 
distortions and biases found in their 
use with adult subjects. 


BEHAVIORAL INDICES 
OF ATTITUDES AND VALUES 


Thus far only verbal indices of atti- 
tudes and values have been discussed, 
although behavior may also serve as the 
basis for inference. Observations of behavior
can be made and coded in terms
of attitudes and values. Behavioral
studies have several strong appeals: they
avoid verbal barriers, they can (but may
not always) circumvent self-conscious
and possibly threatening revelations,
and identical behavioral indices can be
used for children of a wide range of
age levels and cultural backgrounds.
They can also be supplemented by verbal
responses, either by including verbal
data in the observational records or by
following observational records with
verbal tests or interviews. Behavioral
measures are suited to experimental as
well as naturalistic approaches.

There are drawbacks, too. First, the 
behavior that one would like to observe 
in order to assess attitudes or values may 
not be easily induced. The critical be- 
haviors may be very infrequent in occur- 
rence, and thus a very large volume of 
irrelevant data may have to be recorded 
and analyzed in order to achieve a 
reasonable sample of the relevant data. 
Then, too, behavior (as well as verbal 
responses) may be embroidered or 
faked. To be certain that comparable 
behavioral circumstances are being 
observed for different subjects and at 
different times and that there are no 
important determinants of the behavior 
other than the attitudes or values in 
which one is interested, very careful 
prescriptions must be laid down for 
behavioral measures. This applies 
equally to contrived and natural situa- 
tions. Here an explicit theory or set of 
hypotheses linking behavior with at- 
titude or value is of help to the in- 
vestigator in guiding his sampling of 
behavior and his analysis of behavior. 

Let us consider an example of the
possible use of behavior in a study of
attitudes. Suppose the investigator’s
purpose is to study attitudes of children 
toward physical handicaps in their 
peers, and he decides to use classroom 
observations of behavior as his data 
source. In each classroom there are 
several children with visible physical 
handicaps. He will do well to specify 
in advance exactly what information 
concerning attitudes he will attempt to 
obtain and in what behaviors he assumes 
the attitude will be manifested. Thus he 
may be interested solely in general 
affective qualities of attitudes, which he 
may define behaviorally as approach and 
avoidance behavior of the nonhandi- 
capped toward handicapped children. 
Approach and avoidance may be de- 
fined concretely in terms of choice of 
peers for desired classroom functions, 
as partners, etc. He may be interested 
in the more differentiated cognitive 
aspects of attitudes. He will then want 
behavioral data on a wider spectrum of 
personal interactions, coded in terms 
such as subordinate-dominant relations, 
nurturant-succorant interactions, ag- 
gressive-affiliative reactions, etc. He 
may be interested in attitudes in rela- 
tion to specific situations. In this case, 
he will need to specify situational fac- 
tors carefully. 

Since behavior in school may be 
governed by an institutional ideology 
of democratic relations, the investigator 
might also want to obtain behavior that 
is more “private,” reasoning that under 
such circumstances the more personal 
and genuine attitudes would be dis- 
played. For this purpose, his design may 
include contrived (though to the child,
natural) situations to get at the child’s 
behavior when he is not aware that he 
is being identified or observed. 

None of these suggested behavioral 
indices is a “pure” measure of attitudes. 
For each, the investigator must be con- 
cerned with the things that influence 
behavior but are neither attitudes nor 
values. For example, the immediate en- 
vironment may influence the opportu- 
nities for certain behaviors to occur. 
Classroom seating arrangements or 
ability groupings may thus influence 
peer choices. Personality differences 
contribute another important, nonatti- 
tudinal influence. 

The use of behavioral data requires 
observational techniques for data 
gathering. For discussions of basic issues 
of observational techniques—sampling, 
observational schemes, recording de- 
vices, and analysis schemes—the reader 
is referred to Chapter 3 and to compre- 
hensive treatments of the topic, such as 
that of Heyns and Lippitt (1954). 
Variations of behavioral designs that 
have been used successfully in the in- 
vestigation of children’s attitudes and 
values are reviewed there. 

Many studies of children make use of 
behavior “once removed” from the 
child’s own behavior by asking him to 
use test materials in certain ways. The 
subject is asked to choose dolls or pic- 
tures to represent his responses (Clark 
and Clark, 1939) or to manipulate dolls 
or other play materials to convey his 
ideas and feelings. In this nonverbal 
approach, complex stimuli can be used 
and children can respond meaningfully 
who could not manage the same problems
verbally. For instance, several
groups of investigators have succeeded 
in studying awareness and values of 
young children, in regard to social class 
and social roles, by introducing class 
and role symbols into backgrounds in 
doll play (housing and street scenes of 
differing economic levels, costume differences,
etc.) (Hartley and Schwartz,
1951; Trager and Yarrow, 1952). In
these studies interview questions supple- 
ment performance data. 

Dolls are used by Chein and Evans 
(1948) in studying social attitudes in 
ways acceptable to older children. Sub- 
jects, 7 to 14 years of age, are supplied 
with miniature “movie sets” and asked 
to provide action and dialogue for 
standardized story situations. Role play- 
ing may be used similarly. The in- 
vestigator may pose a problem, take the 
role of one of the significant actors, and 
have the child assume the other role. 
The investigator may take the role of 
the child and ask the child subject to 
assume the role of doctor or mother. 
This approach is highly revealing in 
individual cases in which considerable 
rapport and comfort have been established.
It has been more of a clinical
than a research technique. However, the 
possibility of its development for 
systematic research purposes appears 
feasible. Stanton et al. (1956) describe 
this type of technique as they have used 
it for survey research with adults, on 
problems of parent-child relations, 
housing, and marital adjustment. The 
approach seemed to work very well. 
It should be noted, however, that there 
was usually some special entry into the 
households; it was not simply a polling 
type of contact. This technique seems
worth more study with children. 

Experimental situations that do not 
rely on the child’s representation of 
behavior with actors, but which obtain 
behavior from the child in his own 
right, have an important place in re- 
search on children’s values. Most of 
these experimental situations set up 
choice or decision situations. In level
of aspiration or problem-solving de- 
signs, in which achievement values or 
attitudes of self regard are the object 
of research, the situation is likely to 
be set up with a testlike quality (Lewin 
et al., 1944). More often, however, the 
investigator is interested in contriving 
a “natural” choice situation for the dis- 
play of values, one in which the child 
is not aware of the fact that he is being 
tested. 

An extensive series of ingenious behavioral
tests of values with this “real”
quality about them was devised by
Hartshorne and May a long time ago
(1928). The “tests,” which offered
opportunities to cheat in school work
and at games, to steal, to give false
answers about themselves, to choose
between personal and group rewards or
between keeping and sharing with a needy
person, and so on, were
given to 11,000 school-age children. No
study of children’s values has since 
been carried out on such an impressive 
scale, both qualitatively and quantita- 
tively. 

A few studies following this model 
have been reported. One by Wright 
(1942) used an experimental laboratory 
technique successfully for studying 
selfishness and generosity in children, 
5 to 8 years old. She used simple but
child-motivating situations in which 
the child retained or shared toys with 
other children variously described to 
him by the experimenter. Selfish or 
generous behavior was studied in rela- 
tion to the perceived traits of the other 
children (such as their degree of need). 

It would seem most fruitful to review 
and replicate the Hartshorne and May 
studies (1928, 1929) with the benefits of 
modern developments in observational 
techniques and within a systematic 
theoretical framework in which the 
antecedents and personality correlates 
of differences in value orientations are 
considered. This approach combines the 
advantages of experimental control with 
genuineness of the situation and ego- 
involvement by the subjects. In devising 
experimental studies, the desires for 
experimental cleanness and for the in- 
tactness of the value or attitude process 
are often in conflict. Except in rather 
rare designs, one or the other tends to 
suffer. Sacrifice of experimental con- 
trols means a loss in the explanatory 
power of the data. Reduction of the 
attitude or value to a simple standard
decision in an activity that is not ego- 
involving may leave only a tortured 
inference that the processes being 
measured are attitudes or values. Un- 
doubtedly, one of the most difficult 
steps in designing experimental studies 
is the devising of reasonable and effec- 
tive motivating conditions for children 
in which behavior indicative of attitudes 
or values can be elicited. Of utmost im- 
portance is the experimenter’s sensitivity 
to what constitutes reasonable and effective
motivating conditions for children;
reasonable in the sense that ethical
principles are not compromised; effec- 
tive in that children can comprehend 
and can be emotionally involved in the 
situation. Unfortunately, the nature of 
values and attitudes is such that the 
contrived tests are potentially disturbing 
to the child. Consider, for example, 
the anxiety and guilt that may follow 
the “undetected” and rewarded dis- 
honesty that the child exhibits in the 
experimental situation or the severe 
conflicts occasioned by tempting the 
child in areas of moral prohibitions. 
Any investigation in this field will do 
well to have built into it a measure of 
the impact of the research upon the 
child; this can be done by planned 
observations or interviews during and 
after the procedure. The research de- 
sign should also include criteria for a 
decision to terminate a procedure with 
a given child, should the procedure 
cause marked disturbance in him. 

To avoid the necessity of inducing 
value issues experimentally and to be 
able to study powerful value issues, re- 
search has turned to field studies of 
behavior in its natural settings. In field 
designs several procedural choices are 
open. The completely untampering ap- 
proach is that of the participant ob- 
server. His approach may be that of the 
anthropologist. Studies of the value 
systems of primitive cultures and the 
inculcation of values in their young 
have been made with this method. It 
relies heavily upon the synthesizing ca- 
pabilities of a trained observer living for 
a time in the culture and having wide 
access to its “inner workings” (Mead,
1928; Whiting, 1941). Such studies yield
what may be regarded as a case study
of the attitudes and values of a group,
and, like individual case studies, can run
the gamut from unsystematized descrip- 
tion to an observational account fash- 
ioned by a specific theoretical position. 

To go beyond case studies of in- 
dividual children or groups of children, 
it is clearly necessary to introduce the 
familiar requirements of systematic re- 
search procedures: specified criteria for 
sampling behavior, for categorizing be- 
havior, and for recording behavior. 
Only in this way has the investigator 
the bases for comparing or drawing 
conclusions about values and attitudes 
expressed by different children, at dif- 
ferent times, in unlike circumstances. 
(See Chapter 3 for analysis of method- 
ological problems of field studies.) 

By investing the participant observer 
with the philosophy and tools of the 
experimenter, field studies and experi- 
mental studies are brought somewhat 
closer together in techniques. Re- 
searchers in roles such as teacher or 
counselor or nurse can study the chil- 
dren’s attitudes and values in natural 
settings. However, studies of this kind 
are few in comparison with the in- 
vestigations in which verbal responses 
of children are used. 

The observer in a field study can 
proceed very much as he would 
in laboratory group studies. Time sampling,
event sampling, running accounts,
and precoded rating categories are tech- 
niques transferable from the laboratory 
to the field. 
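
The difference between time sampling and event sampling can be made concrete with a small sketch. The record format (seconds from the start of the session paired with a behavior code), the category names, and the interval length are all assumptions chosen for illustration; the time-sampling rule shown is the simple one of noting whether the behavior occurred at all within each interval.

```python
def time_sample(events, category, session_length, interval):
    """Mark, for each fixed interval, whether the category occurred at all.

    events         -- list of (seconds_from_start, behavior_code) pairs
    session_length -- length of the observation session in seconds
    interval       -- length of each sampling interval in seconds
    """
    n_intervals = session_length // interval
    occurred = [False] * n_intervals
    for seconds, code in events:
        if code == category and 0 <= seconds < n_intervals * interval:
            occurred[int(seconds // interval)] = True
    return occurred


def event_sample(events, category):
    """Count every recorded occurrence of the category."""
    return sum(1 for _, code in events if code == category)


# Invented running record for one child in a 90-second observation.
record = [(5, "approach"), (12, "approach"), (47, "avoid"), (65, "approach")]
intervals = time_sample(record, "approach", session_length=90, interval=30)
count = event_sample(record, "approach")
print(intervals, count)
```

The two summaries answer different questions: the first indicates how widely the behavior is spread over the session, the second how often it occurs. Infrequent but critical behaviors are easily missed by short or widely spaced intervals.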

Unless a natural situation presents it- 
self, in which the values or attitudes of 
interest to the investigator have a high 
likelihood of manifesting themselves, 
the naturalistic approach may be somewhat
sterile. If, on the other hand, one
is able to find the “right” circumstances,
extremely powerful and ego-involving
attitudes and values, which
could not be duplicated in the laboratory,
can be studied. One might, for
example, do studies of children’s be- 
havior, expressive of attitudes or values 
in situations of illness or disaster, in 
first days at school, in separations from 
family or friends, or in serious com- 
petitive activities. 


VALIDITY AND RELIABILITY 


Problem formulation and data-gather- 
ing in research on children’s attitudes 
and values have been the emphases in 
this chapter. Problems of analysis have
generally been slighted in the belief
that characteristics of children necessitate
research adaptations mainly in the
problem formulation and data-gathering
phases. Sometimes, however, methods of
analysis are also involved. When prop- 
erties of data from children differ from 
“parallel” data from adults, identical 
analysis procedures in the scoring or 
ordering of responses or in the inter- 
pretation of findings may not be 
justified. Such examples have been 
noted, as in attitude scaling, in which 
scalability for adults may not mean 
scalability for children (p. 675), and in 
the personality interpretations of rigid- 
ity in attitudes of children and adults 
(p. 658). The importance of designing 
studies of attitudes or values, with data 
analysis as much in mind as data collec- 
tion, is underscored; the fuzzy nature 
of indices of attitudes and values only 
makes the demands on analysis more 
stringent. For the specific and detailed
discussions of data analysis, the reader 
is referred to manuals and papers deal- 
ing with analytic procedures in social 
and psychological research, such as 
Berelson (1956); Festinger and Katz 
(1953); Jahoda et al. (1951). 

Questions of validity and reliability 
of attitude measures are issues at all 
stages of a research project, and some 
attention is due them in regard to the 
techniques that have been discussed for 
children. Investigators in this field often 
ignore these issues. The difficulties in 
validating inferred processes such as at- 
titudes and values are obvious major 
deterrents. However, some examination 
of attitude and value assessments is 
usually possible in terms of what they 
predict, how they relate to other in- 
dependent measures of presumably the 
same processes, or how they conform 
to an accepted theory or validated re- 
lationship. 

Because data on attitudes and values 
vary greatly from items that are “self- 
evident” in meaning (“Which class- 
mates do you like best?”) to items re- 
quiring complex inferences (“What will 
the boys in this picture do if the teacher 
asks them to stop what they are do- 
ing?”), the questions of validity are 
varied too. Face validity is more readily 
accepted for items of the first illustra- 
tion, in which the response has a com- 
mon meaning for all, than for data of 
the second kind. The reliance on face 
validity has in its support the argument 
that we are, after all, interested in the 
subjective orientation of the child, his 
reality, and his evaluation of a situa- 
tion presented to him in the testing. 
This position is defensible only if the
investigator is content to restrict his
conclusions very carefully. However, 
he is in a stronger position by compar- 
ing results based on his verbal instru- 
ment or behavioral test against theoreti- 
cally reasonable external criteria. Al- 
though not a test of “validity” in the 
strictest sense (since the criteria are not 
direct measures of the inferred vari- 
ables), such checks add to the known 
character and meaning of the attitudi- 
nal data obtained. 

The reliability of children’s responses 
is also very puzzling to the investigator. 
To characterize a process in the indi- 
vidual child, to compare individuals or 
groups, or to predict to other processes, 
a consistency or stability in the attitude 
measurement is necessary. It is well 
documented that several factors tend to 
contribute instability to children’s re- 
sponses. The tendency for children’s 
reactions to be situationally determined 
to a very considerable extent has often 
been observed. This applies more to 
younger than older children. Consistency
in retest responses may, therefore,
be low, even though at the time of the 
test the child may truly be expressing 
his feelings of the moment. This sug- 
gests the necessity for a wide sampling 
of an attitude within the child subject 
before one assumes “a generalized re- 
sponse tendency” or a stable measure. 
Also, it calls for closer attention to 
situational influences in the measure- 
ment circumstances themselves, which 
may contribute to stability or instabil- 
ity. Among these influences, the rela- 
tionship of investigator to child is an 
exceedingly influential situational variable
(see pp. 668-669).
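
The retest consistency discussed above is commonly summarized as a correlation between scores from two occasions. The following is a minimal sketch, not a procedure prescribed in this chapter: the function name `pearson_r` and the five pairs of scores are hypothetical, and a single coefficient does not separate true change in the child from unreliability in the instrument.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length score lists."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two equal-length lists with at least two scores")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sum of cross-products and sums of squares of deviations from the means.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("scores must vary on both occasions")
    return cov / sqrt(var_x * var_y)


# Hypothetical attitude scores for five children on two occasions
# (illustrative data only, not taken from any study cited here).
first_test = [12, 15, 9, 20, 14]
retest = [11, 17, 8, 18, 15]
retest_reliability = pearson_r(first_test, retest)
```

A low coefficient from such a calculation would be read, in line with the text, as a prompt to sample the attitude more widely and to examine the measurement situation before concluding that the attitude itself is unstable.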

Changes over time as a function of
maturational and experiential factors
may be anticipated in such fluid and 
formative processes as children’s atti- 
tudes and values. There may be more 
dramatic changes, and changes that oc- 
cur within shorter time intervals, for 
the child than the adult. Teasing out the 
various sources of change may be exceedingly
difficult. A dramatic example
of change on retest after a new experi- 
ence in the life of a first grader is shown 
in the following: a Negro child in an 
all-Negro school in a northern city was 
examined in interviews intended to 
measure racial attitudes. Negro and 
white dolls were used in the interviews. 
On the days between interviews a white 
substitute teacher replaced the regular 
teacher who was ill. A marked change 
in content occurred in the child’s doll 
play. The adult white doll figured 
prominently in nurturant interaction 
with the child dolls, both Negro and 
white, in the second test. In the first 
test the adult white doll had been com- 
pletely ignored. Thus the careful in- 
vestigator of children’s attitudes and 
values must be alert to more than the 
characteristics of his testing techniques 
if he is to avoid errors in research conclusions.


Conclusions 


Attitudes and values of children have, 
for a rather long period, occupied 
fringe areas in the systematic literature 
of child development. Their incorpora- 
tion into more general developmental 
theory would be valuable in develop- 
mental and social psychology alike. 
Their usefulness has been demonstrated
in investigating the social contexts of
children. The intrafamilial contexts, the 
effects of parental handling upon chil- 
dren’s attitudes and value systems, and 
the interactive effects of children’s at- 
titudes and values and parental handling 
upon other consequents in child de- 
velopment are areas in which these con- 
cepts can be given further exploration. 

The investigation of inferred proc- 
esses, such as attitudes and values, makes 
difficult demands on research technol- 
ogy and on the technique of the re- 
searcher; the qualities of the instrument 
and the context and motivation that the 
investigator provides are inseparable 
components. Children are difficult re- 
search subjects, and only the begin- 
nings of methodological research bear- 
ing on their functioning as research 
subjects have been made. For example, 
we need to inquire much more care- 
fully into fundamental questions, such 
as the conditions motivating children 
in research tasks. Can candy M & M’s 
be used like food pellets with rats to 
reward the child and to engage and 
maintain his motivation in performing 
and answering questions? It seems 
doubtful. We have progressed in more 
sophisticated formulation of research 
problems. We must keep pace in our 
sophistication relating to the child as a 
participant in the more varied research 
in which he is now included. 

No one seriously engaged in worthy 
research on the normal adult personal- 
ity or on adult attitudes and values 
would place knowledge of an investiga- 
tory technique as the primary value 
and requirement of the investigator 
and knowledge of adult motivation as 
the secondary. Yet this ideology and
practice are still current in research
with children. Because children are not 
just simplified adults and because 
knowledge of the developing organism 
is not acquired incidentally, while ply- 
ing a technique, this ideology can only 
impede research progress. A closer bal- 
ance in the investigator of knowledge 
and interest in technique and theory 
with knowledge and interest in the child 
will result in sounder contributions to 
the psychology of children and development.


Motivation and Affect


Daniel R. Miller 
The University of Michigan 


It is striking to see how often a paper 
on motivation ends with the solemn 
protestation that the study has been an 
exploratory one. To demonstrate how 
many loose ends need tidying up, the 
writer usually suggests topics for 
further research. Often the list is a long 
one. The explorer of human motives 
and affects has good reason to qualify 
his empirical results. Different maps of 
the motivational domain present con- 
sistent pictures of a few well-traveled 
routes but provide only vague sketches 
of some areas and label others as terra 
incognita. As a result, the investigator 
often has to rely on arbitrary definitions 
and theoretical assumptions. His prob- 
lems are further complicated by a large 
variety of instruments, the strengths and 
weaknesses of which are only partially 
known. Yet a perusal of studies pub- 
lished within the last fifteen years 
reveals considerable progress, not only
in the testing of traditional techniques,
but also in the construction of ingenious
new ones that have done much
to enlarge theoretical horizons. This
chapter is devoted to descriptions and
evaluations of the techniques that are
commonly used in the study of motivation
and affect.


Contents of the Chapter 


ORGANIZATION OF TOPICS 


Methods for studying motivation may 
be divided into two general types. 
Some are used to elicit behavior, and
some are used to measure the results.
The first category consists of physical 
and psychological methods for creat- 
ing desired conditions in one’s subjects 
and methods for selecting groups whose 
previous backgrounds have created 
conditions of interest to the experi- 
menter. The second includes techniques 
such as time samples, ratings, rankings, 
checklists, projective tests, official rec- 
ords, the interview, and the question- 
naire. In applying both types of 
methods, investigators observe and con- 
sult with children, their peers, their 
parents, their teachers, and profes- 
sional personnel. 

This chapter has two primary parts, 
which are concerned with the two 
major kinds of methods. The parts are 
divided into sections, each of which 
is devoted to the different variants of a 
method. The section on verbal reports, 
for example, is devoted to interviews, in- 
ventories, questionnaires, compositions, 
and the like. Each specific technique is 
considered in light of the theoretical 
topics to which it has been applied and
the age levels at which it has been tried.
Purposes of studies and their results are
sometimes cited when they have a bearing
on the reliability, validity, or theoretical
import of a method. When
relevant, mention is made of the source
of information, such as peers or
teachers.

The primary parts of the chapter
and some of the sections are concluded
with evaluations of the techniques.
The possible pitfalls are described not
to paint a dismal picture of current
methods but to stress the importance
of finding the correct technique for 
each problem. As will become evident, 
the variety of promising methods offers 
a rosy picture of future theoretical de- 
velopments. Some general methodologi- 
cal problems involved in the validation 
of tests of motivation, the selection of 
a sample, and the control of the ex- 
perimenter’s influence on results are 
saved for the end of the chapter. 


AGES 


In line with the subject of the book, 
the chapter is restricted to techniques 
of studying the motivation of children. 
The label of children is arbitrarily re- 
stricted to subjects who have not 
reached the end of their eighteenth 
year, the age when secondary school 
is completed. The range from birth to 
18 years is most readily split into four 
segments: birth to 2 years, or babyhood; 
3 to 6 years, or the preschool ages; 
6 to 12 years, or the first six grades 
of elementary school; 12 to 18 years, or 
adolescence. When references are made 
to such terms as babyhood or adolescence,
they refer to the age ranges
just cited. Usually the subjects’ ages
are given to the nearest year. 

Research on adults is not considered 
in this chapter except for papers rel- 
evant to the methodological problems 
involved in studying children. It is 
possible that some instruments that have 
been tried only on adult populations 
would be ideally suited for research 
with younger subjects. But speculations 
on untried techniques can be misleading. 
The many promising ones that have 
been tested on juvenile populations 
provide the examples cited in this sum- 
mary. 

Three topics, values, percepts, and 
physiological reactions, although very 
relevant to motivation and affect, re- 
ceive little attention in the chapter. 
These omissions are prompted by the 
detailed consideration of these topics 
in other chapters. 

Finally, it is important to stress that 
this chapter does not represent a cover- 
age of all the literature on the motiva- 
tion of children. The available space 
would not permit even a survey of the 
enormous literature on only one of 
the methods, that of projective tech- 
niques. What has been attempted is a 
survey of the different kinds of meth- 
ods. Representative studies of each 
kind of method have been cited in 
order to describe it and to illustrate its 
characteristics and its strengths and 
weaknesses. 


THE EXPERIMENTAL 
CREATION OF GROUPS 


Psychologists have devoted con- 
siderable effort to methods designed to 
elicit certain motivational and affective
states. Many of the attempts have been
ingenious and effective. The different 
techniques may be divided into four 
categories, which provide the outline 
for this initial part of the chapter. A 
child can be manipulated physically by 
such methods as being stroked or being 
unsupported. He can be given some 
sensory stimulation, such as the sight 
of a smiling experimenter or of a doll 
with its eyes at the back of its head. 
He can be subjected to certain psy- 
chological pressures, such as incentives, 
frustrations, and conflicts. Finally, he 
can be manipulated by the experi- 
menter’s structuring of the interpersonal 
situation: the subject can be left alone; 
he can be allowed to interact with 
another person; he can be included in 
a group; he can be exposed to people 
with particular identities who behave in 
particular ways. 


Physical Manipulation 


ELICITING AND RECOGNIZING 
INNATE AFFECTIVE STATES 


Simple physical stimuli were used 
by Watson and Morgan (1917) in one 
of the earliest experimental studies of 
human affect. Observations and manipu- 
lations of infants during their first 
months of life led the investigators 
to conclude that fear, rage, and love 
are part of the “original and funda- 
mental nature of man.” Signs of fear 
occurred in response to loss of support, 
loud sounds, and being jarred. Rage 
was usually provoked by hampered 
movements, and love was elicited by
stroking or by manipulation of erogenous
zones.

Conditioning of Affect. In one of the
most influential papers in the history of 
American psychology, Watson and 
Rayner (1920) described the condition- 
ing of a little boy named Albert. The 
unconditioned stimulus was a loud
noise, which startled the child and 
caused him to withdraw. First there
was a number of trials, during which 
the loud noise was combined with the 
presentation of a white rat. Soon Albert 
shrank with fear at the sight of the 
animal, even when there was no sound. 
Whether or not the connection between 
the animal and the noise need be in- 
terpreted as a conditioned response is 
a debatable issue, but no one questions 
the possibility that a person can learn 
to fear an object because it is associated
with another that was originally feared. 

Physical Stimulation of Affect. Aside 
from the people who tried to check 
Watson’s claims, few investigators have 
studied the affective results of physical 
stimulation. To investigate the causes 
of infants’ laughter, Washburn (1929) 
tickled them and also raised them above 
her head where she jiggled them gently. 
Justin (1933) elicited laughter in pre- 
school children by tickling them with 
an ostrich feather. 


Other Sensory Stimulation 


VISUAL STIMULI 


Laughter. Justin (1933) tested the 
effectiveness of different stimuli in making
preschool children laugh. The
stimuli included a jack-in-the-box, a
picture of a long-nosed man, a doll with
eyes at the back of its head, a chair with
shoes on its legs, a social smile by the
experimenter, and his falling off his
chair.

Smiling. Spitz and Wolf (1946) made
a sophisticated attempt to analyze some
properties of the adult’s smile which
elicit smiling in infants. In different
sessions the experimenter first smiled,
then grimaced, and finally wore a mask. 
Then a life-sized puppet and various 
inanimate objects were presented to the 
subjects. Between the ages of roughly 
2 and months the infants smiled, 
laughed, and gurgled when they saw the 
smiling, grimacing, or masked man or
the puppet. None of the responses was 
forthcoming on presentation of such 
inanimate objects as a flashlight, a bell, 
a block, and balls. In Spitz and Wolf's 
opinion the infant smiles on perceiving 
a configuration of certain static ele- 
ments within the human face. 

Fear. Ellesor (1933) employed novel 
visual stimuli to elicit fear in 2-year- 
olds. Among the stimuli were a red 
devil’s mask on a doll 27 inches long, a 
brown fur neckpiece with a mouth and 
glass eyes, a flashlight, and films of new- 
born infants and of a girl in her yard at 
home. At first the children whimpered,
retreated, and stared. Soon, however, 
they approached, investigated, and 
played with the frightening objects. 
With repeated stimulation, the subjects 
smiled and seemed content to play with 
the objects. 

Causes of fear in young children have 
also been studied by presenting the detached
arm of a doll (Valentine, 1930),
isolating each child, striking an iron
pipe with which he was playing (Jersild 
and Holmes, 1935), asking him to re- 
trieve a ball from a dark room, and 
placing him in a room with a stranger 
(Shirley, 1942; Arsenian, 1943). Re- 
actions to strange situations and to 
strange people varied with age and sex 
of the children and with the extent of 
previous overprotection and rejection. 


AUDITORY AND VISUAL STIMULI 


Smiling. Washburn (1929) elicited 
smiling in young infants by such 
methods as smiling and simultaneously 
making “chirruping” sounds, lowering 
her head to the infant’s body and saying 
“ah-boo,” suddenly reappearing from 
under the table saying “boo,” sud- 
denly emerging from a cupboard, clap- 
ping her hands in rhythm, playing peek- 
a-boo by first pulling a cloth over the 
infant’s face and later holding the cloth 
in front of him, and suddenly presenting 
him with a mirror in which he saw him- 
self. 


Psychological Manipulation 


NEUTRAL INSTRUCTIONS 


Negativism. Relatively innocuous in- 
structions sometimes arouse motiva- 
tional states, such as negativism and 
attention-seeking. To investigate nega- 
tivism in preschool children, Reynolds 
(1928) asked them to perform a num- 
ber of tasks, such as imitating move- 
ments like clapping hands, repeating 
one- and two-digit series, and desisting
from a particular activity. Bernstein
(1955) proposed six tasks to his sub- 
jects: that they separate from their 
mothers, play with dolls, choose a piece 
of candy, draw a person, play with 
finger paints, and play with cold cream. 
Children who refused more than three 
requests were classified as negativistic. 
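
The classification rule just described (six requests, with more than three refusals marking a child as negativistic) can be written out as a short sketch. The request labels below paraphrase the text; the function names, the record format, and the sample child are hypothetical and are not drawn from Bernstein's report.

```python
# The six requests listed in the text; labels are paraphrases.
REQUESTS = (
    "separate from mother",
    "play with dolls",
    "choose a piece of candy",
    "draw a person",
    "play with finger paints",
    "play with cold cream",
)


def count_refusals(responses):
    """Count refusals; `responses` maps each request to True if the child refused."""
    missing = [r for r in REQUESTS if r not in responses]
    if missing:
        raise ValueError(f"no record for: {missing}")
    return sum(1 for r in REQUESTS if responses[r])


def is_negativistic(responses, cutoff=3):
    """Apply the rule stated in the text: more than three refusals."""
    return count_refusals(responses) > cutoff


# Hypothetical record for one child (illustrative data only).
child = dict.fromkeys(REQUESTS, False)
child["separate from mother"] = True
child["draw a person"] = True
child["play with finger paints"] = True
child["play with cold cream"] = True
```

With four refusals the hypothetical child falls above the cutoff; with exactly three the same child would not be classified as negativistic, which shows how sharply a single response can move a subject across a fixed threshold.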

Attention-Seeking Behavior. As a 
means of investigating attention-seeking 
in children between 4 and 6 years of 
age, Gewirtz (1956) asked them to 
paint at an easel in the presence of an 
adult as long as they wished. Then he 
tabulated such responses as asking ques- 
tions, addressing comments to the 
adult, and seeking help. 


THREATENING INSTRUCTIONS 


Some reactions emerge only when 
children are subject to stress. Only then 
do they experience certain types of 
fear; only then are they impelled to 
act dependently. 

Fear. To study fears of children ap- 
proximately 1 to 8 years old, Holmes 
(Jersild and Holmes, 1935) directed 
them to walk on a board which col- 
lapsed. Then he asked whether they 
would voluntarily walk on it a second 
time. He also requested them to retrieve 
a ball from a dark room where it had 
been “inadvertently” thrown, to obtain 
toys placed near a stranger’s chair, to 
walk across a board 4 feet high, to pat 
a strange collie dog, and to get a toy 
from a box containing a garter snake. 
When a subject refused, he was urged 
to comply. If he was still reluctant, 
the examiner offered to accompany him. 

Dependence. Employing a method 
similar to that of Holmes, Heathers
(1953) made his subjects walk a plank
while blindfolded. The children were in 
the age range of the elementary school. 
The plank was balanced on springs, 
so that it could be tipped sidewise and 
at the end. While a subject was trying 
to maintain his balance, he was given the 
opportunity of taking the examiner's 
hand or of walking without help. The
choice was interpreted as an index of 
emotional dependence or independence. 


FRUSTRATING AND THREATEN- 
ING SITUATIONS 


The psychological literature on 
anger, regression, and fear reveals that 
psychologists have devoted considerable 
thought to methods of frustrating chil- 
dren. Subjects have been deprived of 
precious objects, asked to continue in- 
definitely with boring tasks, forced to 
make unpleasant choices, and made to 
fail in important endeavors. Each of 
these techniques is now considered in 
turn. 

Deprivation. Some experimenters 
have obviously been intrigued by the 
old saying that something is as easy as
“taking candy from a baby.” In addition 
to candy, children have been deprived 
of food, mothers, and toys, to name just 
a few of the varied objects. 

Food. At different times during the 
feeding of a 6-month-old baby Sears 
and Sears (1940) removed the bottle 
and held it in front of him. They re- 
corded the time it took him to cry and 
the frequency of his goal-directed 
movements, such as attempts to clutch 
the bottle. Davitz (1952) actually did 
take candy from his subjects. They 
were not babies, however, but children
between 7 and 9 years. Prior to frustrating
them, the experimenter showed
them one film and then gave them some 
candy at the start of a second. When 
the film reached its climactic point, 
he stopped it and asked the children to
return the candy. Then he ushered them
into a playroom, where he told them
that there would be no more candy or
entertainment and that they could play
with any objects.

Toys. One of the most influential
studies of deprivation was performed by
Barker, Dembo, and Lewin (1941).
First the children, who were of preschool
age, played with one set of
toys. Then a more attractive but unavailable
group of toys was exposed
through a mesh screen. To study 
regressive reactions to frustration, the 
investigators tabulated signs of “primi- 
tivation” in the children’s subsequent 
play with the original set of toys. Block 
and Martin (1955), Korner (1947), and 
Thomas (1951) also conducted investigations
in which preschool children
were deprived of attractive toys. 

Separation from mothers. In an in- 
vestigation of security feelings, 
Arsenian (1943) observed the behavior 
of babies in a new situation in which 
they were deprived of their mothers’ 
company. The babies’ responses were 
compared to those made in more 
familiar situations or in the mothers’ 
presence. Bernstein (1955) used the request
to leave the mother as part of a
test of negativism. 

Satiation. To arouse anger and such 
subsequent reactions as loss of control 
and regression, experimenters fre- 
quently ask children to continue a 
simple task past the point of satiation.
Among the tasks on which children
have been satiated are packing spools 
in a rectangular rack on top of a box 
(Block and Martin, 1955), the crossing 
out of O’s on pages filled with lines of 
randomly placed X’s and O’s (Blum and 
Miller, 1952), filling a pegboard (Bur- 
ton, 1942; Yarrow, 1948), playing a 
very dull game with some sticks 
(Lerner, in Murphy, 1956), and drawing
a man 15 times in one sitting (Seashore
and Bavelas, 1942).

Conflict. There are few topics with 
greater theoretical prominence than 
conflict in psychologies of motivation, 
learning, and psychopathology. It is 
surprising, therefore, that children’s 
conflicts have not been investigated 
more often. 

Choice of acts. Block and Martin 
(1955) offered their subjects a choice 
between two incompatible acts. By 
cranking a toy, which unloaded candy 
into a large vessel, each child could ob- 
tain as much as he wanted. Once he 
decided to taste a piece, however, he 
could not accumulate any more. He
was thus constantly faced with the 
choice between further cranking and 
tasting the candy. 

Choice of objects. Barker (1942) 
asked 10-year-old boys to choose first 
between two liked beverages, such as 
orange and pineapple juice, next be- 
tween two disliked beverages, such as 
vinegar and a saturated salt solution, 
and finally between two neutral liquids. 
To indicate a choice, the subjects moved 
a lever toward a particular glass. Move- 
ments were timed by a concealed de- 
vice. The data consisted of decision 
times and frequencies of vacillations. 
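
To make the two measures concrete, the sketch below derives a decision time and a vacillation count from a record of lever positions. It rests entirely on assumptions that are not in Barker's report: the sampled `(seconds, position)` format, the sign convention (negative toward the left glass, positive toward the right), the `decision_threshold` value, and the definition of a vacillation as a reversal of the side toward which the lever leans.

```python
def summarize_trial(samples, decision_threshold=1.0):
    """Summarize one choice trial.

    samples: list of (seconds, lever_position) pairs in time order.
    A position below 0 leans toward the left glass, above 0 toward the right.
    The trial ends when the lever reaches the threshold on either side.
    """
    vacillations = 0
    last_side = 0  # -1 for left, +1 for right, 0 before any movement
    for seconds, position in samples:
        side = (position > 0) - (position < 0)
        if side != 0:
            # A change of side after an earlier lean counts as one vacillation.
            if last_side != 0 and side != last_side:
                vacillations += 1
            last_side = side
        if abs(position) >= decision_threshold:
            return {
                "decision_time": seconds,
                "vacillations": vacillations,
                "choice": "right" if position > 0 else "left",
            }
    # The lever never reached either glass within the recorded samples.
    return {"decision_time": None, "vacillations": vacillations, "choice": None}


# Hypothetical trace: the child leans right, then left, then commits to the right.
trace = [(0.0, 0.0), (0.5, 0.3), (1.0, -0.2), (1.5, 0.4), (2.0, 1.0)]
summary = summarize_trial(trace)
```

Under these assumptions the hypothetical trace yields two vacillations and a decision time of 2.0 seconds; a different definition of vacillation would give different counts, so the definition should be fixed before data are collected.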

Godbeer (1940) employed a similar
method with preadolescent children.
Each subject sat at a table in front of a 
vertical board with two windows. In 
a control trial one window was opened, 
the candy was exposed, and the child 
was told to secure it as quickly as pos- 
sible by pushing a handle. The move- 
ments of the handle were electrically 
timed and recorded by a concealed 
pantagraph, and the subject's eye move- 
ments were counted. In a second trial 
he was told to get the presents exposed 
behind both windows. He was thus 
torn by impulses to move the handle 
in the directions of different objects. 
In a third trial he was told that as soon 
as he pushed the handle for one window 
the shutter of the other would close. 
The data collected were decision times 
and frequencies of eye movements. 
Godbeer also studied choices between 
the dissimilar goals of gum drops and 
an equated number of tin soldiers and 
between the unclear goals contained in 
two small metal boxes, either of which 
might have contained candy. 

Creation of Failure. Difficult puzzles. 
Since children take tests in school, there 
is no shortage of subjects for studies of 
failure. With few exceptions, however, 
the investigator cannot rely on school 
performance. Children usually differ 
markedly in the meaning and im- 
portance that they attribute to a test. In 
addition, children vary markedly in 
their performances. The investigator
usually wants a test to have a relatively 
constant meaning for all subjects, he 
tries to arouse similar involvement in all 
subjects, and he attempts to control the 
performance. Some of the techniques 
of implementing these goals were used
by Keister and Updegraff (1937) who
asked children of preschool age to
work on a puzzle box one quarter inch
in depth. To be inserted in that space
were ten irregularly shaped colored
figures. These were one quarter inch
thick and depicted interesting objects,
such as a sailboat and an engine. Only
with some difficulty could the figures
be fitted into the box. A second test
required each child to move a five-sided
box with adjustable weights of
60 to 90 pounds at the ends and through
the middle. Before the subject began,
the box was raised momentarily to
show that it covered a group of attractive
toys. The child was told he could
play with the toys if he could raise the
box within ten minutes.

For his puzzle, Yarrow (1945) em- 
ployed a tinker toy. With it, his pre- 
school children tried to construct a 
simple-appearing but actually complex 
windmill. He measured reactions to 
failure by comparing the amount of 
aggression manifested in doll play be- 
fore and after the attempt was made to 
construct the windmill. 

Difficult tests. Jost (1941) induced 
failure by a memory task involving an 
exceedingly difficult set of numbers. His 
subjects were children of elementary- 
school age. First they learned a series, 
each unit of which was guessed before 
it was shown. Once the series was 
memorized, the subjects got a new one, 
which was structured quite differently 
from the first and was too difficult to 
recall. 

Timed tests. In quite a number of 
investigations failures have been created 
by telling the subjects they did not complete
the task within the required time.
Zeigarnik (1927) conducted one of the
earliest studies of this type. First she
allowed her children, whose ages varied
between 5 and 10 years, to complete
half of a group of tasks that she had assigned.
Then she asked the subjects to
recall as many of the tasks as they
could. The pattern of more recent
methods of investigating selective recall
was established by Rosenzweig and
Mason (1934), who tried to get their
subjects very ego-involved in the tasks.
These investigators asked a group of
crippled children to solve some puzzles
and offered a prize for the best performance.
The prize and the physical
handicap presumably created high in- 
volvement. Like Zeigarnik, Rosenzweig 
and Mason permitted the completion of 
only some puzzles. The children tended 
to forget their failures, presumably be- 
cause the desire to do well was so in- 
tense. 

Beardslee (in Miller and Swanson, 
1960) asked preadolescents to perform 
a series of tasks modeled after early 
tests of psychological abilities. The tasks 
included the completion of words be- 
ginning with the letters “ba,” pursuit, 
putting a dot in each of a number of
circles, rearranging scrambled sentences,
supplying the missing parts of faces,
mazes, coding, anagrams, and completing
number series. Half the tests were
interrupted before they could be com- 
pleted. Gatling (1950) gave his 
preadolescents ten jigsaw puzzles, none 
of which could be completed within the 
time limit and five of which had no 
solutions. Then he compared the numbers
of intropunitive and extrapunitive
explanations of the results given by
normal and delinquent boys. 


INCENTIVES


Praise and blame. There are at least 
two ways of getting a donkey to 
move. The driver can prod him or 
hold a carrot in front of his nose. 
Psychologists and educators have em- 
ployed both kinds of incentive in studies 
of performance in school. In one of the 
most influential studies Hurlock (1925) 
used praise and blame as incentives. 
Her subjects were boys and girls, white 
and Negro, and in the third, fifth, or 
seventh grades of elementary school. 
After taking an intelligence test, one 
group was severely reproved for its 
performance. A second was warmly 
praised, and a third was given no evalua- 
tion and requested to take the test again. 
A week later all the children took 
another form of the test. The groups 
who had been reproved and praised 
both improved significantly more than 
the control group. In a more recent 
study Garmezy and Harris (1953) 
analyzed the performance of palsied 
children whose ages varied between 6 
and 12 years. After failing on a peg 
board, the subjects were praised or 
reproved. The praise and reproof were 
of short and of long duration in two 
different conditions of the study. 

Concrete reward. Specific objects 
have often served as rewards for good 
performances. A prize was offered by 
Rosenzweig and Mason (1934) and by 
Zubin (1932). Garmezy and Harris 
(1953) provided candy, and Anderson 
and Smith (1933) designed a test of
strength so that a bell rang when the
person succeeded. When he did, he 
received a gold star. 

Invidious comparison. To involve 
subjects in a task, the investigator can 
encourage their competition with one 
another or with other groups. As a 
reward for competitive effort, Zubin 
(1932) offered a prize to each subject 
who surpassed the person just above 
him. Panlasigui and Knight (1930) 
produced charts depicting standards of 
accomplishment and the progress of 
individuals and the total class. Garmezy 
and Harris (1953) kept a score board 
with red masonite strips, each of which 
could be raised to indicate improvement. 
A miniature chocolate bar, the 
bait offered for a particular level of 
attainment, was attached to the score 
board at a specific height above the 
red marker. Veroff, Wilcox, and Atkinson 
(1953) administered a projective 
test to boys and girls between the ages 
of 16 and 18 years. To obtain an 
“achievement orientation,” the investi- 
gators challenged the subjects to do 
well on an anagrams test, stating that 
the results would be used in a competi- 
tion with another school. 

Controlling success and failure. 
Various methods have been devised to 
hold success and failure constant so 
that they would have the same values 
as incentives for different subjects. In 
a study of the levels of aspiration of 
boys 9 to 11 years old Worrell (1956) 
controlled performances on three dif- 
ferent tasks by means of prearranged 
sequences of scores. Anderson and 
Smith (1933) devised an ingenious 
method of controlling their subjects’ 
scores on a hand dynamometer. The 
amount of pressure apparently activated 
a visible liquid column, which the 
experimenters could manipulate without 
the subjects’ knowledge. 

Involvement in the task. A child has 
a strong incentive to do well on a 
task if the results are included in official 
records or reflect on activities in which 
he must succeed. To elicit “merit involvement” 
on the part of his 6-to-9-year-old 
experimental subjects Filer 
(1952) informed them that their attainments 
depended on their abilities. 
He told a control group that their performance 
depended upon chance. Zubin 
(1932) asked some subjects to sign their 
papers and others to leave their papers 
unsigned. Veroff, Wilcox, and Atkinson 
(1953) defined their test as one of 
creative imagination. 

To maximize the involvement of his 
subjects in a task, Worrell (1956) told 
them that it was the most important one 
they would perform, that it provided 
the most information about skills crucial 
for all sports, and that he was very con- 
cerned with the task. To elicit medium 
involvement, he said that the task was 
not quite so useful as the first one and 
that it provided less information about 
a person’s abilities. To minimize in- 
volvement, he said that the task was 
inferior to the others and was of no 
interest to him. Beardslee (in Miller 
and Swanson, 1960) attempted to get 
her subjects involved in a test by first 
asking them about ambitions and prob- 
able future jobs. Then she said that 
their performance would provide a 
good indication of their promise in 
their chosen fields. 

Temptation. Long ago the story of 
Adam and Eve made man keenly aware 
of the strong incentives provided by 
some temptations. Temptations offered 
by experimenters have varied with the 
topics under study. In one investigation 
of the aggression of 3- and 4-year-old 
children, the experimenter sat opposite 
a child and built a pile of eight red 
bricks (McKee and Leader, 1955). 
After two minutes she left, saying that 
she needed equipment from the next 
room. From that vantage point, she 
made a complete record of his behavior 
and comments. Many children 
yielded to temptation and knocked 
down the bricks. 

In “temptation” studies the subject 
is usually tempted to violate the condi- 
tions of a task. Dennis (1955), for 
example, invited his seventh and eighth 
graders in Hopi schools to take a test 
with their eyes closed. If they looked, 
their scores exceeded the upper limits 
that could be attained by anyone who 
followed instructions. Grosser, Polan- 
sky, and Lippitt (1951) provided 
temptation by means of a confederate, 
who pretended that he was a subject. 
After playing an interesting and com- 
petitive game of darts, the boys were 
assigned a dull task of drawing figure 
eights. At this point the experimenter 
left the room. The confederate stopped 
work a number of times, groaned with 
fatigue, and finally began to play the 
game of darts. 


PARTICIPANTS IN THE STUDY 


Regardless of the technique of manipulating 
behavior, decisions have to 
be made about the identity and behavior 
of people with whom the subjects 
are to interact. A number of different 
combinations of personnel have 
been used by various experimenters. 

Solitude. To study the fears of pre- 
school children, Holmes (in Jersild and 
Holmes, 1935) left them alone for two 
minutes. Arsenian (1943) separated a 
number of children from their mothers 
and left them alone in a playroom for 
five minutes. The solitude thus signified 
separation and, possibly, deprivation of 
love or protection. 

Child and adult. Most often the 
child is with the experimenter, who en- 
gages in some action like smiling or ask- 
ing the child to pet a dog. Sometimes, 
however, another significant adult is in- 
troduced, either in place of the experi- 
menter or as a third person. Holmes (in 
Jersild and Holmes, 1935) obtained the 
children’s reactions to a stranger. Mer- 
rill (1946) investigated the behavior of 
preschool children with their mothers. 
Each mother was left alone with her 
child for half an hour. Before the first 
session, she was asked to behave as 
though she were in the same room with 
him at home and unoccupied by household 
duties. Before the second session, 
it was implied to mothers in the experimental 
group that their children’s 
performance had not provided an altogether 
satisfactory sample of their potentialities. 

Little and Cohen (1952) investigated 
the levels of aspiration that asthmatics, 
4 to 12 years of age, reported in the 
presence of their mothers. Each subject 
shot at a target and gave his level of 
aspiration after every trial. His mother 
watched, kept score, and wrote her 
own aspiration before each trial. In Sanford 
and Risser’s study (1948) it was 
the mother who performed the task 
and the child who observed. Following 
the task, the child was asked to recall 
the different activities in which the 
mother had been engaged. The object 
of this request was to study the child’s 
selective recall of completed and un- 
completed tasks. 

Child and experimenter’s confeder- 
ate. A relatively infrequent but fruitful 
combination of personnel has involved 
the subject and a confederate or con- 
federates of the experimenter. Grosser, 
Polansky, and Lippitt (1951) employed 
a confederate who tempted subjects to 
engage in forbidden activities, and Lip- 
pitt (1939) coached his collaborators in 
methods of creating autocratic and 
democratic atmospheres. In a recent in- 
vestigation with children 12 to 14 years 
of age Brigante (1958) employed three 
confederates as “judges” of his subjects’ 
knowledge of sports. One judge was 
rewarding, one was neutral, and one 
was punishing. Later the experimenter 
obtained the subjects’ impressions of the 
judges. 

Child and child. Not uncommonly, 
the second person in an investigation is 
another child. To investigate overt, 
friendly behavior, Mengert (1931) ob- 
served the reactions to each other of 
2-year-old children who were put to- 
gether for twenty-minute periods. To 
study ascendance, Jack (1934) paired 
each child successively with each of 
ten others in a preschool group. A pair 
was allowed to play together for five 
minutes in a sandbox with three sets of 
toys. 


THE STUDY OF PERSONALITY DEVELOPMENT 


Advantages and Disadvantages 
of Experimental Manipulation 


TOPICS REQUIRING EXPERI- 
MENTAL TREATMENT 


Records of behavior in natural situations 
do not permit an adequate exploration 
of certain types of phenomena. 
Observations of reactions to failure 
provide a good example. On first consideration, 
observations of children 
when they take tests in school would 
seem to provide important information 
about reactions to threat of failure. Yet 
such information is sometimes not relevant 
to the questions of the study. To 
begin with, failure is relatively infrequent 
in schools; serious failure is rare. 
Hence, if an investigator concentrates 
on reactions to failure, he may have to 
wait a long time before he gets an adequate 
number of cases. He may also 
find himself handicapped by the differ- 
ent meanings that the same test may 
have for various children. The results 
of some tests are hard to interpret be- 
cause of complex subtests, unusual 
ranges of scores, or types of scores that 
make it difficult for children to com- 
pare themselves with one another or 
with an ideal norm. 

Very difficult and very easy tests 
also create problems in interpreting 
results. A good example is provided 
by the average discrepancy in experiments 
on “level of aspiration.” If 
the test is too easy, a low discrepancy 
can have a number of meanings. A person 
may have done much better than 
he thought he had a right to expect; 
hence he did not anticipate that he 
would improve very much. It is also 
possible that his scores were so close 
to the ceiling of the test that even the 
highest possible aspiration still resulted 
in a low discrepancy (Ausubel, 1951). 
Or he may have had a predisposition to 
set low goals for himself—the usual 
meaning of a low discrepancy. 
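
The ceiling problem can be made concrete with a little arithmetic. The
sketch below assumes the usual scoring convention, which the text does not
spell out: a discrepancy is the goal a subject states for the next trial
minus the score he has just obtained, and the average discrepancy is the
mean of these differences. The function name, the 20-point test, and the
scores are hypothetical and serve only as illustration.

```python
from statistics import mean


def average_discrepancy(performances, aspirations, ceiling=None):
    """Return (observed, largest_possible) average discrepancy.

    performances[i] is the score on trial i; aspirations[i] is the goal
    stated for the following trial. A discrepancy is assumed to be
    aspiration minus the score just obtained. If a ceiling is supplied,
    the second value is the average discrepancy the subject would show
    if he always named the highest possible score as his goal.
    """
    if len(performances) != len(aspirations):
        raise ValueError("need one stated aspiration per performance")
    observed = mean(a - p for p, a in zip(performances, aspirations))
    if ceiling is None:
        return observed, None
    largest_possible = mean(ceiling - p for p in performances)
    return observed, largest_possible


# Hypothetical subject on an easy test whose top score is 20.
scores = [18, 19, 19, 20]
goals = [19, 20, 20, 20]
observed, largest_possible = average_discrepancy(scores, goals, ceiling=20)
print(observed, largest_possible)
```

When the largest possible value is itself small, a low observed discrepancy
says little about the subject's disposition to set low goals.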

Even children who understand the 
meaning of a score may not take it as 
a true reflection of ability if they have 
not had sufficient time to study or if 
the test is excessively difficult. In the 
latter case, most children are discouraged 
and find it easy to blame the test 
rather than themselves for their poor 
performances. 

A subject is likely to be more con- 
cerned with doing well on a challeng- 
ing or interesting test than on a dull or 
meaningless one, and his self-esteem 
will be more affected by the results. 
Involvement in a test also fluctuates 
with inner values and interests as well 
as external incentives. Performance on 
verbal tests is often devalued by people 
from the working class (Davis, 1948). 
A test often has differential appeal to 
various children, depending on their 
interests. A boy who hopes to become 
a physician, for example, may 
not be at all depressed if he does poorly 
on a test of artistic skill, even if the test 
is a challenging one. The same per- 
formance may be very disturbing to a 
boy who intends to become a profes- 
sional illustrator. 

All of these variables—the interest, 
importance, and difficulty of a test, the 
clarity of results, the child’s concern 
with his performance, and the actual 
amount of success and failure—can be 
controlled in a careful experimental design. 
Most of the variables can be held 
constant by the designs of the instrument 
and the instructions. Interest and 
importance can be maximized or varied 
by instructions designed to implement 
these ends. Failure and success can be 
manipulated by such methods as prearranged 
scores and the use of a timed 
test. The extent to which the investi- 
gator has succeeded in controlling non- 
experimental variables can be evaluated 
in a pilot study. 


THE PLANNING OF EXPERI- 
MENTS AND PREVIOUS 
RESEARCH 


In deciding on the design of his ex- 
periment, an investigator can often 
profit from findings reported in the 
literature. The value of such findings 
can be illustrated by results pertaining 
to a common problem in designing 
studies of conflict. In much of this re- 
search the subject is asked to choose 
one of two objects. It may be the in- 
vestigator’s intention to compare the 
choices between attractive objects and 
between unattractive ones. In such a 
study, the objects themselves should be 
comparable, and their relative attrac- 
tiveness or unattractiveness has to be 
held constant if it is not to influence 
the results. How can one tell which 
variables of objects affect their comparability 
or relative attractiveness? A 
number of leads are provided by em- 
pirical studies, some of which are cited 
here. 

Availability. The relative attractive- 
ness of objects seems to depend in part 
on their availability. To study this 
variable, Wright (1937) offered each 
of a group of preschool children one 
of two lollipops which were suspended 
from strings. One lollipop was 11 inches 
away and within reach of the subject, 
and one was 11 inches beyond reach. 
Eleven of the 16 children selected the 
lollipop that was beyond their reach. 
From this result, Wright concluded 
that an obstructed goal has a stronger 
positive valence than a more accessible 
goal. He gave similar interpretations to 
the results of two other experiments. 
In one the children could select the 
available toys or the toys in a room 
that could be entered only after re- 
moval of bars from the door. The ma- 
jority of the children tried to get the 
toys behind the door. In a second ex- 
periment the subjects ranked the de- 
sirability of two groups of toys, one 
inside a wire cage and one outside the 
cage. In a majority of instances the toys 
in the cage were given first preference. 
Only when an available toy was a means 
to a goal rather than a goal in itself 
was it selected in preference to a toy 
behind a barrier. 
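
How strong is a split of 11 children out of 16? The source reports no
significance test for this result, so the following is only an illustrative
check under two assumptions that are not Wright's: that each child chose
independently, and that the null hypothesis is an even chance of picking
either lollipop. The function name is hypothetical.

```python
from math import comb


def binomial_upper_tail(successes, n, p=0.5):
    """P(X >= successes) for X distributed Binomial(n, p)."""
    return sum(
        comb(n, k) * p**k * (1 - p) ** (n - k)
        for k in range(successes, n + 1)
    )


# 11 of the 16 children selected the lollipop that was out of reach.
p_one_sided = binomial_upper_tail(11, 16)
print(round(p_one_sided, 3))  # 0.105
```

On these assumptions a split at least this lopsided would arise by chance
about one time in ten, so the lollipop result taken alone is modest
evidence.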

Also relevant to availability is the 
finding of Irwin, Gebhard, and Mildred 
(1946) that objects are preferred if 
they are to be kept rather than given 
to others. Irwin, Armitt, and Simon 
(1943) have reported that objects are 
preferred if they are to be obtained 
immediately instead of being available 
later. 

Association of object with other 
sources of gratification. Filer (1952) 
found that when objects could be 
earned by the skillful playing of a game 
they were chosen over objects that any- 
one could obtain. Another kind of re- 
inforcement was reported by Lambert 
and Lambert (1953), who asked each 
of their preschool children to exchange 
a red token for a white token and then 
to exchange the white one for a gumdrop. 
The children estimated the sizes 
of the two tokens from memory by 
adjusting the sizes of circles of light 
on a ground-glass screen. Assuming 
that recalled sizes are indexes of attractiveness 
or importance, it is interesting 
to note that the white tokens, 
which were closest to the reinforcing 
candy, were overestimated more than 
the red ones. 

Substitutability. Comparability and 
relative attractiveness have also been 
studied in research on substitution of 
objects. To explore the substitutes of 
preschool children, Sliosberg (in 
Barker, Kounin, and Wright, 1943) 
first allowed each subject to take some 
candy into his mouth. Then the experimenter 
asked him to put it down 
and take pieces of cardboard similar to 
the candy in size but not in shape. Most 
subjects refused the cardboard. During 
a play session they did accept card- 
board as food for a doll. They also 
accepted a paper ball as a present for 
a doll. In short, the simulated food 
could not satisfy a real need and could 
not be substituted for a real satisfier, 
but the simulated food could satisfy an 
unreal need. When the children were 
building with blocks, they accepted 
pebbles as substitutes but not wooden 
animals. The animals could not be used 
because they had fixed meanings. For 
the same reason, the children would not 
accept wooden animals in place of cars. 


THE MEANING OF EXPERIMENTS 
TO THE SUBJECTS 


When a psychologist succeeds in 
controlling all variables but the experimental 
one, he can confidently attribute 
the resultant variation to his manipulations. 
If he is clear about the 
meaning of the experimental variable, 
he can measure it objectively and manipulate 
it in a manner that has identical 
effects on all subjects. 


So goes the ideal version of an experiment. 
Sometimes this version comes 
close to describing the actual events. 
More often, complications arise. What 
is particularly hard to satisfy is the criterion 
that the situation be identical for 
all subjects. The difficulty stems from 
the existence of a large number of variables. 
The experimental results may be 
affected by other motives, the personality 
of the experimenter, the nature 
of the situation, and differences in the 
children’s backgrounds (Miller and 
Swanson, 1960). So complex is the social 
situation alone that the investigator 
can never hope to control all the nonexperimental 
variables that might conceivably 
affect his results. The best he 
can do is to control some of the more 
obvious potential sources of error variance. 

Among the myriad variables in hu- 
man behavior, some of the most diffi- 
cult to assess are those that affect the 
subject’s interpretations of the experimental 
situation and procedure. Sometimes 
these interpretations are very different 
from the experimenter’s. As a 
result, the findings are overgeneralized 
or misinterpreted in other ways. 

Previous Experiences. The extent to 
which the subject’s interpretations can 
affect results is illustrated by some of 
the findings in studies of goal prefer- 
ence and incentives. As has been indi- 
cated, Wright (1937) concluded that 
an available object is less attractive than 
a less accessible one: children prefer to 
work for a goal that is hard to attain 
rather than one that is easy to attain. 
On repeating some of Wright's studies, 
Child (1946) obtained different results. 
His subjects, who varied in age between 
2 and 8 years, chose the more available 
objects. Child then proposed a num- 
ber of possible reasons for the discrep- 
ancy in results of the two studies. In de- 
veloping his explanation, he made one 
basic assumption: differences in past ex- 
perience cause subjects to differ in their 
interpretations of the same situation. 
Some children, he thought, may have 
been challenged by their parents to 
overcome obstacles and then rewarded 
for successful efforts. Such subjects 
probably favored less accessible goals to 
a greater extent than children who had 
not been challenged and rewarded. In 
the experiment the latter may have 
found more interest in goals that were 
there for the taking than in the less 
accessible goals. It was also possible, 
Child thought, that some subjects had 
become satiated with the easily avail- 
able objects because they were used 
every day. The ones behind barriers 
may have been intriguing because of 
their novelty. 

Studies of incentives are also plagued 
by differential reactions to standard ma- 
nipulations. Suppose a teacher has just 
blamed a child for a poor performance. 
How will the child react? In part, his 
response will depend on the meanings 
and values he ascribes to the verbal in- 
centive. In part, his response will de- 
pend on the teacher’s previous inclina- 
tion to be complaining or supportive. 

Personality. The child’s personality 
also influences his reaction to blame. 
Blaming can elicit self-dissatisfaction if 
it is congruent with his own standards 
about achievement. Blaming can also 
gratify desires for attention or privilege. 
Research has indicated that in response 
to negative statements subjects with 
personality problems of excessive anx- 
iety do poorly in learning tasks (Bird, 
1927; Grace, 1948). Blamed introverts 
do well on a cancellation test (Forlano 
and Axelrod, 1937). In view of the 
many variables that can affect interpre- 
tations of verbally stated incentives, it 
is not surprising that studies of praise 
and blame have produced complex and 
somewhat inconsistent results. 

Artificiality. The artificial, even bi- 
zarre, qualities that may have to be 
imposed on the situation by necessary 
controls or manipulations are a possible 
pitfall of experimentation. Such quali- 
ties may arouse feelings in children 
such as self-consciousness, anxiety, and 
even fear. One wonders what children 
must have felt when they were ushered 
into a room stripped of furniture and 
saw two lollipops hanging from the 
ceiling or when they were offered 
cardboard in place of candy. Of course, 
these designs represented the halting 
steps of a new discipline. In the twenty 
years since such research was done 
psychologists have generally shown in- 
creasing sensitivity to children’s inter- 
pretations of tasks. 

Pretesting. The many uncontrolled 
sources of error, particularly those aris- 
ing from the differential feelings, in- 
terpretations, and motives aroused in 
subjects by experiments, point to the 
importance of a critical and occasionally 
neglected step in empirical research. 
It is not enough for an investigator 
to assume that children will interpret 
a technique in the anticipated 
manner. He needs independent evidence 
of the meaning the children assign 
to a task and of the feelings it 
elicits in them. Such information can usually 
be obtained in a pilot study. 


METHODOLOGICAL CONTRIBUTIONS 
OF EXPLORATORY 
INVESTIGATIONS 


In exploring the history of a science, 
one typically finds a sequence of investigations, 
each of which introduced 
new and sophisticated methods and 
controls and produced results that required 
modifications of earlier conclusions. 
Such a stream of research was 
initiated by Watson’s pioneering study 
of innate affective states and their 
arousal. To test the validity of Watson’s 
findings, Sherman (1927) asked 
a group of judges to interpret emotional 
expressions of infants by observing 
them directly and in motion pictures. 
It was possible to reproduce accurately 
the affects originally studied 
by Watson because he had conscientiously 
defined their characteristics in 
some detail. He had described rage, 
for example, in such terms as stiffening 
of the body, slashing movements of 
arms and hands, kicking, holding the 
breath, and screaming. When Sherman 
obtained judgments, he introduced a 
number of important controls. Emo- 
tional reactions were observed both 
with and without indications of the 
preceding stimulation and with and 
without sound. 

Considerable doubt was thrown on 
Watson’s conclusions by the observers’ 
inability to agree in their identifications 
of emotions depicted in the film. There 
was not even a consensus when the 
babies themselves were observed, even 
though they could be heard and their 
facial expressions could be seen. One finding 
was particularly damaging to Watson’s 
claims. When the stimuli and emotional 
responses were transposed in the movie, 
the judges’ guesses were usually more 
appropriate to the incorrect stimuli 
than to the infants’ affects. 

Subsequent research has cast even 
more doubt on the validity and reliability 
of Watson’s results. To determine 
the connection between restriction 
of movement and unlearned rage 
reactions, Pratt (1932) held the arms 
and noses of 67 infants. In 96 per cent 
of the cases the infants drew back their 
heads, hunched their backs, and in- 
dulged in restless and nonspecific body 
movements. When he held the infants’ 
arms, there was no resistance or action 
in 58 per cent of the cases. A few of 
the remaining subjects flexed their 
arms, but most of the reactions were 
described as minor. 

To verify Watson’s claims about the 
unlearned sources of fear, Irwin 
(1932a) subjected infants to loud noises 
from a cabinet held 4 inches from their 
heads. Body jerks were recorded on a 
moving polygraph tape. Startle reac- 
tions were usual, but they were never 
accompanied by crying or by manifes- 
tations of fear. In another investigation 
(1932b) Irwin dropped infants 2 feet 
from a supine position. When they 
were younger than 1 month, they made 
no overt responses in 12 per cent 
of the trials and gave some reactions in 
88 per cent of the trials. About half the 
instances of movement involved the 
arms and legs, and only 3 per cent 
were accompanied by crying. 

Obviously Watson’s findings were 
not supported by the results of the 
subsequent investigations. The emo- 
tions he thought he was observing 
could not be reliably identified under 
more controlled conditions. The asso- 
ciations that he found between certain 
stimuli and extreme affective states, 
such as rage and strong fear, were not 
obtained when his experiments were 
repeated. Although these results may 
seem disappointing, they demonstrate 
three of the primary assets of well- 
designed experiments. First, Watson de- 
fined his operations very carefully, so 
that his studies could be repeated and 
new controls inserted. Second, he de- 
scribed his findings in some detail, so 
that considerable doubt could be cast 
on their validity by the absence of supportive 
evidence in subsequent studies. 
The fact that noncorroborative as well 
as corroborative evidence was relevant 
to the validity of Watson’s predictions 
is an advantage of no small importance. 
Few investigators have succeeded in 
formulating predictions that could be 
disproven or the validity of which 
could be called into question by lack 
of supportive evidence. Finally, Wat- 
son developed useful techniques that 
have been improved by subsequent in- 
vestigators. The lack of support for 
Watson's conclusions applies not to the 
assumption of innate affective reactions 
but only to the connections he thought 
he found between certain stimuli and 
innate emotional responses. For some 
years the interest in innate reactions lay 
dormant. Now that the topic has been 
revived by the challenging observations 
of ethologists, the methodological gains 
resulting from the studies of affect 
should provide future investigators with 
some tested techniques for studying the 
affective responses of infants. 


PRACTICAL PROBLEMS 


It probably helps an investigator to 
clarify his thinking about his methods 
if he plans an ideal study of children’s 
motives. If he is at all practical, he will 
keep in mind the possibility that par- 
ticular conditions in the school or home 
may force him to lower his sights. The 
occasional necessity of compromising 
with one’s methodological standards 
can be humiliating and frustrating. The 
frequency with which experienced re- 
searchers have had to make such com- 
promises probably accounts for their 
tolerance of some common weaknesses 
in studies that neophytes are sometimes 
inclined to condemn. 

An investigator who collects data in 
schools must often design his study 
with an eye to limitations of space and 
of the students’ time. He may be al- 
lowed only two hours with each child, 
and the only free room may be the 
dispensary, which the nurse has to en- 
ter every twenty minutes. 

Even if the investigator has carefully 
explained the reasons for his presence 
in the classroom, he may find, months 
after the data were collected, that the 
teacher thought he really was there to 
evaluate her performance and that he 
was reporting his findings to the prin- 
cipal. Her anxieties may have been 
communicated to the subjects and have 
created an atmosphere that interfered 
with the purposes of the experiment. 

The unforeseen events that can eliminate 
all possibility of conducting the 
experiment or that may require radical 
revision of the design are a constant 
bother to the investigator. It may 
be impossible to collect data on a particular 
day because special tests have 
been scheduled for all the schools in 
the city. There are usually some sick 
students who cannot be seen later because 
they have heard about the procedures. 
Materials needed for the research 
may not have been delivered on time. 
When such emergencies occur, the investigator 
often has to make immediate 
judgments about the part of the study 
that he will sacrifice or about the 
changes he will make in the methods 
in order to compensate for the damaging 
omission of important controls. 

Fathers create their special practical 
problems. A foreign visitor once asked 
the writer why American child psychologists 
are always studying the male 
offspring of female parents.¹ The reasons 
for selecting female parents seem 
obvious. Fathers work all day and are 
often not available at night because they 
are busy with other activities, tired, 
cannot tear themselves away from the 
television set, or are convinced that the 
rearing of children belongs to the 
woman’s sphere of activities. Mothers 
are usually more cooperative. Many 
are alone for a good part of the day 
and usually welcome a visitor, particu- 
larly one who asks them about their 
lives. 


¹As for the tendency to select boys rather 
than girls, psychologists have done this because most 
of the theories of personality are fairly specific 
about the development of males but contain 
few or no observations about the development 
of females. Even in the old nursery rhyme 
the composition of the boys is described in 
such specific terms as “snakes and snails and 
puppy dog tails”; the description of girls 
begins with “sugar and spice” but then ends 
vaguely with “everything nice.” 

Mothers, too, can create difficulties 
for the interviewer. Occasionally one 
unburdens herself to such an extent that 
she makes the inexperienced interviewer 
fearful that he has disturbed her unduly. 
It is also not unknown for an 
interviewer to have difficulty in extricating 
himself from a home without 
offending a woman who has been talking 
about herself and her family for 
two hours and seems eager to continue 
for two more. 

During visits to families many other 
kinds of practical problems can arise. 
Their variety can be suggested by a 
few examples. Some parents avoid the 
investigator for fear he is a bill collector 
or an encyclopedia salesman; some are 
uncooperative because they do not 
want their children to take time off 
from their homework and their music 
lessons; some women are unavailable 
because they have to clean house; some 
are defensive or evasive about topics 
that have aroused guilt; some insist on 
witnessing the testing of the child despite 
the examiner’s plea for privacy. 

It is possible to avoid one practical 
type of difficulty that occurs when 
data are collected in schools. When 
there is a large group of subjects, the 
students’ grapevine will probably carry 
an account of the procedure through 
the school in a very short time. If the 
subjects are seen individually or in small 
groups, so that the examination takes 
some time, most of the children will 
enter the experimental room with a 
picture of the methods, often a dis- 
torted one. If the subjects’ preconcep- 
tions are accurate, a corroboration of 
the hypothesis may establish not its 
validity but the efficiency of the grape- 
vine in describing the ways children 
were supposed to behave. 

It is often possible to prevent mature 
subjects from gossiping. Older children 
are inclined to cooperate with an ex- 
aminer who asks them not to talk about 
the procedure and who explains the 
reasons for the secrecy. Such appeals 
are not effective with young subjects, 
but the damage of the grapevine can 
be circumvented by the simultaneous 
use of many examiners. Ten examiners, 
for example, can see two groups of ten 
children individually, one group im- 
mediately after the other. Twenty chil- 
dren can thus be seen under conditions 
that preclude communication prior to 
testing. If the examiner limits the num- 
ber of subjects in a school to those who 
can be seen in two contiguous sessions, 
he can be certain that no children had 
previous knowledge of his methods. 
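
The arrangement just described can be sketched in a few lines of Python. This is only an illustration: the function names, the roster, and the rule that "two contiguous sessions" is the safe limit are assumptions taken from the paragraph above, not a procedure reported by any investigator.

```python
def schedule_sessions(children, n_examiners):
    """Split a roster into back-to-back sessions, one child per examiner."""
    if n_examiners < 1:
        raise ValueError("need at least one examiner")
    sessions = []
    for start in range(0, len(children), n_examiners):
        group = children[start:start + n_examiners]
        # Pair each child in the group with a distinct examiner (1-based).
        sessions.append([(examiner + 1, child)
                         for examiner, child in enumerate(group)])
    return sessions


def grapevine_safe(sessions):
    """Assumed rule: at most two contiguous sessions.

    The second group is collected while the first is still being
    tested, so no tested child can brief an untested one.
    """
    return len(sessions) <= 2


# Hypothetical roster of twenty children and ten examiners.
children = [f"child_{i:02d}" for i in range(1, 21)]
sessions = schedule_sessions(children, n_examiners=10)
print(len(sessions), grapevine_safe(sessions))
```

With only five examiners the same roster would need four sessions, and the check would fail; the investigator would then have to reduce the number of subjects drawn from that school.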


EXPERIMENTAL SAMPLES 


In many experiments it is not possi- 
ble to obtain more than one sample of 
the subjects’ behavior. It may not be 
possible to repeat the method of ma- 
nipulating subjects without changing its 
meaning. Repetition may also be pre- 
vented by the complexity, costliness, 
and the time-consuming nature of the 
procedures. Compared to many experi- 
ments, then, techniques such as ratings 
and time samples have more of a chance 
of being reliable because they are based
on a considerable sampling of behavior.

Limitations in sampling provide a 
special problem in experimental studies 
of motives, which are dispositions that 
can be expressed in many different 
ways. Consequently, a number of dif- 
ferent manifestations of a motive may 
be needed before it can be identified 
reliably, and one or a few samples of 
behavior may be mislabeled. This weak- 
ness in experimental studies of motiva- 
tion was invoked by Blum and Miller 
(1952) to explain the marked differ- 
ences they obtained in the number of 
significant relations between their be- 
havioral criterion of orality and their 
experiments, on one hand, and between
the same criterion and variables such
as teachers’ ratings and time samples,
on the other.
The experiments yielded results close 
to chance, whereas the ratings and time 
samples yielded results that exceeded 
chance considerably. It is possible that 
the two methods differed with respect 
to validity. Yet the care with which 
they were designed and pretested led 
the investigators to attribute the results 
not to difference in validity but to the 
unreliability inherent in the limited 
sampling that was possible in the ex- 
periments. 
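
The dependence of reliability on the amount of behavior sampled can be illustrated with the Spearman-Brown formula, a standard psychometric result that is not cited in the text above. The sketch assumes that the separate samples are parallel measures of the same disposition; the reliability value of .20 is hypothetical.

```python
def spearman_brown(r_single, k):
    """Projected reliability when k parallel samples are pooled.

    r_single is the reliability of one sample of behavior; the
    formula assumes all k samples measure the same thing equally well.
    """
    if not 0.0 <= r_single <= 1.0:
        raise ValueError("r_single must lie between 0 and 1")
    if k <= 0:
        raise ValueError("k must be positive")
    return k * r_single / (1 + (k - 1) * r_single)


# Hypothetical figures: one experimental sample versus ten time samples.
one_sample = spearman_brown(0.20, 1)
ten_samples = spearman_brown(0.20, 10)
print(round(one_sample, 2), round(ten_samples, 2))
```

Under these assumptions a single observation with a reliability of .20 rises to about .71 when ten comparable observations are pooled, which is the kind of advantage the text attributes to ratings and time samples.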


Natural Experiments 


In the typical experiment the inves- 
tigator attempts to manipulate two or 
more groups in contrasting ways and 
predicts probable differences in reac- 
tions. Certain kinds of manipulations 
are not possible or feasible because they 
are unethical or because they create
anxiety in parents, teachers, or other
adults. Instead of manipulating subjects,
the investigator can test his hypotheses
by studying children whose behavior
has been affected by known experi-
ences. This section is concerned with
the selection of groups whose charac-
teristics have been determined by con-
ditions that occurred before the inves-
tigation was planned. Such a selection
takes advantage of “natural experi-
ments.”


Past Events 


EXPERIENCES IN THE HOME 


Relationships with parents and ex-
periences during the early years of life
have been invoked frequently to ex-
plain the origins of children’s later be-
haviors. Parents are usually ready to
supply information about topics such 
as methods of child rearing and the 
early illnesses and trauma of their off- 
spring. Besides supplying information 
to interviewers (Sanford et al., 1943; 
Sears, Maccoby, and Levin, 1957), par- 
ents have rated their children (Cattell 
and Coan, 1957) and kept records of 
different kinds of behavior (Jersild and 
Holmes, 1935). 


TRAUMATIC EVENTS AND 
NEGLECT 


Most of the papers on trauma are by 
psychotherapists, and the method of 
obtaining data has typically been clini-
cal observation of behavior. The
trauma of operations has been studied
by a number of investigators. Langford
(1937), for example, reported that after
operations, particularly those initiated
with ether anaesthesia, children were
inclined to suffer from attacks of anx-
iety. On observing children before and
after tonsillectomies and adenoidecto-
mies, Jessner, Blom, and Waldfogel
(1952) found an increase in psycho-
pathological reactions following the
operations.

Understimulation by adults and pa- 
rental neglect have also been investi- 
gated as sources of children’s patholo- 
gies. Spitz (1946) explored the origins 
of marasmus, a condition involving
abortive development of affects and
motives, among other symptoms. He 
and Wolf investigated the connection 
between anaclitic depression and the 
sudden separation of babies from their 
mothers at about the sixth month 
(1946). Freud and Dann (1951) made 
clinical observations of six children who 
had spent most of their first three years 
in a concentration camp. There they 
had received minimal care from a va- 
riety of adults. In addition to their anx- 
ieties and regressive behaviors, the 
children showed a strength in their at- 
tachments to each other that was sur- 
prising, in view of their initial lack of 
consistent personal relationships. 

Harsh parental practices provide a 
third source of data relevant to the 
origins of trauma. Huschka (1942) 
found an association between coercive 
bowel training and the subsequent tend- 
encies toward frequent expressions of 
fear and rage. Redl and Wineman 
(1951) described many of the later 
problems of children who were abused 
in their early years.


PHYSICAL PATHOLOGIES


From information that a child has 
had a particular illness or physical dis- 
ability, one can often infer the nature 
of his experiences and their probable 
effects on his motives and feelings. Be- 
tween their second and fourth years, 
children who developed constipation or 
colds, or had been in accidents, dis- 
played a marked increase in temper 
outbursts (Goodenough, 1929). Ac- 
cording to Greenacre (1944), children 
who in their early years had to tolerate 
physical restraints, such as metal leg 
braces or corsets, were predisposed to 
develop catatonia in late adolescence. 
Molitch (1937) reported a significant 
association between hypogonadism and 
rate of delinquency, and Little and 
Cohen (1952) found a significant dif- 
ference between the aspirations of 
asthmatic and nonasthmatic children. 


SOCIAL POSITIONS AND 
CATEGORIES 


The fact that a child is a particular 
sex or in a particular social class or so- 
ciety means that he has had to satisfy 
standards that differ from those re-
quired of people of another sex or so-
cial class or society. A number of in-
vestigators have compared members of
socially recognized groupings to in- 
vestigate the nature of their previous 
experiences and the motivational re- 
sults. Since standards of achievement 
are seldom so high for girls as they are 
for boys, it is not surprising that, at 
different age levels, girls have lower 
levels of aspiration than boys (Walter 
and Marzolf, 1951).

According to sociological accounts,
members of the middle class are trained 
to regard cleanliness as an important 
value and to inhibit the impulse to be 
messy. Such reports led Alper, Blane, 
and Abrams (1955) to measure fastidi- 
ousness in the painting done by chil- 
dren from the two social classes. It has 
been claimed by ethnologists that Hopi 
children are successfully trained to 
achieve self-respect by self-abnegation. 
To test this claim, Dennis (1955) com- 
pared Hopi and white children of the 
same ages with respect to their levels 
of ambition. 


Observation of Behavior 


DEVELOPMENTAL OBSERVATIONS
IN DIFFERENT SOCIETIES


Ethnologists are interested in the 
contrasting practices of subgroups in 
our own society and of people in dif-
ferent societies. Some ethnological re-
ports seem relevant to principles of
motivation that may apply cross-cul- 
turally. The swaddling of infants in 
Albania (Greenacre, 1944), for exam- 
ple, and the thumping of the heads of 
Siouan children to prevent them from 
biting their mothers’ breasts (Erikson, 
1945) have a bearing on the relation 
between degree of parental restraint 
and directness of anger. The socializa- 
tion of aggression among the Dobuans 
(Fortune, 1932) and among the Zuni 
(Benedict, 1934) provides information 
about the connections between differ- 
ent kinds of projection and the stability 
of social organizations.


OBSERVATIONS OF GENERAL
TRENDS WITHIN A SOCIETY 


Many investigators have constructed
records of the modal social behavior
of children in various age groups in the
United States or in a western European 
country. Representative of the results 
are the studies of Bridges (1933), who 
observed young infants, Isaacs (1933), 
who concentrated on children at the 
preschool level, and Gesell and Ilg
(1946), who observed preschool chil-
dren and those in the elementary grades.
Each investigator kept detailed records
of different kinds of behavior. Bridges,
for example, attempted to learn about
the typical reactions of babies in a hos-
pital for foundlings. Her observations
were made in many kinds of situations
and were reported in terms of the modal
infant. In the third month, she noted,
the average infant usually stops crying
at the sound of a comforting voice or
when someone approaches; in the sixth
month he cries and draws away on see-
ing an approaching stranger. In the
fourth and fifth months the baby strug-
gles and yells if something he wants is
taken away, and he responds to an ap-
preciative smile with playful attention.


SPECIAL SITUATIONS 


The restriction of observations to 
specific environments sometimes helps 
to clarify the behavior under study. 
Many investigators have conducted 
their research in the playroom or the 
classroom because these environments 
are sufficiently constant to permit re- 
liable interpretations of certain kinds 
of behavior. Time samples during the
free play of preschool children have
been particularly popular with many
investigators. Heathers (1955) tabulated
instances of dependence, such as the
seeking of approval, and instances of
independence, such as resisting the in-
terference of another child. Marshall
and McCandless (1957) compared the
frequencies of contacts with peers and
with adults in order to study relations
between social acceptance by other
children and dependency. Beaver
(1932) used the number of contacts
with other children as an index of ini-
tiative.

Schools sometimes provide unique
situations which the alert investigator
can exploit for purposes of testing his
hypotheses. Having noted that 2- and
3-year-old children need their teachers’
help to put on or remove snow suits,
Fales (in Anderson, 1940) used re-
sponses to offers of assistance as his
measure of level of aspiration. The ini-
tiative shown by some children in open-
ing zippers, getting arms out of coats,
and hanging caps on hooks revealed
that even subjects of such young ages
had different aspirations for specific
activities. Dillon (in Murphy, Murphy,
and Newcomb, 1937) observed the 
sexual expression of 3- and 4-year-old 
boys and girls in a nursery school. Dur- 
ing nap time the children undressed 
completely. Autoerotic activity was 
observed occasionally, but heterosexual 
play was rare. 


SPECIAL OCCASIONS 


Certain unique events have particular
significance for some motives. Arsenian
(1943), Heathers (1954), and Shirley
(1942) investigated the anxiety ex-
pressed by young children when they
first came to a clinic and were separated
from their mothers. Subjects in the
three investigations ranged in age from
1 to 7 years. Goodenough (1929), Levy
and Tulchin (1923), and Thomas 
(1929) studied the resistance of babies 
to taking tests. The investigators were 
interested in such characteristics as 
cooperativeness, emotional disturbance, 
and self-consciousness. 


Studies of Previously Established 
Groups: Assets and Liabilities 


The use of groups that already have 
the attributes of interest to the investi- 
gator allows him to study variables the 
manipulations of which are ruled out 
by ethical considerations. Even if he 
could, no experimenter would want to 
traumatize children by arranging for 
cruel parental treatment or by depriv- 
ing them of their normal amount of 
hormones. It is, unfortunately, all too 
easy to find children who have already 
been traumatized or who were hypo-
gonadic before the research was begun.

The selection of special groups some- 
times has another advantage. The data 
that they provide about the more ex- 
treme forms of behavior may facilitate 
the development of new psychological 
principles. Research on pathological 
subjects and certain primitive groups 
has contributed much valuable infor- 
mation about the origins of some mo- 
tivational states, which was not avail- 
able before such groups were compared 
with normal subjects in American so-
ciety.

Studies of pre-established groups
have one serious drawback. The many 
uncontrolled variables sometimes make 
it difficult to interpret the results. A 
significant difference between children 
from two social classes, for example, 
can be explained in terms of a consid- 
erable number of antecedent variables. 
There is always a question whether the 
difference is not attributable to some 
variable that was not controlled. The 
simultaneous analysis of a number of 
interacting background factors some- 
times changes the conclusions derived 
from analysis of only two variables. 
B. Allinsmith (in Miller and Swanson, 
1960) found that social class and di- 
rectness of expressed aggression were 
not significantly related to each other 
until she held intelligence constant. 
When Miller and Swanson (1958) con- 
trolled social class and “integration 
setting,” previously significant differ- 
ences in child-rearing practices among 
different ethnic and racial groups were 
no longer significant. 
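
One conventional way of "holding a variable constant" statistically is the first-order partial correlation. The sketch below is offered only as an illustration of how a relation can emerge once a third variable is controlled; it is not a reconstruction of the analyses cited above, and the three correlation values are hypothetical.

```python
from math import sqrt


def partial_correlation(r_xy, r_xz, r_yz):
    """Correlation of x and y with z held constant (first-order partial)."""
    denominator = sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))
    if denominator == 0:
        raise ValueError("x or y is perfectly correlated with z")
    return (r_xy - r_xz * r_yz) / denominator


# Hypothetical values: x = social class, y = directness of aggression,
# z = intelligence. The raw relation between x and y is negligible.
raw = 0.05
controlled = partial_correlation(r_xy=raw, r_xz=0.50, r_yz=-0.50)
print(raw, round(controlled, 2))
```

In this invented case the uncontrolled coefficient of .05 becomes .40 when the third variable is partialed out, because that variable is related to the two others in opposite directions and so masks their association.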

Another problem in using existing 
groups results from the fact that many 
are very large. One cannot hope to test 
all asthmatics or all people in the mid- 
dle and working classes. The individual 
investigator does not even command 
the resources to obtain a stratified sam- 
ple of the middle and working classes 
in a particular urban area. Given the 
limitations of his sample, it is often 
hard for him to determine the extent 
to which his findings apply to all mem- 
bers of a social class. It is similarly 
difficult to tell whether the results ob- 
tained from a particular sample of asth- 
matics can be generalized to all people 
with the disease or only to those in the
same social class and with the same
range of intelligence as subjects in the 
sample. 

Pre-established populations can have
another serious limitation. Samples of
these populations can be biased to an
undetermined extent by the uncoop-
erativeness of some members. The very
children who are too frightened to par-
ticipate in a study of neglected chil-
dren, for example, may have been the
most neglected in the population.

Unfortunately many investigators
have not been sufficiently sensitive to
such issues of sampling and control. It
has been all too common for observers
to develop norms for a total age group 
by observing infants in a foundling 
home, to investigate child-rearing prac- 
tices by interviewing parents in the 
upper-middle class, or to arrive at a 
general conclusion about an alien so- 
ciety after having lived with a few 
families. Recently there has been a 
trend toward increasing methodologi- 
cal sophistication, but there is still much 
room for improvement in sampling 
practices. 
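
The bias introduced by uncooperative subjects can be made concrete with a small numerical sketch. The scores, the refusal rule, and the function name are all hypothetical; the example assumes only that the most neglected children are the ones who decline to take part.

```python
from statistics import mean


def observed_and_true_mean(scores, refuses):
    """Compare the mean of the cooperating sample with the population mean."""
    participants = [s for s in scores if not refuses(s)]
    if not participants:
        raise ValueError("no subject agreed to participate")
    return mean(participants), mean(scores)


# Hypothetical neglect scores for a population of ten children.
neglect_scores = [1, 2, 2, 3, 3, 4, 5, 6, 8, 9]

# Assumed refusal rule: children scoring 8 or more are too frightened to serve.
observed, true = observed_and_true_mean(neglect_scores, lambda s: s >= 8)
print(observed, true)
```

Here the cooperating children average 3.25 while the whole population averages 4.3, so the sample understates the amount of neglect, and the investigator has no way of estimating the shortfall from the sample alone.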


METHODS OF OBTAINING 
AND MEASURING SUBJECTS’ 
REACTIONS 


VERBAL REPORTS 


The most obvious method of finding 
out what a subject is thinking or feel- 
ing is to ask him. For reasons that will 
be summarized after the following sur- 
vey of techniques, developmental psy- 
chologists have not been strongly in- 
clined to conduct verbal inquiries into 
children’s motives or affects. Person-
ality questionnaires and inventories
have gained in popularity as methods
for obtaining information from older
children. The objectivity of these in- 
struments probably accounts for their 
appeal. Individual interviews have been 
employed less frequently, and only oc- 
casionally have children been asked to 
fill out rating scales or check lists. Sub-
jects’ reports have been most frequently
sought in studies of affective reactions.
The different techniques are illustrated 
in the order of their degree of struc- 
ture. 

LESS STRUCTURED METHODS 


Compositions. To investigate anger, 
Bovet (1923) asked schoolboys to write 
compositions about fights. In his in- 
structions he asked for the reasons for 
combat and the description of a fight 
from beginning to end. 

Interviews. Jersild and Tasch (1949) 
interviewed children between 6 and 18 
years about the happiest days of their 
lives. Reasons for the happiness were 
tabulated for four age groups. Hicks 
and Hayes (1938) talked to children in 
the seventh to twelfth grades about 
their reasons for worry and the anger- 
provoking actions of other people. 


OBJECTIVE METHODS 


Ratings. Jersild, Goldman, and
Loftus (1941) asked fifth and sixth
grade children to tell whether each of
25 situations was a frequent, occasional,
or infrequent cause of worry. As part
of their study of masculinity and fem-
ininity, Terman and Miles (1936) gave
an inventory of emotional reactions to
boys and girls in the seventh and elev-
enth grades. The subjects rated the de-
gree of emotion they experienced in 34
situations, each of which was described 
with a view to arousing an emotion 
such as anger, fear, disgust, or pity. 
Hess and Goldblatt (1957) asked ado- 
lescents to rate themselves and to pre- 
dict how they would be placed on the 
same scales by their parents. Self-per- 
cepts and anticipated ratings were also 
obtained from the parents. Subjects in- 
dicated their positions on scales varying 
between such extremes as self-con- 
trolled and wild, loving and angry, and 
serious and frivolous. 

Interviews and Questionnaires. 
Sources of anxiety were the topics of 
an interview schedule that Ort (1952) 
administered to adolescent boys. The 
questions, most of which could be an- 
swered by yes, no, or an indication of 
doubt, pertained to sex, personal habits, 
aggression, and honesty. Hurlock and 
Klein (1934) administered a question- 
naire about adolescent “crushes” to 
boys and girls in high schools, and 
Finger (1947) asked college students 
to fill out a questionnaire about sexual 
practices, beliefs, and emotional reac- 
tions in the first years of adolescence. 


STANDARD INVENTORIES 


Adolescent. Subjects’ verbal reports 
are probably sought more often by ad- 
justment inventories than by any other 
methods. A number of inventories have
been used in studies of preadolescent
and adolescent subjects. A few exam- 
ples of the more carefully standardized 
tests are Washburne’s Test of Social 
Adjustment (1935), Mathews’ revision
of Woodworth’s Personal Data Sheet
(1923), and Cowan, McClellan, Pratt, 
and Skaer’s schedule (1935), which is 
an adaptation of Thurstone’s person- 
ality schedule and is designed to be 
administered to subjects between the 
ages of 12 and 16 years. 

Two popular inventories initially 
standardized on college populations, 
Mandler and Sarason’s questionnaire 
for test anxiety (1952) and Taylor's 
test of manifest anxiety (1953), have 
also been validated with high-school 
populations. Both tests were signifi- 
cantly related to the duration of ex- 
posure required for correct recognition 
of tachistoscopically presented words 
(Sarason and Gordon, 1953). Test anx-
iety was significantly related to recog-
nition threshold for certain words
(Smock, 1956). When asked to squeeze 
a dynamometer as soon as they heard 
a buzzer, high-school students with 
high anxiety scores on Taylor’s scale 
had significantly greater amplitude 
scores than a less anxious group (Cas- 
taneda, 1956). The latter had the greater 
speed of reaction at the weak intensity 
and lower one at the strong intensity. 

Younger Subjects. Rogers’ Test of 
Personality Adjustment (1931) is one 
of the most commonly employed with 
children in the upper grades of ele- 
mentary school. Items are phrased in 
terms of maladjustments at school, at 
home, and in the family. Castaneda,
McCandless, and Palermo (1956) de-
veloped a form of Taylor's scale for
subjects in the fourth, fifth, and sixth
grades. Forty-two items in the scale
pertain to anxiety, and eleven comprise
a lie scale. To validate the test, the au-
thors conducted a careful set of studies
that might well serve as a model for
authors of other tests. Compared to
children with low anxiety scores, chil-
dren with high scores did poorly on
the difficult components of a specially
designed psychomotor task but were
superior on the less difficult ones (Cas-
taneda, Palermo, and McCandless,
1956). In learning by trial-and-error, 
less anxious subjects were consistently 
superior to more anxious ones (Palermo, 
Castaneda, and McCandless, 1956). 
There were no significant correla-
tions between test anxiety and school
achievement or intelligence for fourth-
or fifth-grade boys. Correlations
were moderate to high for fourth-
grade girls and for sixth-grade students
(McCandless and Castaneda, 1956). It
seemed probable that very anxious chil-
dren would be handicapped in social
relationships with peers. Significant
negative associations between the mani-
fest anxiety scale and popularity, as
measured by sociometric status, were
found for fourth-grade boys and all
children in the fifth grade but not for
children in the sixth grade (McCand-
less, Castaneda, and Palermo, 1956).


Evaluation of Verbal Reports 


Many investigators are chary about 
using verbal reports because of their 
many sources of invalidity. For a 
child’s verbal report to be accurate, he 
must be aware of his reactions, they 
must lend themselves to verbalization, 
and he must have the necessary descrip-
tive terms in his vocabulary. Such con-
ditions are satisfied most readily in in-
terviews with adolescents. Younger
children often find it difficult to de-
scribe a feeling or judgment because
they are insufficiently practiced in
putting their reactions into words.
They also tend to be more swayed by
their emotions than are older subjects.
Finally, procedures used with
young children are time-consuming be-
cause they do not read well enough to
take a written test. In view of these
limitations, it is not surprising that
verbal reports are seldom sought from
subjects below the fifth grade.

Even when children have the ca- 
pacity for verbal introspection, shame 
or fear or anxiety about certain motives 
and feelings can produce distorted ac-
counts and even a deliberate withhold-
ing of some feelings. Taylor’s scale, for
example, is affected by the tendency to
avoid self-criticism (Jessor and Ham- 
mond, 1957). 

The possible connection between re- 
sistance and ratings is illustrated in 
Brigante’s study (1958) of boys be- 
tween 12 and 14 years of age. Selected 
because of their high interest in sports, 
the subjects were tested on their knowl- 
edge of this subject by three judges. 
One judge was rewarding, one was neu- 
tral, and one was punishing. There 
were virtually no differences in the 
ratings of judges when the boys an-
swered direct questions such as, “Do
you think Mr. _____ is a friendly or an
unfriendly sort of person?” The dif-
ferences in ratings became very sig-
nificant, however, when the questions
took an indirect line, such as, “How
do you think the other two judges feel
about Mr. _____?”

Anxiety about some motives can 
sometimes lead to their repression.
In reading about children’s reports
of fears, it is well to remember 
Freud’s patient, Little Hans, who suf- 
fered from a phobia (1924). Had the 
child been asked, he would have re- 
ported a fear of horses. He could not 
have reported that horses had been un- 
consciously substituted for his father, 
the original object of the fear. Strong 
affective and motivational states, par-
ticularly forbidden ones, are often sub-
ject to such defensive distortions. If a
person has projected his hostile feelings,
he sincerely attributes them to some- 
one else; if he has reversed them, he 
can only report very positive feelings 
to the object. 

There are special difficulties about 
answers to objective inventories. One 
problem is created by the lack of any 
provision for answers differing from 
those provided in the test. No matter 
how a question is asked, there is always 
a danger that its phrasing will prejudge 
the issue in a manner that distorts the 
respondent's intentions. A child may 
be disinclined to admit that he does 
cross the street in order to avoid people 
because he does it for reasons other 
than shyness. But there is no way for 
him to describe his reasons or the peo-
ple who elicit this behavior. Hence he
feels that the answer is yes in one sense 
and no in another. Since he thinks that 
saying he is doubtful would be mis- 
leading, he is not at all clear about the 
answer he should mark on the test. 

Another difficulty about inventories 
stems from their impersonal character. 
The subject gives his responses not to 
a sympathetic listener but to an anony-
mous reader. Would not any child feel
uncomfortable about revealing his se-
cret fears or his forbidden aggressive
impulses under such circumstances, par- 
ticularly when the answers may be 
read by school authorities? If the test 
is given to a group of children, there 
is usually little time for questions, so 
that it is difficult to determine the sub- 
jects’ rapport or to learn about differ- 
ent interpretations of the same item. 


Projective Verbal Techniques 


SOCIAL CONVERSATION 


Smith (1933) analyzed reasons for 
criticism mentioned in sentences spoken 
by children between 2 and 6 years of 
age. Fisher (1934) collected steno- 
graphic records of children’s speech and 
then computed the proportion of nega- 
tive comments. Hicks and Hayes (1933) 
coded more and less desirable ways of 
expressing aggression in junior-high- 
school classroom discussions. 


RESPONSES TO PROJECTIVE 
QUESTIONS AND RIDDLES 


Material with motivational signifi-
cance may also be inferred from ex-
pressions of wishes, statements of de-
sired changes in the self, and earliest
recollections. Winker (1949) obtained
this information from boys and girls
between the ages of 7 and 16 years.
Wolfenstein (1954) investigated the
reactions to anxiety of 5-to-12-year-old 
children by asking them to tell the 
jokes and riddles they liked best. On 
obtaining a joke or riddle, she inquired 
into its meaning and the subject’s as-
sociations. A similar method of analyz-
ing the motivational appeal of proverbs
was employed by Shimkin and Sanjuan 
(1953) in their research on responses 
to anxiety in rural Russia. 


VERBAL TESTS 


Reinterpretation of Other Tests. 
When taking a nonprojective verbal 
test, subjects may unintentionally reveal 
attributes that are irrelevant to the orig- 
inal purposes of the instrument. In their 
answers to Pressey’s Interest-Attitude
Test, adolescent delinquents signifi-
cantly exceeded normal subjects in ref-
erences to topics such as jail, death, sin,
murder, suffering, and guns (Durea,
1933). Apparently the delinquents were
very preoccupied with the most direct
forms of aggression.

Signs of Emotional Instability on 
Intellectual Tests. Bühler (1938), Bijou
(1942), Pignatelli (1943), and Granick
(1955), among others, have investigated
the possibility that a broad scatter of 
scores and poor performance on cer- 
tain subtests of intelligence are charac- 
teristic of emotionally disturbed or 
anxious subjects. Both Granick and 
Pignatelli, who reviewed the literature, 
came to the conclusion that the results 
thus far have been equivocal. Although 
there are suggestive trends, no clear- 
cut signs of anxiety or maladjustment 
have been established. 

An Objective, Projective Test. Scor- 
ing patterns that the subjects cannot 
anticipate also provide the basis for 
analyzing responses to Krout’s Personal 
Preference Scale (1954). This instru- 
ment was standardized on subjects 14 
years old and above. A subject is re-
quired to indicate whether he likes, dis-
likes, or feels indifferent to such activi-
ties and objects as sleeping alone in a
house, sucking candy, eating crunchy
foods, reading ancient history, and rais-
ing money for charity. Among the
scoring categories are certain motiva-
tional states, levels of psychosexual fixa-
tion, masculine and feminine identifica-
tions, and maturity. The test discrimi-
nated significantly between normals
and neurotics, between both and schizo-
phrenics, and between males and fe-
males. In support of the test, Krout
also described a number of cases whose
scores were significantly related to in-
dependent criteria.

FREE ASSOCIATION 


method (1910) has seldom been
employed with children, prob-
ably because of doubt concerning their 
capacity to cooperate. The suitability 
of the technique to adolescents has 
been demonstrated by Fauquier (1939), 
who administered the stimulus words 
hate, fear, love, and desire to two 
groups of 15-year-old boys. Compared 
to uninstitutionalized normal subjects, 
a higher proportion of institutionalized 
delinquents gave associations expressing 
hatred of persons, violence, and fear of 
animals.


Story Techniques 


What could be more natural for chil- 
dren than listening to stories or making 
them up? Such activities are familiar 
and entertaining and make it easy for 
children to talk without being self-
conscious. This survey of story tech-
niques begins with the relatively un-
structured request to tell a story about
anything, continues with more struc- 
tured techniques, such as the comple- 
tion of standard themes, and ends with 
objective methods that provide the 
subjects with standard choices. 


TELLING STORIES 


No Assigned Themes. As a means of 
studying the time orientations of sub- 
jects who were 8 to 10 years of age, 
Leshan (1952) asked them to “tell me 
a story.” He anticipated that children 
from the middle class would think in 
terms of long time periods, since such 
an orientation would implement their 
concern with self-control in the service 
of future goals. As he predicted, sub-
jects from the working class had a
much narrower time span and a less 
definite picture of the future than did 
subjects from the middle class. In their 
research on time span Barndt and John- 
son (1955) found that delinquents be- 
tween the ages of 16 and 18 years could 
not start a story on their own. When 
they were provided with a neutral theme 
about two boys who were walking 
along a street near the edge of a town, 
the subjects were then able to com- 
plete the story. The results were ana- 
lyzed with respect to time spans and 
references to crime and unhappy end- 
ings. 

Assigned Themes. Wright (1942) 
investigated the reactions to conflict of 
children between 5 and 8 years of age 
by asking them to tell a story about a 
toy. First the children ranked 11 toys 
in order of preference. Next, each child 
was shown the toys he had ranked most
and least attractive and asked to select
one for himself and one for another 
child. Following the choice of gifts, 
each child was asked to tell a story 
about one of three toys, two of which 
had been in the initial pair and one of 
which was new. The selection of a 
preferred or rejected toy as a topic for 
story telling was done twice. Stories 
were analyzed for frequency and de- 
gree of conflict. 


REACTIONS TO STORIES 


Children’s Responses to a Story. To 
explore the emotional reactions of 
4-year-old children and their mothers 
to the birth of siblings Wolfenstein 
(1946) requested the mothers to read 
a story to their offspring and to report 
the ensuing events. The investigator 
also observed the children’s behavior 
during play sessions before and after 
the reading. The story described a 
child’s version of his mother’s preg- 
nancy, the birth, and the return of the 
mother and baby from the hospital. 
Wolfenstein tabulated different reac- 
tions to the story and related the results 
to such factors as sex, presence of sib- 
lings, circumstances under which the 
story was read, and mothers’ attitudes 
toward their children’s ambivalence. 

Memory of a Story. To elicit material 
relevant to the problems of emotionally 
disturbed children, Despert and Potter 
(1936) asked them to reproduce a 
story they had previously heard. 
Frenkel-Brunswik (1949) used the same 
method to investigate the emotional as- 
pects of prejudice in young adolescents. 
The story began with the description 
of 11 children, one of whom was a
Negro and another of whom was Jew-
ish. Different children were then de-
scribed as being aggressive or protec-
tive to three newcomers, whose at-
tempts to defend themselves were fu-
tile. The story was read once and the
subjects were then asked to reproduce
it in writing. Children high in prejudice
significantly exceeded those low in
prejudice in distortions, references to
hostility, and recall of undesirable facts
and unfavorable comments about the
Negro. Children low in prejudice re-
called the stories with greater accuracy
and displayed more awareness of indi-
vidual differences.


COMPLETION OF THEMES WITH 
MINIMAL STRUCTURE 


Incomplete Sentences. Investigators 
differ in the amount of structure they 
provide in story tests. In some instances 
lengthy plots are produced in order to 
limit the subjects to only a few endings. 
In other instances fragments of sen- 
tences are presented in order to broaden 
the range of possible endings. The sec- 
ond approach may be illustrated by a 
method used by Sanford et al. (1943),
who investigated the motives of chil- 
dren between the ages of 5 and 14 
years. Items in the sentence completion 
test were structured in three ways. A 
situational pressure was described in 
one type of item, which was designed 
to elicit the expression of a relevant 
need: “Seeing that he was ignored, 
John. . . .” Some children ended this 
theme with descriptions of aggressive 
departures from the room or with tear-
ful appeals. A second type of item be-
gan with a reference to a need and
called for the description of a situa-
tional pressure: “She almost choked
with rage when. . . .” Representative
endings to this item included accusa-
tions of stealing and of the reading of
one’s diary by a brother. A third kind
of item was phrased ambiguously, and
its construction required a reference to
some motive in the ending: “James’s
purpose in going there was to. . . .”
Answers to this item included the needs
to find out whether accusations were
correct, to acquire a dog, and to behave
aggressively.

Fables. Duss (1940), a pioneer in the
construction of the story-completion
test, created a group of incomplete
fables. These were translated by Des-
pert, who used them to investi-
gate the emotional problems of stutter-
ing children. The fables were all short
and could be completed by children
from 6 to 15 years of age. One fable
begins with a funeral going through a
village street. People ask about the
identity of the dead person; somebody
says it is a person in the family who
lives in this house. Then the examiner
asked who it was and encouraged the
subject to talk about the person. Des-
pert interpreted the endings in terms of
such categories as death wishes, guilt 
feelings, Oedipus complex, sibling ri- 
valry, and jealousy toward parents. 

Friedman (1952) constructed his 
own fables for boys and girls between 
the ages of 5 and 16 years. To elicit 
castration anxiety, he told two fables, 
one about a monkey with a pleasing 
tail to which something happened and 
one about a puppy who liked to watch 
his long tail wag but found it looked 
different one day.

Incomplete Short Stories. As part of
his study of castration anxiety, Fried- 
man (1952) also told a story about a 
child whose finger itched and who 
scratched and rubbed it all the time, 
even when the parents warned against 
it. Again, the subjects were asked to 
tell what happened. A theme designed 
to elicit oedipal reactions concerned a 
child who enjoyed himself alone in the 
park with one parent. On returning 
home, the child encountered the other 
parent, whose face did not look the 
way it usually did. 

In an effort to test the validity of 
certain psychoanalytic conceptions of 
pregenitality, Miller and Stine (1951) 
designed a story-completion test for 
children between 7 and 14 years of age. 
The story beginnings portrayed situa- 
tions that are common sources of anx- 
iety, according to psychoanalytic the- 
ory. In the theme pertaining to castra- 
tion anxiety, for example, a child was 
warned by his teacher not to touch 
some machinery, was curious, and de- 
cided to investigate. In the theme about 
sibling rivalry a hungry child could not 
be fed by his mother until she had first 
fed the baby. The test also contained 
themes pertaining to oral sadism and 
phallic tendencies. Some of the signs of 
pregenitality tabulated by the authors 
were magical thinking, anality, pas- 
sivity, and running out of the field. In 
a subsequent investigation of “oral 
character” (Blum and Miller, 1952) 
the frequency of oral references in the 
story endings written by third graders 
was negatively related to a criterion of 
orality but an interpretive analysis of 
the endings was positively related to 
the criterion.


PARTIALLY STRUCTURED
STORIES


Rated Story-Beginnings. Graham, 
Charwat, Honig, and Weltz (1951)
studied aggressive responses to different 
amounts of provocation. Their subjects, 
who were between 12 and 14 years of 
age, were asked to complete statements 
in terms of the heroes’ probable actions. 
Each statement described a hostile act 
and the individual who committed it, 
and each act had been rated for degree 
of aggressiveness. The test was designed 
to cover five degrees of aggressiveness. 
Every degree was combined twice with 
each of five individuals: parents, au- 
thorities, siblings, friends, and inferiors. 
An analysis of the findings revealed that 
frequency and degree of aggression 
varied significantly with directness of 
aggression expressed in the attack and 
with the power of the attacking indi-
vidual.
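
The crossing of degrees of aggressiveness with attacking individuals
can be laid out mechanically. The following Python sketch is offered
only as an illustration and is not the authors' own procedure: the
numeric degree labels, the role names, and the function name
build_item_grid are assumptions introduced here, and the wording of
the actual test statements is not reproduced.

```python
from itertools import product

# Hypothetical labels; the source gives only the counts and the roles.
DEGREES = [1, 2, 3, 4, 5]
INSTIGATORS = ["parent", "authority", "sibling", "friend", "inferior"]
REPLICATIONS = 2  # every degree is combined twice with each individual


def build_item_grid():
    """Return one record per statement in the fully crossed design."""
    return [
        {"degree": degree, "instigator": who, "replicate": rep}
        for degree, who, rep in product(
            DEGREES, INSTIGATORS, range(1, REPLICATIONS + 1)
        )
    ]


grid = build_item_grid()
print(len(grid))  # 5 degrees x 5 individuals x 2 replications
```

Under these assumptions the grid holds 50 statements, so that the
effects of degree of provocation and of the attacker's status can be
examined separately.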

Incomplete Longer Stories. To study 
the defenses against aggression of pre- 
adolescent boys, Beardslee (in Miller 
and Swanson, 1960) administered a 
story-completion test. The hero of each 
story was very attached to an adult 
and was seriously but unintentionally 
inconvenienced by him. Considerable 
detail was included in the story be- 
ginnings in order to maximize endings 
about techniques of resolving con- 
flict. In the coding of endings, fre- 
quencies of defenses, such as turning on 
the self, reversal, and rationalization, 
were tabulated. 

Story Completion with Guided Ques- 
tions. Sargent’s Insight Test (1953), a 
method of story completion with
guided questions, was adapted for chil-
dren by Engle (1958). The endings
were given in response to such ques-
tions as, “What did he do?” and “How
did he feel?” The question about feel-
ings was not asked after the third item.
In some representative themes the hero
always gets good marks in spelling but
fails on one test, sees a big boy beating
up a smaller one, and finds out that
some children had been saying bad
things about him. Engle adopted Sar-
gent’s elaborate scoring system, which
includes types of relationships, aggres-
sive, passive, and evasive reactions,
amounts and categories of different
affects and defenses, indices of conflict,
and various indicators of maladjust-
ment.

In an investigation of the independ- 
ence of children in elementary school 
Boehm (1957) supplemented a story-
completion test with more detailed
guided questions. In one story a boy
was unsure about the best methods of 
decorating a room and decided to seek 
advice. To delimit the themes in a sub- 
ject’s endings, Boehm first asked him 
to mention someone from whom the 
boy might get advice. After the subject 
answered, she said the hero was able 
to consult either a very artistic student 
or a homeroom teacher who did not 
know much about art. The subject was 
then asked to name the one whose ad- 
vice the hero took. There followed 
other questions about his selection 
among conflicting opinions and his feel- 
ings about the person whose advice was 
not followed. Another story was about 
a boy who became involved in a fight 
and had given his opponent a bloody 
nose. The questions pertained to the 
hero’s feelings, his choice between the
advice of a teacher and a friend about
techniques for mollifying the victim,
his reaction to the victim’s advice to
forget about the incident, and the kind
of atonement necessary to allay the
hero’s guilt. Results were analyzed in
terms of the objects of dependence,
degree of independence of judgment,
and the autonomy of conscience.


OBJECTIVE STORY TESTS


To study the resolution of conflict 
in early adolescence, Seaton (1949) 
constructed a story-completion test 
with multiple-choice endings. In one 
story a boy accidentally collides with
his mother. As a result, she says he
cannot go to the movies, as she has 
promised, and that he must go to his 
room. The subject was asked to choose 
one of three possible endings: the hero 
goes to his room and wonders why he 
always does the wrong thing; he ques- 
tions the punishment and points out 
that the collision was an accident; he 
goes to his room but thinks the treat- 
ment was mean and unfair. Endings 
were coded in terms of such categories 
as self-blame, blame of others, hostility 
by means of an external agent, verbal 
aggression, and resistance to temptation. 

Schroeder and Hunt (1957) explored 
the defenses of adolescent boys by 
means of an objective scale. For each 
of ten items that described different 
kinds of failure, the subjects chose one 
of two alternative endings. The endings
were predominantly intropunitive, ex-
trapunitive, or impunitive. Choices
were significantly related to flexibility
in actual shifts from one ego-involving
activity to another, to the Taylor scale,
and to the level of aspiration.


Projective Pictures 


The commonest way of eliciting a 
subject’s projections is to present him 
with a relatively unstructured visual 
stimulus: a clear picture with an am- 
biguous meaning, a vague picture of 
people, a cartoon of animals who are 
free to express many impulses barred 
to human beings, or even vaguer stimuli 
such as ink blots or cloud pictures. 
When projective techniques first burst 
forth on the psychological world, they 
seemed to offer access to people's pri- 
vate worlds, both conscious and uncon- 
scious. Many psychologists hoped the 
new techniques would markedly ex- 
pand the theoretical horizons of per- 
ception and motivation. For the most 
part, such hopes have not been realized. 
There is now considerable evidence 
that conscious resistance affects the re- 
sults of tests and that they are not so 
sensitive to some kinds of unconscious 
impulses as was once hoped. Little 
progress has been made in the clarifica- 
tion of basic theoretical problems, such 
as the meaning of projection and the 
emotional significance of color. Al- 
though an impressively large number 
of projective techniques has been pro- 
posed, their standardization and valida- 
tion has tended to be fragmentary or 
lacking. At present, it is possible to 
summarize the projective picture tests 
that have had most influence on moti- 
vational research by discussing three 
types of instruments. One is the The-
matic Apperception Test and related
techniques, a second embraces the vari-
ous tests consisting of cartoons and
drawings, and the third is the Ror- 
schach Test. 


THEMATIC APPERCEPTION 
TEST AND RELATED
TECHNIQUES 


Probably the most influential and 
most frequently used of the projective
techniques has been the Thematic Ap- 
perception Test (Morgan and Murray, 
1935), which contains 20 cards for chil-
dren. Subjects are instructed to inter-
pret the action, to tell what the char-
acters are thinking and feeling, and to 
give imaginary constructions of the 
preceding events and the final out- 
comes. The stories are scored in terms 
of needs, press, and outcomes. 

Use of Parts of the Test in Research. 
Most investigators administer only those 
cards and score those needs and press 
required to test particular hypotheses. 
Mussen and Naylor (1954), for exam-
ple, employed a curtailed form of the
test to explore the aggression of work- 
ing-class boys between the ages of 9 
and 16 years. A majority of the sub- 
jects were having difficulties with edu- 
cational or legal authorities because of 
antisocial behavior. Ten of the 20 cards 
were administered, and the stories were 
coded for only two needs. One was 
aggression, which was tabulated when 
references were made to such acts as 
fighting, killing, criminal assault, get- 
ting angry, quarreling, criticizing, and 
resisting coercion. Because the authors 
felt that the overt expression of aggres- 
sion depends, in part, on fear of pun- 
ishment, they also tabulated references
to such punishments as assault, threat,
quarreling, deprivation of privilege, 
and rejection. 


Combination of Standard Pictures 
with New Ones. Investigators fre- 
quently add new pictures to the ones 
supplied by Murray. To test children 
between 5 and 14 years, Sanford et al. 
(1943) supplemented the standard set 
of pictures with another group in 
which a child was the central figure. 
The stories were interpreted in accord- 
ance with Murray’s system. In order of 
frequency, the most commonly ex-
pressed needs were aggression, acquisi-
tion, autonomy, affiliation, succorance,
and passivity. The least frequently ex- 
pressed needs were order, infavoidance, 
exhibition, blamescape, understanding, 
and counteraction. 

In their research on physical matura- 
tion and personality, Mussen and Jones 
(1957) used nine pictures from the 
standard TAT (Murray, 1943), five 
pictures from a 1938 version of the test, 
and five new pictures designed to high-
light the motives to be analyzed. Writ-
ten stories were obtained from physi-
cally retarded and physically advanced
17-year-old subjects. Themes of the
two groups were then compared with 
respect to feelings of personal inade- 
quacy, parental rejection, the inclina- 
tions to escape and defy parents, and 
the needs for affiliation and aggression. 

Scoring Methods. A number of new
systems for scoring the Thematic Ap-
perception Test have been prompted by
differences in purpose and population
from the ones of Murray’s research and
by fruits garnered from clinical prac-
tice with his instrument. Henry (1947)
has developed one of the more sophisti-
cated systems. In his research he com-
pared personalities in different societies
and also traced the associations between 
the culture and different aspects of per- 
sonality within each society. The sub- 
jects were Hopi and Navajo children 
between the ages of 6 and 18 years.
Because the scoring system was being
applied to stories of children from alien
cultures, the dimensions had to have
the same meanings in the two societies.
The scoring system also had to be quite
detailed because Henry intended to 
study the organizations of total per- 
sonalities. To that end, he classified 
each subject with respect to such varied 
topics as mental capacities, richness and 
originality of imagination, motivational 
and emotional ties with peers and adults
at home and in school, defense mecha-
nisms, and conflicts between white and
native identifications. To analyze such 
material, he defined criteria for coding 
both form and content. Some aspects 
of form coded were length of story, 
originality, rhythm, level of organiza- 
tion, and language structure. Among 
the categories of content were the mo- 
tives, actions, and thoughts of charac- 
ters in interpersonal situations, the emo- 
tional interplay between persons, con- 
flict and the substitutes for forbidden 
impulses, and insight into emotional 
bases of action. The scoring was done 
with reference to the age and sex norms 
for each picture. Subjects were rated on 
each variable. Eighty-three per cent of 
the ratings agreed with some other 
source of data, such as the Rorschach 
Test, a life history, a free drawing, 
and a battery of tests devoted to emo- 
tion, moral standards, and the making 
and changing of the rules of games.

The Achievement Motive. An in-
vestigator sometimes finds that he gets
better results with pictures selected 
because of their relevance to his 
topic than he does with the original 
TAT cards. To measure the achieve- 
ment motive, McClelland et al. (1953) 
selected a special set of pictures, which 
portrayed people doing various kinds of 
manual and intellectual work. In a typi- 
cal picture two men in working clothes 
are standing next to a machine, which 
appears to be some kind of press. When 
the test was administered to high-school 
freshmen, the reliability of the scoring, 
as measured by the agreement between 
raters, was .96. Scores were significantly 
related to incidental learning but not 
direct learning (Karolchuck and Wor- 
rell, 1956). Correlations of school grades 
and intelligence with the achievement 
motive were low to moderate. 

A Scale for adolescents. To establish 
norms for adolescent boys and girls, 
Veroff, Wilcox, and Atkinson (1953) 
repeated the original method of stand- 
ardizing the test of achievement motiva- 
tion with a high-school population 
between the ages of 16 and 18 years. 
Scores were obtained with two types 
of instructions. First the test was ad- 
ministered with instructions calculated 
to produce a relaxed orientation. Then 
it was administered with achievement- 
orienting instructions. Under the re- 
laxed condition, boys’ scores were 
significantly lower than they were with 
the achievement orientation. Girls’ 
scores under the initial instructions were 
so high that they could not increase 
on the second test. 

The scoring system. Although the
promise of the test of the achievement
motive has been demonstrated in a
number of studies, recent investigations 
by MacArthur (1953) and by Karol- 
chuck and Worrell (1956) indicate the 
desirability of refining the scoring 
system. The results of both investiga- 
tions suggest that the current scores 
have at least two relatively independent 
components. One, the expectancy of 
goal achievement, is expressed in 
descriptions of mastery and of positive 
goal states. Another, which Moulton 
(in Atkinson, 1958) has called the fear 
of failure, is expressed by references 
to obstacles and to anxiety about the 
achievement of goals. Moulton has 
created a scoring system for fear of 
failure. He tabulates only nonpositive 
references to the goal and to the tech- 
niques of implementing it. 


OTHER TESTS WITH REALISTIC 
PICTURES OF PEOPLE 


Adolescents’ Fantasies. Symonds 
(1949) designed a special set of pic- 
tures for adolescents and standardized 
his instrument by administering it to 
boys and girls in junior and senior high 
schools. Among the more frequent 
motivational and emotional themes that 
occurred in the stories were aggression, 
eroticism, anxiety, altruism, ambition, 
and repentance. The commonest en- 
vironmental themes pertained to family 
relationships, economic issues, punish- 
ment, separation, accidents and illnesses, 
and school. Norms were provided for 
each of the themes, and Symonds rec- 
ommended that picture-story material 
be interpreted in the light of individual
case histories.


A Test of Oedipal Problems. To ob-
tain stories related to the Oedipus com-
plexes of subjects between 5 and 16
years of age, Friedman (1952) made up
two sets of pictures, one for boys and
one for girls. In each set one picture
depicted a father, a child with a broken
toy, and a staircase in the background;
another picture portrayed a child stand-
ing at a door and looking toward his
mother; a third depicted two parents
facing a child on the floor. After the
set had been administered, Friedman
asked each subject to name the parent 
in the picture whom the child loved 
better and the parent who loved the 
child better. Stories about the pictures 
were coded in terms of such catego- 
ries as homosexual versus heterosexual 
choice, constructive action by a parent, 
frequency of conflict themes for a 
card, and frequency of references to 
the father ascending the staircase. The 
ascent was interpreted as a symbolic 
reference to the sexual act. 

Aggression to Authority. Johnson and
Stanley (1955) used a sophisticated
design to select items for a study of 
aggression. The investigators first 
planned 24 pictures, which represented 
different combinations of three charac- 
teristics: the provoking agent, who was 
male or female; threat, which was high 
or low; and power, which was high or 
low. There were eight possible com- 
binations of the three dichotomized 
variables. A picture could portray a 
man with low power and high threat, 
for example, or a woman with high 
power and low threat. In a pretest the 
authors asked 18 graduate students to 
judge the amount of power and threat 
suggested by each picture. Since the
sex of the main character was obvious,
it did not have to be rated. The investi-
gators then eliminated eight pictures,
one from each of the eight sets with the
same combination of variables, about
which the judges disagreed most with
the original classification. The final
form of the test, then, consisted of 16
pictures with two categories of sex,
of power, and of threat. The test was
administered individually to delinquent
and nondelinquent boys between the
ages of 10 and 12 years, and the re-
sponses were rated on a five-point scale
of submission-hostility. In the analysis
of data the investigators computed as-
sociations involving submission-hos-
tility, sex, power, and threat, and their
interactions.
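
The selection procedure just described amounts to pruning a 2 x 2 x 2
design. The Python sketch below is a minimal illustration under stated
assumptions: three candidate pictures per cell follows from the 24
pictures and eight combinations mentioned in the text, but the picture
names, the disagreement counts, and the function names are invented
here and do not come from Johnson and Stanley.

```python
from itertools import product

SEXES = ("male", "female")
LEVELS = ("high", "low")


def design_cells():
    """Return the eight (sex, threat, power) combinations."""
    return list(product(SEXES, LEVELS, LEVELS))


def example_candidates():
    """Hypothetical pretest data: three candidate pictures per cell, each
    paired with the number of judges who disagreed with its intended
    classification. The counts are invented for illustration only."""
    invented_counts = (2, 7, 1)
    return {
        cell: [(f"picture_{i}_{j}", n) for j, n in enumerate(invented_counts)]
        for i, cell in enumerate(design_cells())
    }


def prune_pictures(candidates):
    """Drop, within each cell, the picture the judges disagreed about most."""
    kept = {}
    for cell, pictures in candidates.items():
        worst = max(pictures, key=lambda p: p[1])
        kept[cell] = [p for p in pictures if p is not worst]
    return kept


final_form = prune_pictures(example_candidates())
print(sum(len(p) for p in final_form.values()))  # pictures left in the final form
```

With one picture removed from each of the eight cells, two pictures
per cell remain, which gives the 16 pictures of the final form.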


CARTOONS OF HUMANS 


One reason for giving projective tests
is to obtain material that the subject
will not give voluntarily, either because 
he deliberately withholds it or because it 
makes him so anxious that he uncon- 
sciously distorts it. To minimize the 
effect of such resistances, investigators 
have created tests with cartoons or with 
drawings of animals rather than of
people. It is assumed that unusual-look-
ing anonymous figures or drawings of
animals will not make a subject self-
conscious or arouse anxiety in him about
his own motives and feelings.

Direction of Aggression. Rosenzweig
(1948), one of the pioneers in the de-
velopment of cartoon tests, devised a
form of his Picture Frustration Study 
for children between 4 and 13 years of 
age. Each cartoon depicts a hero who
is being provoked by the action of
another person. Rosenzweig and Ro-
senzweig (1952) found that extra-
punitiveness gradually diminished with 
increasing age and that there was a 
progressive increase of indirect expres-
sions of aggression. These trends were
less marked for maladjusted than for 
well-adjusted children. 

Misbehavior and Punishment. Mor- 
gan and Gaier (1956) constructed a 
special cartoon test, the Punishment 
Situation Index, depicting actions that 
are commonly punished. There are 
two sets of ten cartoons. Each set por-
trays such situations as a possible physi-
cal injury to a child, an unfavorable re- 
lationship with a sibling, destruction of 
the parents’ personal possessions, a bad 
report card, a disordered room, and 
stealing. After the subject described 
what was happening in the picture, he 
was asked what his mother would say 
if he were the person in the picture, 
what he would say to his mother, what 
his mother would do, what he would 
do, how he would feel if his mother 
acted in the way he anticipated, and 
how he thought his mother would feel. 


CARTOONS AND DRAWINGS 
OF ANIMALS 


Animals have the special virtue of 
being natural objects of identification 
for children and adults. It is commonly 
assumed that both will be readily in- 
volved in the task of telling stories about 
animals. 

Psychosexual Development. To 
measure different aspects of psycho- 
sexual development, Blum (1949) con- 
structed the Blacky Pictures. Blacky, 
the hero, is a dog. Incidents involving
him and members of his family are
portrayed in 12 cartoon drawings. From 
the subjects’ stories and responses to 
the subsequent inquiry, the author in- 
ferred the strengths of oral eroticism, 
oral sadism, anal sadism, Oedipal in- 
tensity, masturbation guilt, castration 
anxiety, penis envy, positive identifica- 
tion, sibling rivalry, guilt feelings, posi- 
tive ego ideal, masculine and feminine, 
and love object, masculine and feminine. 
The author devised scoring criteria and 
obtained norms for each picture. In- 
stead of computing correlations with 
independent criteria, such as clinical 
ratings or psychiatric diagnosis, Blum 
offered strong evidence for construct 
validity. When psychosexual categories 
were correlated, almost all the results 
were consistent with psychoanalytic 
principles. Instances of such evidence 
were the significant correlations be- 
tween anal expulsiveness and sibling 
rivalry and between penis envy and 
narcissistic love object in females. 

Cava and Raush (1952) used the 
Blacky Pictures to study the identifica- 
tions with fathers of boys in the 
twelfth grade. Subjects were classified 
as having more or less conflict on the 
dimensions of oedipal intensity, castra- 
tion anxiety, identification, and ego
ideal. As anticipated, the less the con-
flict, the greater was the perceived simi-
larity between the subjects’ responses to 
the Strong Vocation Interest Test and 
the responses they attributed to their 
fathers. The trends were significant, 
however, only for the dimensions of 
castration anxiety and of total identifica- 
tion. 

Also psychoanalytic in orientation is 
Bellak and Bellak’s (1950) Children’s
Apperception Test. Stories about the
animals in the pictures were coded for
orality, anality, sibling rivalry, Oedipal
conflict, aggression, fear, sexuality, and
identification with mother and father.
Frequencies and relative intensities of
responses to each picture by children in
the preschool years were reported by
Byrd and Witherspoon (1954). On the
basis of their attempt to obtain norma-
tive data, these authors concluded that
the test was most effective in eliciting
information about orality, aggression,
and relations with parents, and less ade-
quate in producing material about
sibling rivalry, fears, and sexuality.
Byrd and Witherspoon also noted that
judgments of parental identification had
low reliability and that the only ade-
quate interpretations of toilet cleanli-
ness and oedipal dynamics were derived
from pictures designed to elicit stories
about these topics.


MOTION PICTURES 


Motion Pictures versus Stills. On 
comparing the stories told by tenth 
graders about silhouette motion pic- 
tures and matched stills, Eiserer (1949) 
discovered that films were superior in 
eliciting expressions of needs and press, 
interpretable content, and stories that 
were more projective than descriptive. 

A Filmed Puppet Show. Eiserer’s re- 
sults should motivate others besides 
Haworth (1957) to study filmed projec- 
tive methods. Haworth made a film of 
a puppet show called “Rock-a-Bye
Baby,” which was originally written 
by Woltman (in Bender and Woltman, 
1936). Sibling rivalry was the theme 
of this 35-minute film, which begins
with Caspar, the hero, calling a witch
to get rid of his baby sister. In sub-
sequent scenes the witch puts a spell
on the milk, the baby falls ill and is
rushed to the hospital, Caspar kills the
witch, the baby is restored to health,
and Caspar gets his parents’ love.
Haworth stopped the film at the point
when the baby is taken to the hospital
and asked the children to finish the
story as a group. Then the rest of the
film was shown. Finally, the subjects
were seen individually and interviewed.
At that time the investigator asked them
to give their reactions to the show and
to describe Caspar’s thoughts about his
mother, father, baby sister, and the
witch. Other questions pertained to the
necessity of Caspar being punished, his 
probable answer if his mother asked 
about the reasons for the baby’s becom- 
ing well, and predictions of subsequent 
events. From the subjects’ answers, 
Haworth derived signs of defensiveness, 
sibling rivalry, autoeroticism, obsession, 
Oedipal conflict, and superego develop- 
ment. Norms were established by test- 
ing one group of relatively normal 
children between 4 and 10 years and 
another group of children in therapy 
who were between 4 and 9 years. 


AN OBJECTIVE PICTURE TEST 


A creative attempt to construct an 
objective test of anxiety was made by 
Temple and Amen (1944) and Dorkey 
and Amen (1947). Intended for pre- 
school children, the instrument con-
sisted of two series, each containing 14
pictures. Each picture depicted a par-
ticular kind of relation between two
children or between a child and an
adult. Some of the themes were scold-
ing, neglect, and aggressive attack. In
every picture the area of one person’s 
head was left blank. From two faces, 
one happy and one sad, the subject 
selected the most appropriate one for 
the blank space. The proportion of un- 
happy faces chosen to complete the 
pictures was the anxiety score. Norms 
were compiled for each picture, but 
they were based on few cases. For 
children 2 years of age, the correlation
between test scores and teachers’ rat-
ings of anxiety was .59.
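
The scoring rule described above, the proportion of unhappy faces chosen, can be sketched in a few lines. This is only an illustration under stated assumptions: the function name `anxiety_score`, the labels "happy" and "sad", and the sample record are hypothetical and do not come from the original studies.

```python
def anxiety_score(choices):
    """Return the proportion of pictures completed with the sad face.

    `choices` holds one entry per picture, each either "happy" or "sad"
    (hypothetical labels chosen for this sketch).
    """
    if not choices:
        raise ValueError("at least one picture is required")
    for choice in choices:
        if choice not in ("happy", "sad"):
            raise ValueError(f"unexpected choice: {choice!r}")
    unhappy = sum(1 for choice in choices if choice == "sad")
    return unhappy / len(choices)


# Hypothetical record for one series of 14 pictures.
record = ["sad"] * 7 + ["happy"] * 7
print(anxiety_score(record))
```

A score near 1 would mean that the child chose the unhappy face for almost every picture; how such a score compares with the published norms is not something this sketch attempts to show.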


Inkblots 


RORSCHACH TEST 


Norms for the Preschool and Ele- 
mentary Grades. For reasons to be dis- 
cussed below, the Rorschach Test has 
been used less frequently in studies of 
children’s motives and affects than have 
the other popular projective techniques. 
A careful standardization of the method 
is available in the publication of Ames, 
Learned, Metraux, and Walker (1952), 
who tested 25 boys and 25 girls at each 
year level from 2 to 10 years. The 
authors described their procedures very 
carefully, but the sample was not at all 
representative of the general population. 
Three fourths of the children were 
above average in intelligence and from 
the middle class. In addition, the re- 
stricted geographic region from which 
the subjects were selected—Connecticut 
and New York—raises doubts about the 
applicability of the norms to children 
in different areas, such as the south- 
eastern or mountain states.

Norms for Adolescents. Some of the
studies of adolescents suggest the need 
of a special set of Rorschach norms for 
these subjects. Compared to Beck’s 
normal adult group, Hershenson’s 
(1949) adolescents produced a lower 
mean percentage of good, pure form 
and much more white space. As these 
signs are interpreted by the investigator, 
they signify that, compared to adults, 
adolescents display greater emotional 
distortion, lack of conscious control, 
and rebelliousness. McFate and Orr 
(1949) analyzed sex differences in the 
Rorschach records of adolescents in the 
California Guidance Study. Compared 
to the boys, the girls produced sig- 
nificantly more color, which indicates 
emotionality, and significantly fewer 
signs of anxiety. 

Clinical Reports. A number of papers
on the Rorschach Test have been devoted
to the dynamics of an individual person.
Sometimes the reports have been de-
velopmental. Typical is Allen’s analysis
of nine quarterly records obtained from 
a girl between the ages of 2.6 and 4 
years (1955). As she grew older, the 
extratensive method of relating to her 
environment became increasingly prom- 
inent, and the author felt that she 
showed increasing control over emo- 
tionality without inhibiting her ability 
to make warm personal ties.

Validation of Signs. Working with 
boys and girls in the fourth grade, 
Broida and Thompson (1954) at- 
tempted to validate certain Rorschach 
signs of “insecurity” by analyzing their 
associations with reactions to psycho- 
logical stress. Among the signs of 
insecurity were pure color, color- 
dominated form, color shock, a re-
stricted range of content, a large
number of rejections, and few move- 
ment responses. The authors used two 
criteria for reactions to stress. One was 
the teachers’ list of such responses as 
crying, autistic gestures, mobility, 
rigid approach to a test, and anxiety. 
The second criterion consisted of 
teachers’ ratings of subjects on 12 scales 
of insecurity by Swift. The Rorschach 
signs were not significantly related to 
either teachers’ observations or ratings 
on Swift’s scale, although these two 
criteria were significantly related to 
each other. In short, it was possible to
predict reactions to stress from teachers’ 
ratings of children’s insecurity in the 
classroom but not from the children’s 
performances on the Rorschach Test. 


Drawing and Painting


SCORING CRITERIA


Impressed by the extent to which 
artistic productions reveal the per- 
sonality dynamics of children, a num- 
ber of clinicians have attempted to 
standardize techniques of scoring paint- 
ings and drawings. Many psychologists 
who have employed artistic productions 
in research on motivation or affect have 
used the scoring criteria of Alschuler 
and Hattwick (1947). The validities of 
certain signs in this scoring system were 
investigated by Thomas (1951), who 
compared the paintings of children 
frustrated by the sight of inaccessible 
toys with the paintings of a nonfrus- 
trated group. Thomas also compared 
the paintings produced by subjects of 
different ages; the age range was 2 to
5 years. Frustration was not signifi-
cantly related to regressive shifts in
hue or line. Age differences in form
level agreed with Alschuler and Hatt-
wick’s picture of developmental
changes, but age differences in color and 
line did not. 


PAINTING STYLE 


Fastidiousness. Alper, Blane and
Adams (1955) applied Alschuler and
Hattwick’s criteria to the analysis of
artistic style. Their subjects were
nursery-school children in the middle
and working classes. From class dif-
ferences in techniques of child training,
especially those aimed at cleanliness,
the authors deduced that children of
white-collar workers would be more
fastidious and inhibited in style of
finger painting than would children of
blue-collar workers. The predictions
were generally supported by the results.
Compared to the subjects from the
working class, the ones from the middle
class were more inclined to put colors
down separately, mix colors indiscrim-
inately, name a free painting, wash
up on their own, avoid covering the
whole sheet of paper, and avoid paint-
ing with warm colors. The investiga-
tors interpreted the cool colors favored
by children in the middle class as signs
of depression, poor adjustment, and
controlled reactions. The separate
placement of colors was attributed to
a sense of order and cleanliness and to
repression of the desires to smear and
soil. Indiscriminate mixing of colors
seemed to reflect the repressed desire
for smearing, and the limited coverage
of the drawing paper was interpreted
as a sign of withdrawal and emotional
dependence. To determine how color
was applied when dirtiness was not 
an issue, the investigators asked the chil- 
dren to draw with crayons. As antici- 
pated, the socioeconomic differences 
obtained with paints were no longer 
significant. 

Conceptual Orientation. McNeil (in 
Miller and Swanson, 1960) compared 
the paintings of preadolescent boys 
from the middle and working classes. 
His purpose was to analyze the strength 
of their “conceptual” orientations: their 
inclinations to communicate by means 
of concepts. The boys were asked to 
depict the feelings of anger, fear, hap- 
piness, and sorrow. Each feeling was 
illustrated by a concrete situation. In 
the direction to portray extreme anger, 
for example, the boys were asked to 
convey how they felt when “. . . you 
see a boy beating up your little dog or 
...a reckless driver has just missed 
you with his car.” Subjects were asked 
not to paint a person expressing each 
emotion but rather to convey the ideas 
and feelings behind it. Guided by the 
literature on artistic productions, 
McNeil developed six criteria for con- 
ceptual and nonconceptual orientations. 
He assumed, for example, that a very 
conceptual person would be inclined 
to use paint only, to portray unrec- 
ognizable objects, to paint with at least 
four of the five colors, and to cover 
most of the sheet. Among the signs of 
a nonconceptual orientation were the 
outlining of shapes with black and the 
painting of small compositions. Al- 
though the results produced the an- 
ticipated associations, they were not so 
marked as those McNeil obtained with
other techniques of measuring the con-
ceptual variable. 


Play Techniques 


METHODS OF INTERPRETATION 


The meaning of play has been the 
subject of many publications, some
dating back to the nineteenth century. 
In clinical work with children play 
provides the primary source of informa- 
tion for both diagnosis and psycho- 
therapy. Probably the most influential 
publications in more recent historical 
times were Freud’s descriptions of Little 
Hans’ phobia (1924) and of a child’s 
play with a spool (1920). The im- 
portance of play therapy to people with 
different theoretical predilections has 
resulted in a variety of interpretive 
systems. Among the psychoanalytic 
orientations, those of Anna Freud 
(1926) and Melanie Klein (1932) have 
been most influential. Researchers have 
also found much of interest in the more 
eclectic scheme that Lowenfeld pro- 
posed for interpreting her World Test 
(1939) and in Erikson’s system (1940) 
of analyzing play. 


FREE PLAY 


Unlike the Europeans, most American 
investigators have been partial to ob- 
jective methods and to the phrasing of 
hypotheses in terms of behavioristic 
learning concepts. In great part, the in- 
fluence of learning theorists can be 
traced to their concern with clarity 
of concepts and their success in de- 
veloping fruitful operational defini-
tions. Such assets have made the volume
on frustration and aggression by Doll-
ard, Doob, et al. (1939) a handbook
for many students of social behavior.
These authors defined a set of proposi-
tions that have been used by a number
of investigators in interpreting aggres-
sion in play.

Aggressive Behavior. In a majority
of American studies the variables are
analyzed in terms of relative fre-
quencies. Usually the criteria for tabu-
lations are carefully defined. In de-
signing studies, most investigators aim
at the direct measurement of behavioral
units but do not shy away from in-
ferences about internal states. Repre-
sentative of the American orientation
to play are the following three studies
of aggressive behavior, attitudes, and
fantasies.

Levin and Sears (1956) investigated
some antecedents of aggression in a
group of 5-year-old children by ob- 
serving their behavior during two 
twenty-minute sessions of doll play. 
Action during the play was divided into 
units; each child was classified in terms 
of the proportion of aggressive units 
in a session. To measure amount of ag- 
gression, the investigators tabulated 
attempts to irritate, hurt, injure, punish, 
frustrate, or destroy dolls or equip- 
ment. In the analysis of results relations 
were computed between frequency of 
aggression and such variables as sex of 
subjects and of punishing parents and 
birth order of subjects. 
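
The unit-based measure described above reduces to a simple proportion per session. The following is a minimal sketch, assuming each unit of play has already been coded by an observer as aggressive or not; the function names and the sample codings are hypothetical and are not taken from Levin and Sears.

```python
def aggression_proportion(units):
    """Return the share of play units coded as aggressive.

    `units` is a list of booleans, True when the observer coded the
    unit as aggressive (a hypothetical encoding for this sketch).
    """
    if not units:
        raise ValueError("a session must contain at least one unit")
    return sum(units) / len(units)


def score_child(sessions):
    """Return one proportion per session for a single child."""
    return [aggression_proportion(units) for units in sessions]


# Hypothetical codings for two sessions of doll play.
sessions = [
    [True, False, False, False],
    [True, True, False, False],
]
print(score_child(sessions))
```

Working with proportions rather than raw counts keeps children who produce many units of play comparable with those who produce few.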

Negative Attitudes. Moustakas 
(1955) compared the “negative atti-
tudes” expressed in play by well-ad-
justed and disturbed 4-year-olds. Each
subject had four sessions, all of which
were recorded on tape and by a ste-
nographer. The first and third sessions
provided the data for the study. From
these data, 241 negative attitudes were
reliably identified and rated for in-
tensity of expressed feelings. Compared
to the normal children, the disturbed
ones expressed a significantly greater
number of negative attitudes with more
intense feelings. The commonest nega- 
tive attitudes were diffuse hostility, 
hostility to home and family, and 
anxiety about orderliness and cleanli- 
ness. 

Fantasies of Aggression. Wenar 
(1956) studied the aggression of 8-to- 
10-year-old children with motor handi- 
caps. He observed the subjects as they
played with the World Test. In analyz-
ing the results, he focused on fantasies 
of destruction. These included fantasies 
about war, fighting, fires, accidents, 
sickness, and zoos or jungles with fight- 
ing, biting, or escaping wild animals, 
fire houses, hospitals, sailors, and police- 
men. In addition to destruction, Wenar 
also analyzed protective defenses and 
lack of over-all adjustment. Evidence 
of the latter consisted of aggressive 
worlds, empty worlds with few pieces 
and categories, closed worlds with 
many fences, rigid or overly symmetri- 
cal worlds, and disorganized or chaotic 
worlds with many fences. The fre- 
quency of fences was interpreted as an 
index of impulsivity and excessive 
anxiety about the importance of con- 
trol. 


DOLL PLAY WITH STRUCTURING 
QUESTIONS 


Sibling Rivalry. Numerous investi-
gators have presented their subjects
with a limited number of play materials
and structured the situation further by 
asking a series of specific questions. In 
some cases the dolls were identified and 
descriptions were given of their re- 
lationships to each other before the 
questions were asked. Levy (1937) em- 
ployed such a method to investigate 
sibling rivalry. The range of his sub- 
jects’ ages was 2 to 14 years. Play was 
limited to three dolls, which were 
identified as a mother, a baby, and a 
child the same sex as the subject. The 
examiner said that the mother had to 
feed the baby. Pointing to her chest, he 
noted that she had no breasts, and sug- 
gested that he and the child make some. 
The examiner shaped one out of clay, 
and the child shaped the other. After 
the breasts were attached to the mother 
doll, her arms were put around the baby 
doll. The examiner then said that the 
sister or brother had seen the new baby 
at the mother’s breast for the first time 
and asked what the sibling did. Next, 
the child was instructed to translate his 
description into action with the dolls. 
From time to time, other questions were 
asked with a view to encouraging the 
expression of rivalry. The examiner 
might even quote the older child as 
saying, “That bad, bad baby at my 
mother’s breast!” 

Levy classified all the activities of 
each subject into five general cate- 
gories. One was the prevention of hos- 
tility to objects by such techniques 
as denial, withdrawal, distracting play, 
and leaving the room. A second covered 
the different forms of attack, and a 
third, the direction of hostility, included 
such objects as self, baby, mother, and 
breasts. A fourth category applied to
techniques of restitution, such as making
the sibling “good” or putting back the 
breast, and a final category involved 
methods of self-defense, such as im-
potence and rationalization.

Directness of Aggression. Fite (1940) 
used a semistructured method of doll 
play to study directness of aggression in 
children’s relationships with peers and 
with adults. First the examiner asked
whether she should make her doll hit 
the subject’s doll. Then she asked him 
what he thought about hitting and if 
it was all right to hit other children. 
These queries were followed up by 
requests to justify his positions and to 
explain their sources. Next, he had to 
say whether it was all right to hit if 
the other child attacked first. Finally 
he was requested to make his doll hit 
the examiner’s. 

Responses to Frustration. Lerner (in 
Murphy, 1956) first gave each child 
some attractive new toys, such as a 
Mickey Mouse, some trucks, and a 
toy telephone. Then he raised some 
questions. He might ask what he should 
do, or he might try to get the child to 
say which toys were his and which were 
the examiner’s. Once ownership was 
established, the investigator wanted to 
know whether his toys or the child’s 
were nicer. He might end by saying he 
was going to play with all the toys and 
that the child should watch. Such pro- 
cedures were repeated with different 
groups of toys. Then the child was 
given a chance to create a “selective 
play fantasy” involving himself and the 
examiner. Lerner inferred the child’s 
methods of overcoming frustration 
from reactions such as competition, in-
trusion, desertion, and obligation. There
seemed to be three general styles of
reaction to frustration. One consisted of
ego-extending, ego-intruding, and ego-
shielding responses. Another consisted
of reciprocal actions, such as ego-fusing,
ego-sharing, and ego-supporting. The
third included ego-retreating, ego-re-
coiling, ego-destroying, and other pas-
sive actions.


HIGHLY STRUCTURED PLAY 
AND REAL SITUATIONS 


Frequency of an Act. Blum and
Miller (1952) provided a group of
third-grade children with an apparently
limitless amount of ice cream shortly
after they had eaten an ample lunch.
The ice cream was packed in 1-ounce
cups. Only one cup was available at a
time, so that it was easy to record the
quantity consumed by each child. Since
the children had just eaten, the investiga- 
tors assumed that the consumption of 
ice cream indicated not hunger but 
strength of oral passivity. The amount 
consumed was significantly related to 
nonpurposive mouth movements and to 
some of the behavioral characteristics, 
which, according to psychoanalytic 
theory, are part of the constellation of 
the oral character. 

Inhibition About Engaging in a 
Legitimate Act. The destructiveness of 
preschool subjects was given free scope 
by Stone (in Murphy, 1956), who 
invited them to break some balloons. 
When a child refused, he was given a 
more pressing invitation. If he was 
still reluctant, the examiner asked if he 
should break them. With further re- 
fusal, the child was told that the bal-
loons belonged to the examiner, that
there were many, and that it was all
right to break them. If there was still a
show of reluctance, the examiner broke
one when the child was not looking.
Then the child was asked how the bal-
loon had broken, if he did it, if it was
fun, if more should be broken, and
so forth. Stone recorded the total se-
quence of events and various indications
of emotional tension, anxiety, and guilt. 


In a game of collisions Lerner (in 
Murphy, 1956) moved a toy train on 
a track toward an approaching train that 
a preschool child was pushing. The 
purpose of the exercise was to see how 
the subject would act when the trains 
met. In two other situations the ex-
aminer’s doll stopped the child’s car
and the child could not put his doll into
a doll house because it belonged to the
examiner.

Style of Action. Bernstein (1955) re- 
corded amount of smearing during 
finger painting and play with cold 
cream. Smearing, which was taken as a 
sign of anality, was not significantly 
associated with a history of coercive 
toilet training.

Choice Between Alternative Acts. In 
one form of projective test the child 
has to make a choice between alterna- 
tives. To diagnose type of orality, for 
example, Bernstein (1955) said his sub- 
jects could have either a lollipop 
or chocolate. A decision to suck the 
lollipop was classified as oral-passive; the 
eating of chocolate was classified as
oral-sadistic. 

Choice Plus Style of Action. At each 
of ten positions on a floor Ackerman 
(1937) placed a toy, parts of which 
were assembled, and a disassembled 
replica. When a child came to a posi-
tion, he had to decide whether he would
put a toy together or pull one apart. 
Some of the coding categories were 
constructive and destructive impulses, 
ineffectiveness because of extreme 
orderliness, and constructive play with 
destructive motives. The investigator 
provided criteria for scoring each item. 
Results were interpreted in such terms 
as patterns of aggressive expression, 
conscience, level of aspiration, and 
dominant emotion. On comparing de- 
gree of constructiveness of maladjusted 
and well-adjusted children, Ackerman 
obtained significant differences for sub- 
jects between 3 and 6 years but for no 
other groups up to 16 years of age. 


BEHAVIOR IN PLAY AS AN 
INDICATOR OF CHANGE 


Deprivation and Regression. All the 
experiments to be described in this 
section have similar designs. First, the 
investigators conducted a play session. 
Then they created a frustrating situa- 
tion. Finally, there was a second play 
session. To assay the effects of frustra- 
tion on motivation or affect, the in- 
vestigators assessed the changes from 
the first to the second sessions of play. 
In their research on the regression of 
preschool children, Barker, Dembo, and 
Lewin (1941) predicted a decrease in 
the constructiveness of play following 
frustration. Constructiveness was rated 
on a seven-point scale. To frustrate 
the children, the investigators exposed 
a number of highly attractive, inacces- 
sible toys behind a wire-net partition. 
Events in each play session were re- 
corded by an experimenter and by an 
observer behind a one-way screen.
During the period of frustration the
subjects significantly increased attempts
to penetrate the barrier, to mix play
with other diversions, and to leave the
experimental situation. There was a
significant decrease in the types of play
to which the subjects gave their com-
plete attention.

Failure and Aggression. Yarrow
(1948) tested the effect of different
kinds of failure on the aggressiveness 
that preschool children expressed in 
play. The subjects participated in two 
thirty-minute sessions of doll play. Be- 
tween the sessions, members of the 
control group spent fifteen minutes in 
solitary play with tinker toys and with 
a simple jigsaw puzzle. The productions 
were praised liberally. Members of a 
failure group were given tinker toys 
and requested to duplicate the com- 
plex model of a windmill; toward the 
middle of the session, the examiner dis- 
paraged the children’s efforts. Members 
of a satiation group were asked to work 
on a large peg board. They continued 
until either they refused despite the 
experimenter’s insistence, or thirty 
minutes were up. Play before and after 
the frustrating experiences was rated for 
intensity of aggression. 
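
The before-and-after design used in these experiments amounts to computing a change score for each child and comparing the averages across groups. The sketch below illustrates only that arithmetic; the group labels, the rating values, and the function name are hypothetical and are not data from Yarrow or any other study cited here.

```python
from statistics import mean


def mean_change_by_group(records):
    """Return the mean change in rating (after minus before) per group.

    `records` is an iterable of (group, before, after) tuples, where
    `before` and `after` are ratings from the two play sessions.
    """
    changes = {}
    for group, before, after in records:
        changes.setdefault(group, []).append(after - before)
    return {group: mean(values) for group, values in changes.items()}


# Hypothetical ratings of aggression in the first and second sessions.
records = [
    ("control", 2, 2),
    ("control", 3, 4),
    ("failure", 2, 5),
    ("failure", 3, 4),
]
print(mean_change_by_group(records))
```

A real analysis would also test whether the difference between groups is statistically significant, which this sketch does not attempt.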

Training and Aggression after Frus-
tration. In an ingenious experiment
Davitz (1952) demonstrated that train- 
ing can affect the reactions to frustra- 
tion of 7-to-9-year-old children. One 
group of subjects was given practice 
with aggressive games; another was 
helped to perform constructive, cooper- 
ative tasks. A typical aggressive game 
was Break the Ball. Each subject was 
provided with a pingpong ball, which 
was placed on the floor and could not
be touched by hand. The object of the
game was to break somebody else’s ball
while protecting one’s own. In a second
game each subject competed physically
with the others to cover a very small
spot on a mat with his body. In a third
game each subject had to rip a piece of
cloth from another subject’s arm and
retain the cloth on his own arm.


The second group of children 
worked cooperatively on the drawing 
of murals and the solution of jigsaw 
puzzles. During these activities all ag-
gressive behavior was discouraged and
constructiveness was praised. To frus-
trate both groups, Davitz stopped a film
at a climactic point and deprived each
subject of some candy that he had
previously received. Play before and
after frustration was then rated for
aggressiveness and constructiveness.
The subjects trained in aggression were
more aggressive after frustration than
those trained to be constructive. The
constructively trained subjects were
more constructive after frustration
than those trained to be aggressive.


Congruence Between Projective 
and Social Behavior 


Projective tests were originally de- 
vised for purposes of clinical predic- 
tion. From the test protocol, the psy- 
chologist obtained information that he 
used as a basis for estimating the 
patient’s cooperation, his probable de- 
fenses, his ego strength, and other 
structural or behavioral characteristics 
of relevance to prognosis. The re- 
searcher, too, employs projective tests 
as predictive instruments. On learning
that a child has expressed much aggres-
sion in play with dolls, an investiga-
tor often infers a disposition, which he
anticipates will also be manifested in
verbal or physical attacks on adults
or peers. To be more accurate, the typi-
cal investigator used to do so more often
than he does now. The assumed rela-
tions between projective protocols and
actual behavior were based on two 
premises. It was assumed that the 
stronger the need, the more frequent 
or the more emphatic would be its ex- 
pression on the test. It was also assumed 
that the higher the score on the test, 
the more strongly or frequently the 
need would be manifested in actual 
social behavior. Research on projec-
tive tests has revealed that these
premises are based on too simple a
picture of the connections between test
performance and behavior.

Contradictory Evidence. Under some
conditions, the frequencies and in- 
tensities of test responses are valid 
prognostic indices of actual behavior. 
Walker (1951) found that ratings of 
hostility in Rorschach protocols were 
significantly related to ratings of actual 
hostility. Most of the evidence, how- 
ever, indicates the lack of a simple as- 
sociation between responses to tests 
and social behavior. In the study by 
Sanford et al. (1943) the needs ex- 
pressed in various projective tests were 
not related to behavior in any con-
sistent manner. Symonds (1949) dis-
covered an inverse relation between the 
frequency of a need in adolescents’ 
stories and its overt manifestation. In 
his opinion, even a strong need that re- 
ceives sufficient expression in behavior 
does not have to be expressed in fantasy. 
Only suppressed needs require such 
internal outlets. 

In Korner’s research (1949), the hos- 
tility that preschool children expressed 
in play was consistent with the results 
of a story-completion test. But the 
relations of play and stories to the sub- 
jects’ overt manifestations of frustration 
and to parents’ ratings of such behavior 
were quite low. Jensen (1957) ad- 
ministered the TAT to three groups of 
boys between 15 and 17 years. Accord- 
ing to their teachers, one group was 
aggressive and was in trouble in school, 
a second group was aggressive but their 
behavior was not antisocial, and a third 
group was passive. Neither a direct 
nor an inverse relation existed between 
aggression on the TAT and the expres- 
sion of the need in school. 


THE MEANING AND CODING 
OF A NEED 


The results of Davids, Henry, 
McArthur, and McNamara (1955) in- 
dicated that clarification of theory and 
greater refinement in the coding of 
projective tests might contribute to an 
increase in predictive efficiency. Ag- 
gression on a projective test is often 
tabulated without regard to the direc- 
tion in which it is expressed. The 
investigators found that amount of ag- 
gression in self-ratings was related to 
direction of aggression against others 
on the TAT but not to the total 
quantity. 


STIMULATION AND CHANGES 
IN TEST SCORES 


Some of the investigations revealed a 
direct correspondence between the 
stimulation of a need and its expression 
on a projective test. When deprived 
of food, subjects increased their ref- 
erences to food in the TAT (Sanford, 
1937). When criticized continually by 
the examiner under conditions that per- 
mitted no outlets other than story writ- 
ing, subjects increased their references 
to aggressive words in protocols of the 
TAT (Bellak, 1944). When given in- 
structions calculated to create a relaxed 
orientation, boys got significantly lower 
scores on the achievement motive than 
they did when they were challenged 
to do well and had performed a com- 
petitive task before taking the projec- 
tive test (Veroff, Wilcox, and Atkin- 
son, 1953). 

Conditions under Which Direct 
Manifestations Are Avoided. Not al- 
ways has arousal been followed by an 
increase in the direct manifestation of a 
need or affect. It was necessary for 
Clark (1952) to measure strong sexual 
needs not by their direct manifestations 
in TAT stories but by their symbolic 
forms of expression. In coding strong 
feelings of guilt, Allinsmith (in Miller 
and Swanson, 1960) had to take into 
account both their direct and their in- 
direct manifestations. Feasibility of ex- 
pression as well as avoidance may affect 
the reaction to arousal. Evidence pro- 
vided by Brozek, Guetzkow, and 
Baldwin (1951) suggests that arousal 
may not be reflected on a projective test 
if there is no possibility of gratifying 
the need. While on a semistarvation diet 
for 24 weeks, their subjects, unlike 
those of Sanford (1937), did not in- 
crease their perception of food in 
Rorschach cards. 


DISPOSITION VERSUS BEHAVIOR 


Impulse and Expression. One factor 
underlying the occasional disparity be- 
tween projective tests and behavior is 
suggested by the findings of Lindzey 
and Tejessy (1956). Of ten signs of 
aggression on the TAT, few were sig- 
nificantly related to extrapunitiveness 
or intropunitiveness in behavior but 
most were significantly related to self- 
ratings of aggression. In other words, 
the subjects knew what they wanted to 
do but did not necessarily follow the 
impulses. Test results thus signify dis- 
positions but not necessarily their forms 
of overt expression. In fact, as 
mentioned, the expression may be ex- 
ceedingly indirect if it is not feasible or 
not compatible with the moral standards 
of the child or his parents and the moti- 
vational state is strong. 

Age, Disposition, and Control. The 
curvilinear relation sometimes ob- 
tained between aggression in fantasy 
and in behavior provides one kind of 
evidence that strong, forbidden re- 
sponses are often inhibited by subjects 
taking projective tests. Preschool chil- 
dren whose behavior was rated by Bach 
(1945) at either extreme of aggression 
expressed more of this motive in doll 
play than the children who were rated 
as moderate. On giving the Rorschach 
Test to subjects between 9 and 15 years 
of age, Smith and Coleman (1956) dis- 
covered that very aggressive behavior 
was associated with medium hostility; 
nonaggressive behavior went with high 
or low ratings. Although the trends in 
both studies are curvilinear, the results 
are not congruent, a fact that may be 
attributed to age differences in the two 
samples. In the preschool group high 
scores in aggression during doll play 
were associated with either high or 
low ratings of behavior. In the pre- 
adolescent group high hostility scores 
on the Rorschach Test were associated 
with low ratings of overt aggression. 
Older subjects had had more practice 
than the younger subjects in controlling 
their aggression and were probably 
more adept at curbing their immediate 
impulses, particularly the stronger ones, 
by deliberate inhibition or defensive 
distortion. 
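
A curvilinear relation of the kind just described can be missed entirely by a linear index. The following Python sketch illustrates the point with invented scores; the numbers are hypothetical and are not taken from Bach or from Smith and Coleman, and the function names are assumptions made for the illustration. It computes an ordinary product-moment correlation and then the mean fantasy score at each level of rated behavior.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Plain Pearson product-moment correlation of two equal-length lists."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)

def group_means(ratings, scores):
    """Mean fantasy score at each level of rated overt behavior."""
    groups = {}
    for rating, score in zip(ratings, scores):
        groups.setdefault(rating, []).append(score)
    return {rating: sum(v) / len(v) for rating, v in sorted(groups.items())}

# Hypothetical data: rated overt aggression (1 = low, 2 = moderate, 3 = high)
# and invented doll-play aggression scores for nine children.
ratings = [1, 1, 1, 2, 2, 2, 3, 3, 3]
scores = [8, 9, 7, 3, 4, 2, 8, 7, 9]

r = pearson_r(ratings, scores)
means = group_means(ratings, scores)
print(f"linear r = {r:.2f}")
print("mean score by rating:", means)
```

With these invented numbers the linear correlation is essentially zero although the extreme groups clearly differ from the moderate group, which is why an investigator who expects inhibition at some levels of a need may prefer to inspect group means before settling on a linear index.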


AVOIDANCE AND DIRECTNESS 
OF EXPRESSION 


Disposition, Fear, and Behavior. 
Direct evidence of the effect of avoid- 
ance on direct expression has been pro- 
vided by two studies. Mussen and 
Naylor (1954) investigated the aggres- 
sive fantasy of boys in the working 
class who were between 9 and 15 years 
and in difficulty because of antisocial 
behavior. When scores on the TAT 
were high, rated behavior varied with 
the fear of punishment expressed in 
the stories. If the fear was strong, the 
subjects were not rated as being very 
aggressive. Only if the fear of punish- 
ment was weak were high scores on the 
test congruent with rated behavior. 

Similar conclusions can be derived 
from Lesser’s research (1957). For 
those of his 10-to-13-year-old subjects 
whose mothers had encouraged aggres- 
sion, he obtained a correlation of .43 
between the expression of the need in 
TAT stories and in behavior. For boys 
with discouraging mothers, the cor- 
relation was minus .41. If aggression 
had been encouraged, overt aggression 
was significantly correlated with ex- 
trapunitiveness and intropunitiveness 
on the Picture Frustration Test. If 
aggression had been discouraged, the 
correlations were not significant. Ap- 
parently children who have become 
anxious about their aggression avoid its 
direct forms in overt behavior and 
sometimes when taking tests. It seems 
necessary for an investigator to obtain 
information about the subjects’ previous 
training or their internalized standards 
if he hopes to avoid misinterpreting 
some of the low projective scores for 
forbidden needs. Such scores have at 
least two meanings. The subjects’ needs 
may actually be weak, or they may be 
strong but not expressed directly be- 
cause of the deterring effects of stronger 
avoidance responses. 
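
The contrast between the two correlations reported for Lesser's subgroups (.43 and minus .41) shows why a third variable should be examined separately rather than pooled. As a rough sketch, the difference between two independent correlations can be tested with Fisher's z transformation. The subgroup sizes used below are hypothetical, since they are not given here, and the function name is an assumption made for the illustration.

```python
from math import atanh, erf, sqrt

def fisher_z_difference(r1, n1, r2, n2):
    """Test the difference between two independent correlations.

    Returns the z statistic and a two-tailed p value based on the
    normal approximation to Fisher's z transformation.
    """
    z1, z2 = atanh(r1), atanh(r2)
    se = sqrt(1.0 / (n1 - 3) + 1.0 / (n2 - 3))
    z = (z1 - z2) / se
    normal_cdf = 0.5 * (1.0 + erf(abs(z) / sqrt(2.0)))
    p = 2.0 * (1.0 - normal_cdf)
    return z, p

# Correlations from the text; the group sizes (25 each) are hypothetical.
z, p = fisher_z_difference(0.43, 25, -0.41, 25)
print(f"z = {z:.2f}, two-tailed p = {p:.4f}")
```

If the two subgroups had been combined, correlations of opposite sign would have tended to cancel, and the investigator might have concluded that fantasy and behavior were unrelated.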

Other Interacting Variables. Fear of 
punishment is one of a number of 
variables that affect the association be- 
tween the expression of aggression in 
fantasy and overt behavior. Otis and 
McCandless (1955) studied two others. 
The investigators frustrated preschool 
children with two of Lerner’s blocking 
methods (in Murphy, 1956). Despite 
repeated frustrations, the subjects varied 
markedly in their expressions of 
dominant-aggressive and compliant- 
submissive responses. The variations 
were significantly related to two needs, 
power-dominance and love-affection, 
on which the subjects were rated by 
teachers. Children with high need for 
power were high on aggression during 
frustration and increased their scores 
during the second half of the series. 
The submission scores of these subjects 
were low during frustration and 
dropped rapidly after that. Children 
with low need for power were low on 
aggression. Strong need for love-affec- 
tion was associated with frequent sub- 
mission and infrequent aggression in 
play. 

The current evidence thus seems to 
indicate relatively complex relations 
between the expression of needs in 
projective tests and in behavior. Overt 
expression may be affected by fear, 
moral proscriptions, the feasibility of 
certain actions, and competing needs. 
An investigator would do well, there- 
fore, to think about the third or fourth 
variables that may affect the way in 
which the need is expressed. If he does 
not take the interacting variables into 
account, he is decreasing his chances of 
obtaining significant results. Of course, 
his results may still be significant if he 
is lucky enough to have stumbled on 
one of the simpler associations between 
projective expression and social be- 
havior or if he has unwittingly chosen 
conditions under which the interacting 
variables were held constant. If sig- 
nificant results are not to be fortuitous, 
the investigator must develop acumen 
in selecting variables that affect the 
association between fantasy and overt 
expression. 


Structure of Projective Tests 


Except for the TAT, few of the 
standard projective tests popular with 
clinicians have been employed very 
much in research on motives and affects. 
Initially, investigators tried the standard 
tests, since they provided many meas- 
ures of different needs, objects, affects, 
defenses, and perceptual styles. As time 
went on, the clinical instruments gave 
way to others designed for particular 
topics and particular populations. Re- 
searchers have shown most partiality to 
techniques such as story completion and 
play, the forms of which can be 
tailored to the particular hypotheses. 
The investigator who analyzes behavior 
in play can choose the number and type 
of dolls and can define the activity to 
restrict the subjects’ range of response. 


Ambiguity and the Goals of Re- 
search. It is the vagueness and the many 
details in projective pictures and ink- 
blots that are of most value to the 
clinician. When given a picture of 
two adults, both of whom are doing 
different things with a number of in- 
struments, a subject can respond to 
many different details (Miller and 
Swanson, 1960). He can concentrate on 
one or both people, a utensil, a clock in 
the background, or even a shaft of 
light. Psychologists assume that forces 
within the subject prompt him to select 
particular stimuli and some of their 
attributes from the total picture and to 
organize them in a manner that reflects 
the nature of those forces. 

All the content in a subject’s stories 
is grist for the clinical mill. All the in- 
formation about different needs, de- 
fenses, and affective states has a bear- 
ing on the structure of the subject’s 
personality, his dynamics, and his 
prognosis. But much of the story con- 
tent is of no use to the researcher, who 
is interested not in everything he can 
find out about one person but in a few 
variables, possibly even one. From all 
the projective material he chooses the 
references that help him to rank the 
subject on the experimental variables. 

Choice of Stimuli and Validity. The 
traditional projective test is most fallible 
when it is used to rank all subjects on a 
variable. If a subject has selected his 
own stimuli and given them his own 
meanings, there is no guarantee that all 
or any of his associations will be 
pertinent to the problem of the study. 
In fact, there are a number of reasons 
why a child’s fantasy or behavior may 
not be pertinent to the investigator's 
interest. The child may be high on a 
need but get an invalidly low score 
because he is consciously or uncon- 
sciously suppressing references to it or 
distorting it. He may get an invalidly 
low score because the test contains few 
stimuli relevant to the topic of the re- 
search. 

Interpretation of Protocols and Am- 
biguity. When he interprets the sub- 
jects’ responses to ambiguous stimuli, 
the investigator must sometimes assume 
they have a “common meaning.” Card 
IV of the Rorschach Test, for example, 
is supposed to signify the father to most 
subjects (Levy, 1958), and a number 
of latent meanings have been proposed 
for cards of the TAT (Henry, 1956). 
Yet the more ambiguous the stimulus, 
the smaller the probability of a common 
meaning for all subjects (Heath, 1958). 

Structured Projective Instruments. 
To benefit from the primary asset of 
projective tests, one must use stimuli 
with some ambiguity so that the sub- 
ject can make a choice that reflects the 
nature of his motive. To rule out 
responses that are irrelevant to the 
study, one must use items with par- 
ticular kinds of structure. Some experi- 
menters have tried to achieve these 
goals by working with the more struc- 
tured items of traditional tests. From 
the TAT, Mussen and Naylor (1954) 
selected the cards that seemed most 
likely to elicit particular needs and 
affects. Other experimenters have de- 
vised their own tests. To elicit feelings 
about oral sadism, Blum (1949) used a 
picture of an angry dog biting a collar. 
To obtain expressions of sibling rivalry, 
Levy (1937) showed each child a 
mother doll feeding a baby and then 
asked the subject what he would do. 
Such stimuli are far less ambiguous than 
an inkblot or a picture of a vague figure 
in the darkness. So clear are the mean- 
ings of themes that even a lack of re- 
sponse can be interpreted with some 
confidence. Despite their structures, 
such tests retain one advantage of a 
projective test: they still permit the 
subject a range of reactions. 
Constructing an Unambiguous Test. 
Allinsmith (in Miller and Swanson, 
1960) has discussed the advantages of 
the structured story-completion test 
and some of the difficulties of de- 
signing one. He was concerned with the 
strength of guilt feelings expressed in 
response to stories about immoral be- 
havior. To make it clear that the hero 
was motivated by guilt rather than fear, 
the investigator defined the specific 
violation of a moral standard in each 
story beginning. The violations were of 
a kind that could not be detected. Each 
theme specified the act, the extent of 
the violation, the motive behind it, and 
the relationship between hero and 
victim. These facts were included in 
order to insure a comparable inter- 
pretation of the story by all subjects. 
Among the other variables listed by 
Allinsmith as requiring control in pro- 
jective tests of guilt feelings were the 
hero’s initial wish and his intention 
when he committed the act, the sex 
of the victim, symbolic connotations 
of the violation, and the order of stories. 
The satisfaction of so many criteria 
requires considerable work in the writ- 
ing of stories and in conducting pilot 
studies. But if the investigator has even 
moderate success, he can collect final 
data with some confidence that he has 
eliminated one possible source of error. 
He will have specified the facts, both 
implicit and explicit, so that they will 
be hard to misinterpret. All subjects, 
then, will write endings with reference 
to approximately the same event. 
Multiple Choice Endings. It is some- 
times difficult to choose between a 
partially structured but open-ended test 
and one that is completely objective. 
Advantages of objective tests, like high 
reliability, ease of administration, and 
automatic scoring, are too well known 
to require discussion here. The dis- 
advantages of objective tests require 
more attention. The most obvious one 
is the sacrifice of the most commonly 
mentioned asset of projective tests: the 
subject's freedom to select the stimuli 
to which he will respond. The more 
structured the test, the more limited is 
the subject in his attempts to organize 
his percepts. If the test is completely ob- 
jective, he cannot even give responses 
that the examiner did not anticipate. 

It is sometimes possible that a list of 
alternative answers provided by the ex- 
aminer will change the subject's frame 
of reference. He may not have thought 
of some of the answers in the test, or 
he may have thought of similar ones 
with slightly different meanings. If his 
own answers are not on the list, he may 
have no choice but to check the 
closest alternatives, even if they mis- 
represent his reactions. If his reactions 
seem pathological in comparison with 
some of those furnished in the test, he 
may become anxious and, for conscious 
or unconscious reasons, choose the more 
socially acceptable answers. 

Although the potential problems that 
have been listed require the investiga- 
tor’s careful consideration, they do not 
preclude the use of objective projective 
tests. Unlike the clinician, the researcher 
is not necessarily interested in many 
free associations or in the dynamic 
organization of a mass of material. If 
he is sufficiently ingenious and he con- 
ducts a careful pilot study, he can 
construct endings that are relevant to 
the topic under investigation, are 
phrased in terms that are familiar to 
most subjects, and include most of the 
reasonable alternatives. If his subjects 
have not reached adolescence, he can 
often count on their being relatively 
naive about the psychological sig- 
nificance of the endings. Younger sub- 
jects are usually less guarded than older 
ones. In sum, objective tests have their 
assets and their liabilities. An investiga- 
tor can make an intelligent choice of a 
test only after careful consideration of 
such issues as his hypotheses and his 
population. 


CONTENTS OF TESTS 


Animal and Human Characters. In 
addition to the structures of empirical 
tests, their contents have provided in- 
vestigators with a number of challeng- 
ing empirical problems. It is often as- 
sumed that animal characters elicit more 
self-revealing information than human 
characters. A test with pictures of 
rabbits elicited longer stories from chil- 
dren between 5 and 10 years of age 
than TAT cards (Bills, 1950), but the 
animal stories did not yield the same 
trends as the stories obtained from the 
TAT. Nor did the animal stories of a 
group of third graders yield the same 
trends as their behavior in play therapy 
(Bills, Leiman, and Thomas, 1950). 

Of course, the playroom and the 
TAT both differ in stimulus value from 
pictures of animals. In comparing stories 
about animals and people, Biersdorf and 
Marcuse (1953) tried to control dif- 
ferences in the stimulus values of the 
pictures. These investigators con- 
structed a special set of pictures, which, 
with one exception, were identical with 
the CAT cards, even in the emotional 
expressions of the characters. The ex- 
ception: the special pictures portrayed 
people instead of animals. There were 
no significant differences between the 
two tests on such quantitative measures 
as response time and number of ideas. 
Mainord and Marcuse (1954) admin- 
istered the CAT and the special cards 
to a group of disturbed boys and girls 
whose average age was seven years. 
Again there were no significant dif- 
ferences in the quantitative measures de- 
rived from the two tests, but ratings of 
the clinical usefulness of the results 
favored the cards with human figures. 
Light (1954) and Furuya (1957) found 
that stories based on pictures of human 
beings exceeded CAT stories in the 
expression of feelings, number of con- 
flicts, and amount of dynamically in- 
teresting content. 

In most studies, then, pictures of 
people have been equal or superior to 
the animal pictures in eliciting material 
of interest to the psychologist. Murstein 
(1959) summarized much of the litera- 
ture on the structure and content of 
projective tests. His review led him to 
conclude that the most productive pic- 
tures are only moderately ambiguous 
and portray culturally acceptable hu- 
man beings who are not so similar to 
the subjects as to create self-conscious- 
ness. 

Type, Quantity, and Presentation of 
Toys. The content of the play situation 
has also been the subject of empirical 
work. Both the preference for toys 
(McDowell, 1937) and attention span 
for them (Moyer and Gilmer, 1955) 
varied with age. The choice of toys 
and their use in play also varied with 
sex differences (McDowell, 1937) and 
with the amount of available equipment 
(Johnson, 1935). When many materials 
were provided in a playground, the chil- 
dren devoted themselves primarily to 
these objects. With less equipment, the 
children were not so inclined to play 
with toys and were more likely to 
initiate social contacts, play games, and 
display aggressive behavior. 

The kinds of toys and the ways in 
which they are presented may affect 
style of play. When play materials were 
organized as part of a conventional 
house, children expressed more aggres- 
sion than they did when the materials 
were just spread on the floor (Pintler, 
1945). Dolls like the subjects’ own 
family constellations elicited more 
direct aggression than standard figures 
(Robinson, 1946). Realistically clothed 
dolls and recognizable furniture stimu- 
lated more exploratory behavior but less 
systematic arranging by subjects than 
undressed stuffed dolls and blocklike, 
ambiguous furniture (Phillips, 1945). 


SOURCES OF ERROR IN THE 
TESTING SITUATION 


In discussing the results of different 
projective techniques, Lerner (in 
Murphy, 1956) noted that children 
often express a need on one test but 
not on another. He also observed that 
reactions to tests differ in accordance 
with the sex of the examiner, the child’s 
previous experience with the test, and 
even the day or the particular time of 
day. When such sources of variance are 
large, the results may reflect the at- 
tributes of the situation more than the 
enduring traits of the subjects. The 
error variance can often be diminished 
by the addition of certain controls to 
the design. Some of the needed controls 
have already been mentioned. Others 
suggested by studies of the potential 
sources of error in the testing situation 
are now considered. 

The Examiner’s Influence on Play. 
When the examiner was an interested 
and approving onlooker, subjects ex- 
pressed more thematic aggression, more 
nonstereotyped play, and more changes 
in themes than they did when they had 
minimal interaction with the examiner 
(Pintler, 1945). Amount of contact 
with an examiner also affected behavior 
during play (Phillips, 1945). From the 
first to the third sessions there was a 
significant increase in aggression and 
decreases in exploratory behavior and 
stereotyped thematic play. But, when 
Siegel (1957) eliminated the examiner, 
the aggression decreased in the second 
session. In Siegel’s opinion, the presence 
of an adult motivates the child to ab- 
dicate control. If the child is alone, 
his guilt becomes aroused when he is 
tempted to engage in disapproved acts. 
Since the amount of temptation in- 
creases with time, he becomes less ag- 
gressive from one session to another. 

The Examiner’s Influence on the 
Rorschach Test. It is commonly rec- 
ognized that scores on an intelligence 
test vary with differences in the ex- 
aminer’s behavior (Sacks, 1952). The 
variation is probably greater for projec- 
tive tests. The potential effect on 
projective protocols of the examiner's 
personality is illustrated by the study of 
Sanders and Cleveland (1953). 
Rorschach Tests were administered 
with a standardized inquiry by nine 
examiners to randomly assigned, normal 
subjects. Of 38 scoring categories, 
there were 20 on which the subjects 
varied significantly, depending on the 
examiners who had done the testing. 
The protocols also varied with the 
subjects’ impressions of the examiners’ 
hostility and anxiety and with scores on 
these variables computed from Ror- 
schach Tests that the examiners had 
taken before they received clinical 
training. 
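
To judge how far 20 significant categories out of 38 exceed what chance alone would produce, a rough binomial calculation can be sketched as follows. The sketch assumes a .05 significance level for each category and independence among the categories; neither assumption is stated in the account above, and Rorschach categories are in fact intercorrelated, so the result is only an order-of-magnitude guide. The function name is hypothetical.

```python
from math import comb

def binomial_upper_tail(k, n, p):
    """Probability of k or more successes in n independent trials."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))

n_categories = 38    # scoring categories examined (from the text)
n_significant = 20   # categories on which examiners differed (from the text)
alpha = 0.05         # assumed significance level per category

expected_by_chance = n_categories * alpha
tail = binomial_upper_tail(n_significant, n_categories, alpha)
print(f"expected by chance: {expected_by_chance:.1f} categories")
print(f"P(at least {n_significant} by chance) = {tail:.2e}")
```

Under these assumptions only about two categories would be expected to reach significance by chance, so the examiner differences cannot reasonably be dismissed as accidents of multiple testing.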


Conditions of Administration. Other 
studies suggest the existence of addi- 
tional sources of error. Rorschach 
scores varied markedly, depending on 
whether or not the subjects were very 
ego-involved in the test and whether 
it was defined as a measure of 
intelligence, nervousness, or imagina- 
tion (Calden and Cohen, 1953). Scores 
on the TAT differed for group and 
individual methods of administration 
(Sarason and Sarason, 1959). 


Socioeconomic Backgrounds. Sub- 
jects’ protocols can vary with personal 
attributes other than aspects of per- 
sonality. In verbal projective tests the 
performances are often related to the 
subjects’ socioeconomic backgrounds. 
Reissman and Miller (1958) postulated 
that the contrasting stories often told by 
subjects from blue-collar and white- 
collar backgrounds may result from 
differences in willingness to participate, 
capacity to establish rapport with the 
examiner, verbal skill, and previous ex- 
perience in making up stories about pic- 
tures. The fact that people in the pic- 
tures appear to come from the middle 
class may also help subjects from that 
socioeconomic level to be productive. 

Social backgrounds are also mirrored 
by responses to the Rorschach Test. 
Compared to the white-collar workers, 
manual laborers gave more signs of mal- 
adjustment, produced fewer responses, 
and made fewer references to color or 
movement. Reissman and Miller were 
inclined to attribute the frequent signs 
of stress in the Rorschach protocols of 
manual laborers to their lack of fa- 
miliarity with tests in general. It will 
apparently be necessary to develop 
special norms for subjects from the 
working class. 


OBSERVATIONS AND REPORTS 
ABOUT CHILDREN 


Most of the methods mentioned thus 
far are feasible when subjects are avail- 
able, able, and willing to produce some 
special form of behavior: to discuss 
their fears, to finish a story about a 
curious boy and some dangerous 
machinery, to break balloons, to play 
with toys. 

In some types of investigation it is 
not feasible to manipulate subjects 
physically or psychologically. They 
cannot be put into special situations or 
even asked any questions. There are 
many possible reasons for such prohibi- 
tions. It may not be advisable to disturb 
children who have just had an opera- 
tion; a clinician may fear that testing 
will interfere with psychotherapy; the 
children may have so many assignments 
at school that they cannot spare the 
time needed for participation in the 
study. 

Even when the subjects are not avail- 
able for manipulation and testing, it 
may still be feasible for the investigator 
to observe them or to get impressions of 
them from people who know them well. 
This concluding section is devoted to 
the different sources of information 
about children. The sources to be dis- 
cussed are the investigator, teachers, 
parents, and peers. Included in the dis- 
cussion of each source are suggested 
techniques for obtaining the informa- 
tion and representative topics that have 
been investigated by such techniques. 


Experimenter’s Observations 


SUBJECTIVE APPROACHES 


Anthropological Observation. Eth- 
nological accounts of child development 
sometimes provide valuable links be- 
tween social structure and motivation. 
Such a linkage is illustrated by Henry’s 
description (1944) of the sexualized 
aggression of Pilaga children, Bateson 
and Mead’s portraits (1942) of parental 
teasing and emotional development in 
Bali, and Asch’s evidence (Murphy, 
Murphy, and Newcomb, 1937) of non- 
competitiveness in the conversations of 
Hopi children. 

Clinical Cases. So large is the current 
clinical literature on children’s prob- 
lems that it is not possible to do more 
in the available space than suggest some 
of the different topics. Some typical 
clinical reports are those of Levy (1937) 
on affect hunger, Redl and Wineman 
(1951) on the control and defenses of 
children who have suffered years of 
abuse and neglect, and Smalley (1930) 
on variations in sibling rivalry with sex, 
age, and intelligence. 

Clinical Observations of Specific 
Events. Subjective observations of 
children can be quite productive when 
applied to circumscribed and clearly 
defined topics. An interest in the con- 
genital aspects of sucking, for example, 
led Ribble (in Hunt, 1944) to observe 
600 neonates for signs of exaggerated 
sucking activity. Jessner, Blom, and 
Waldfogel (1952) compiled case studies 
of children before and after tonsil- 
lectomies and adenoidectomies. 

Clinical reports are usually couched 
in terms of particular theoretical posi- 
tions and convey the impression that the 
writers may not have paid sufficient at- 
tention to phenomena that are not 
covered by the theory. The disinclina- 
tion of some ethnologists to specify 
their psychological assumptions creates 
problems of interpretation for the 
readers of field studies. Other common 
failings of both anthropological and 
clinical reports are a lack of precise 
definitions of psychological concepts, 
a paucity of cases, an insensitivity to 
the representativeness of the sample, 
claims of significance without support- 
ing statistical evidence, and a lack of 
control groups. Despite such deficien- 
cies, anthropological and clinical studies 
often contain fruitful hunches, some of 
which are later transformed into test- 
able hypotheses by resourceful experi- 
menters. 


COMBINATIONS OF SUBJECTIVE 
AND OBJECTIVE METHODS 


Continuous Parallel Records. The 
goal of objective analysis is coveted by 
virtually all investigators, but many dis- 
approve of the rating or tabulating of 
variables at the time of the original be- 
havior. The disapproval seems cogent 
when applied to studies of complex 
reactions or to studies in which there 
are many coding categories. In both 
kinds of research the judges may be- 
come very uncertain about their 
evaluations or even about the facets of 
the situation requiring most attention. 

To prevent such confusion, the in- 
vestigator can gather continuous steno- 
graphic records of events and then 
interpret the results at his leisure. In 
his investigation of autocratic and dem- 
ocratic group atmospheres Lippitt 
(1939) compiled records of the con- 
versations and activities of groups of 
10-year-old boys. Among the data were 
a running account of social interactions, 
a minute-by-minute analysis of group 
structure, notes on changes in the 
group's atmosphere, and a continuous 
stenographic record of all conversations. 
It was this kind of information that he 
analyzed when he tabulated the fre- 
quencies of such reactions as aggressive- 
ness, competitiveness, persecution of 
scapegoats, and frustration because of 
lack of structure. 

Specimen Records. Barker and 
Wright's (1954) specimen records and 
the method of coding them represent 
the most sophisticated use to date of the 
continuous record as a means of 
analyzing motives and affects. Each 
specimen record referred to all the 
events observed in a defined situation 
during a period of thirty minutes or 
less. The observer dictated the results 
immediately after the period was over. 
A supervisor then raised questions about 
the points he did not understand, and 
the observer added information to the 
record in order to eliminate sources of 
confusion. 

In the analysis of results the record 
was divided into natural episodes, each 
with its settings, behavior, and objects. 
The analytic code contained a category
of matrix factors, such as action sequence,
strength of motivation, centrality
of motivation, and social field
potency. Another category consisted
of modes of action, such as dominance,
aggression, resistance, submission, nurturance,
appeal, and avoidance. There
were also action attributes, such as
pressure, affection, mood, and evaluation.
Finally, there was a category of
aggression, including attacks, their end,
spontaneity, and emotional concomitants.

A Time Sample of Continuous Rec- 
ords. In her study of the behavioral 
problems of elementary-school children, 
Young-Masten (1938) employed a combination
of time-sample and continuous
record. Each child was the object of at
least ten fifteen-minute observations. 
When at all possible the observations 
were spread over every activity of the 
school program. During each session 
an attempt was made to record all the 
overt behavior of the child. Unlike 
Barker and Wright, Young-Masten 
did not make inferences about underly- 
ing motives. Once the records were 
compiled, they provided the basis for 
listing each child’s unacceptable be- 
haviors, such as talking without permis- 
sion, standing up in one’s seat, and 
pushing. 


THE STRENGTHS AND WEAKNESSES
OF SUBJECTIVE METHODS


It is common among toughminded 
social scientists to stress the weaknesses 
of subjective methods. There is no 
gainsaying the seriousness of some of 
the defects. The frequency with which 
case studies depend on unchecked re- 
call by biased informants, the lack of 
precautions concerning halo effects, 
the ease with which observations can 
be stretched and tailored to fit the 
procrustean bed of a preconceived 
theoretical system, the usual stress on 
qualitative judgments and relative lack
of concern with frequencies, the omission
of critical facts because the collection
of data was not planned with
sufficient care—these and other damn- 
ing comments are often made about 
some kinds of subjective observation. 

A survey of the literature on chil- 
dren’s motives and affects conveys the 
impression that many investigators, 
particularly those with positivistic
orientations, are far more aware of the 
serious weaknesses than of the strengths 
of subjective methods. As a result, such 
methods are sometimes used without 
sufficient care or rejected when they 
could be very useful to the investigator. 
It is important, then, to list their assets 
as well as their liabilities. 

Often mentioned as an asset of sub- 
jective observation is the fact that 
it permits an analysis of events in 
natural units. Such units may be quite 
small or quite large and may vary with 
the situation. If molar units are nec- 
essary, events do not have to be dis- 
torted by arbitrary atomistic analysis. 

When an observer keeps records of 
total situations, he can later view each 
event in its context. The information 
about events in all their complexity 
also makes it possible to study their 
continuity. The completeness of the 
record facilitates analyses of genotypic 
forces. 

These, then, are some of the assets 
claimed for subjective methods. But 
do the assets compensate for the serious 
liabilities? The answer is positive for 
some topics. Few objective methods can 
do justice to the complexity and or- 
ganization of some complex phenomena, 
such as those typically encountered in 
a clinic. Subjective methods, with all
their defects, are particularly helpful
during the initial exploration of a topic.
Such methods may provide the original 
hunches that are later to be defined as 
testable hypotheses. 

Check Lists


NATURAL SITUATIONS


Emotional Adjustment to Nursery
School. If the investigator is clear about
his operations and they are not very
complex, he can obtain a great deal
of information in a relatively short time
by means of a check list. Heathers
(1954) used a check list to evaluate 
the initial adjustments of 2-year-olds to 
a nursery school. Among the behaviors 
that he looked for in each child were 
hiding to avoid the trip, resisting 
attempts to dress him, seeking reas- 
surance or comforting, crying, calling 
to mother, and clinging to mother. 

Aggression. Dawe (1934) analyzed 
the quarrels of preschool children dur- 
ing free play by means of a combined 
record blank and check list modeled 
after Goodenough’s (1931). The re- 
search was restricted to conflicts about 
possessions, physical violence, inter- 
ference with an activity, and difficulties 
in social adjustments among children. 
The data were coded in terms of such 
categories of people as precipitator and 
nonparticipator and such categories of 
behavior as aggressive, retaliative, pas- 
sive, and objecting. 


CONTRIVED SITUATIONS 


Ascendance. Jack (1934) successively 
paired each of a group of preschool
children with each of ten others and
observed the play of each pair with
three sets of toys in a sandbox for a
period of five minutes. Eight categories
of ascendant behavior were tabulated.
They were verbal attempts to secure
play materials, forceful attempts to secure
play materials, success in securing
play materials, defending or snatching
back materials taken from one’s possession,
verbal attempts to direct behavior
of the companion, compliance
by the companion with one’s direction,
forbidding, criticising, or reproving the
companion, and providing a pattern of
behavior which the companion imitates.

Fear. Holmes (in Jersild and Holmes,
1935) observed children in the age
range of 1 to 8 years and filled out
check lists to describe their behavior
when they were placed in such threatening
situations as being left alone for
a few minutes, deprived of support 
on a collapsing board, and asked to 
retrieve a ball from a dark room. Each 
subject was classified as performing the 
act without hesitation, performing it 
but after some delay or after an in- 
tervening act, performing it with the 
help or company of the experimenter, 
or completely refusing to try. 


Investigator’s Ratings 


STANDARD PERSONALITY 
SCALES 


The California Inventory. Rating 
scales are among the most popular 
techniques for collecting data about
motivation. Of the personality scales
that have been carefully standardized,
the best known are the California Behavior
Inventory (Conrad, 1932; Read,
1940) and the Fels Child Behavior 
Scales (Richards and Simons, 1941). 
From Conrad’s original instrument, 
Read collected the items judged by a 
group of experts as being most im- 
portant for the study of the general 
adjustment and the personality makeup 
of children. For each trait in the in- 
ventory, the rater is presented with the 
definitions of two ends and the middle 
of a seven-point scale. Items are phrased 
in terms of situations commonly found 
in nursery school. Some of the items are 
persistence in the face of difficulty, 
happiness, desire for affection, emo- 
tional inhibition, suspiciousness of 
people, quarrelsomeness, and nervous 
habits. 

The Fels Scales. There are 30 Fels 
Scales. According to the authors, the 
scales are suitable for children in pre- 
school or in the first four grades. In 
deciding on the rating of a trait, the 
judge makes a choice from four or 
five cues that are placed at appropriate 
points on a line. Each cue is indicative 
of a degree of the trait. The position 
of the cue on the line was determined 
by judges’ ratings in accordance with 
the procedure developed by Champney 
(1941). The scales cover such traits as 
affectionateness, aggressiveness, com- 
petitiveness, cruelty, emotional control, 
friendliness, and vigor of activity. Reliability
coefficients for individual items
are in the upper .70’s or above; the
mean is .89. A multiple factor analysis 
of the scales revealed three factors. One 
is desirable, mature behavior, such as 
competitiveness, friendliness, and leadership.
A second is defined in such
terms as independence, nonconformity,
and antagonism and is illustrated by
such traits as aggressiveness, cruelty,
lack of emotional control, and jealousy.
The third factor, lack of social apprehensiveness,
is exemplified by cheerfulness,
curiosity, and a sense of humor.


SCALES DEVELOPED WITHIN 
RESEARCH PROJECTS 


Strengths of Motives. Frenkel-Brunswik
(1942) asked three judges to rate
the motives of a group of preadolescents
and adolescents on five-point
scales. The nine motives, which were
derived from Murray’s list, were autonomy,
social ties, achievement, recognition,
abasement, aggression, succorance,
dominance, and escape. The
judges, who had known the subjects
for a number of years, were asked to
rate underlying motives and also variables
of social behavior. The combined
ratings for the three judges had reliabilities
of the order of .70 for all categories
but autonomy and succorance.
The analysis of the data revealed that
the same motivational state could be 
expressed in a number of forms. Girls 
high on abasement, for example, were 
high on manifestations of altruism or 
insecurity or both. 

Affective Reactions. Arsenian (1943) 
used a ten-point scale to rate the se- 
curity of 1-to-3-year-old children in a 
strange playroom. The scale varied 
from crying at one extreme to a free 
approach at another. Children were in 
the room for intervals of five minutes, 
during which time they were observed 
through a one-way screen.

A lack of security might also be attributed
to the elementary-school children
who got high scores on Young-Masten’s
rating sheet for symptoms.
Four-point scales were used to rate
children on traits such as sulkiness,
nervousness, stubbornness, and giggling.
In studies of preschool children McHugh
(1943) rated their degree of shyness
and negativism and Duffy (1930)
rated their excitability.

To develop an index of regression, 
Barker, Dembo, and Lewin (1941) 
rated the constructiveness of children’s 
play on a seven-point scale. Schiff,
Dougan, and Welch (1949) used a clinical
rating of anxiety as a criterion for a
study of the psychogalvanic response
and the electroencephalogram. The pa- 
tients and normal subjects in this study 
were between the ages of 7 and 16 
years. 


Time Samples 


SUBJECTIVE OBSERVATION 


A time sample provides an objective 
basis for the ranking of subjects on a 
dimension. In devising a method, the 
investigator has to define his terms and 
to decide on the minimal number of 
observations and their duration. Both 
differ greatly in various studies, usu- 
ally as a function of the problems be- 
ing investigated. 

Short Periods. Green (1933) made 
very short observations of the quarrels 
of preschool children. Such events are 
very tempestuous. They flare up sud- 
denly and often run their full course 
within a matter of seconds. It was not 
necessary, therefore, for the investigator
to observe a child for more than
thirty seconds at a time. The sample
of behavior was always taken in a free-play
period when there was maximal
opportunity for open aggression. The
data consisted of tabulations and of
subjective observations. Information
was gathered about the nature of the
activity, the subject's companions, the
name of the aggressor, the manner in
which antagonism was expressed, and
the subject’s reactions. From the different
observations, Green developed
indices of friendly and quarrelsome
contacts and a ratio of quarrelsome contacts
to cooperative play. The reliabilities
of these indices varied between .77
and .90.


Longer Periods and Specimen Records.
In contrast to Green, who was
interested in a more circumscribed
topic, Young-Masten (1938) studied
behavior problems in elementary-school
children. Such problems can be manifested
in different ways and can vary
markedly in duration. Since she hoped
to obtain data about many kinds of misbehavior,
she observed each child for
fifteen minutes at a time and attempted
to spread her ten observations among
all the activities of the school program.
She did not classify the difficulties beforehand
and make objective observations.
Instead, she recorded all the overt
behavior of a child during the period
of observation. Later she identified different
behavior problems from a perusal
of the records.


OBJECTIVE RECORDS 


Social Contacts. Mengert (1931)
studied the overt friendly behavior of
paired 2-year-old children by means of
twenty-minute time samples. Beaver 
(1932) tabulated the initiation of social 
contacts by 2-to-4-year-old children by 
means of five-minute time samples dur- 
ing free play. 

Aggression. In a study of the aggres- 
sion of children between the ages of 2 
and 4 Body (1955) obtained daily sam- 
ples of seven and one-half minutes for 
eighteen days. A sample was broken up 
into three periods, each lasting two and
one-half minutes. Included in the tabulations
were four categories each of
physical and verbal aggressions toward 
teachers, children, and objects. Lippitt 
(1941) investigated hostility in the be- 
havior of preschool children by getting 
time samples of one minute per day 
over a period of thirty days. Walker 
(in Dawe, 1934) studied quarrels of pre- 
school children by placing them in a 
room in pairs for six minutes, providing 
toys, and recording aggression every 
five seconds. Among his categories were 
screaming, pleading, and passive relin- 
quishment. 
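
Walker's procedure is a form of interval recording: a six-minute session checked every five seconds yields 72 recording points for each pair. The following Python sketch shows how such a record sheet might be laid out and tallied. The function names, the use of `None` for an interval with no aggression, and the sample entries are illustrative assumptions, not details reported for the original study.

```python
from collections import Counter


def interval_count(session_seconds, interval_seconds):
    """Return the number of whole recording intervals in a session."""
    if interval_seconds <= 0:
        raise ValueError("interval length must be positive")
    return session_seconds // interval_seconds


def tally_categories(records):
    """Count how often each category was checked; None marks an empty interval."""
    return Counter(r for r in records if r is not None)


# Six-minute session with one record every five seconds.
n = interval_count(6 * 60, 5)

# Hypothetical record sheet: one slot per interval, filled in by the observer.
sheet = [None] * n
sheet[3] = "screaming"
sheet[4] = "screaming"
sheet[10] = "pleading"

counts = tally_categories(sheet)
```

A sheet of this kind keeps the number of judgments per session fixed, so tallies from different pairs can be compared directly.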

Dependence. To record social inter- 
actions between preschool children and 
between children and adults during 
free-play periods, Marshall and McCandless
(1957) made a minimum of
şo two-minute observations of each 
child over a period of one calendar 
month. The investigators tabulated as- 
sociations between subjects, friendly 
and neutral approaches, conversations 
lasting one half minute or more, and 
hostile verbal or physical behavior. An 
index of dependence was developed to 
analyze the data. It was the sum of the 
interactions with adults proportional 
to the length of observations. 
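
One plausible reading of this index is the number of tallied interactions with adults divided by the total time the child was observed. The Python sketch below computes it under that assumption. The `Observation` record, its field names, and the per-minute unit are hypothetical, and the investigators' exact scoring rules are not given here.

```python
from dataclasses import dataclass


@dataclass
class Observation:
    minutes: float           # length of this observation period
    adult_interactions: int  # tallied interactions with adults in the period


def dependence_index(observations):
    """Interactions with adults per minute of observation (assumed definition)."""
    total_minutes = sum(o.minutes for o in observations)
    if total_minutes <= 0:
        raise ValueError("no observation time recorded")
    total_interactions = sum(o.adult_interactions for o in observations)
    return total_interactions / total_minutes


# Hypothetical record: three two-minute observations of one child.
child = [Observation(2.0, 1), Observation(2.0, 0), Observation(2.0, 2)]
index = dependence_index(child)
```

Dividing by observation time keeps the index comparable across children who were observed for different total lengths of time.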

Oral Tendencies. As part of an investigation
of nervous habits of normal
children in elementary school, Olson 
(1929) obtained time samples of oral 
gestures like thumb sucking. He found 
that the frequency of oral gestures was 
negatively correlated with nutritional 
status in children between the ages of 
7 and 13 years and that it was high 
among overweight children between 10 
and 12 years. Blum and Miller (1952) 
measured orality by means of eight 
two-minute samples of such nonpurpo- 
sive mouth movements as thumb suck- 
ing, lip licking, tongue rolling, and 
bubbling. The frequency of these 
movements was significantly related to 
the amount of ice cream consumed 
when sated and to teachers’ ratings of 
eagerness for lunch. 

Affect. Swan (1938) used five-minute 
samples to collect data on the facial 
expression of emotion during periods 
of free play. Her children varied be- 
tween 2 and 4 years of age. In a study 
of emotional instability in preschool 
children Lee (1932) collected three- 
minute time samples of shifts in mood. 
Each sample was divided into 12 fifteen- 
second units. For each unit, a judgment 
was made on a seven-point scale, rang- 
ing from audible expression of rage or 
grief to audible expression of pleasure. 
The scores consisted of frequency in 
shifts of mood and average mood level. 
To learn about the initial adjustment 
of children to nursery school, High- 
berger (1955) obtained 20 six-minute 
time samples of the aggression, initia- 
tive, and anxiety revealed in the inci- 
dents that occurred over the first nine 
days. 
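
The two scores described for Lee's samples, frequency of shifts in mood and average mood level, can be illustrated with a short Python sketch. It assumes that the seven scale points are coded 1 (rage or grief) through 7 (pleasure) and that any change in rating between consecutive fifteen-second units counts as one shift. Both conventions are assumptions made for illustration and may differ from the original scoring.

```python
def mood_scores(ratings, units=12, scale_max=7):
    """Return (number of shifts, mean level) for one time sample of ratings."""
    if len(ratings) != units:
        raise ValueError(f"expected {units} ratings, got {len(ratings)}")
    if any(not 1 <= r <= scale_max for r in ratings):
        raise ValueError("rating outside the scale")
    # A shift is counted whenever two consecutive units receive different ratings.
    shifts = sum(1 for a, b in zip(ratings, ratings[1:]) if a != b)
    level = sum(ratings) / len(ratings)
    return shifts, level


# Hypothetical three-minute sample: twelve fifteen-second judgments.
sample = [4, 4, 5, 5, 5, 3, 3, 4, 4, 4, 4, 4]
shifts, level = mood_scores(sample)
```

Here the child's rating changes three times, and the mean level is 49/12, a little above the midpoint of the scale.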


USES AND ABUSES OF TIME
SAMPLES


The time-sampling method has most
of the assets of check lists and rating
scales: it is simple and not very time
consuming, it is objective, and lends
itself to quantitative analysis. It has a
special asset: it is based on many reactions
of a subject over a period of
time. As a result, it is a good representative
measure, and it permits the
analysis of variability from day to
day.

Like check lists and rating scales,
time samples have their liabilities. They
provide data that can be interpreted in
many ways, and they omit information
about context. When misused, they can
present an oversimplified picture of an
event, and they can be affected by the
observer’s bias.

Usually, tabulations require quick 
judgments, so that time samples cannot 
be too complicated. A common error 
in planning the collection of time sam- 
ples is the inclusion of too many cate- 
gories. The observer can attend to only 
a limited number of events. Beyond the 
limit, his ratings become unreliable. 

Time samples are more useful with 
observable than inferential material. 
When an investigator's hypothesis re- 
quires some inference, he must decide 
whether to retain the method of time 
sampling, shift to another method, or 
jettison the hypothesis. Such a decision 
had to be made by Swan (1938), who 
could not get sufficient agreement 
among observers in their ratings of aggression,
affection, or general bodily
activity. She decided to eliminate these
variables. It was certainly an asset of
the technique that it permitted the as- 
sessment of reliability. But the decision 
to abandon part of the study because 
the time sample did not provide suffi- 
ciently reliable information suggests 
that the method may not have been the 
most appropriate one for the research. 
Although the final study produced val- 
uable data about the affect that could 
be studied in terms of facial expression, 
information was lacking about some of 
the original problems of the research. 
Another investigator might have been
inclined to change the method or to
add another.

Time sampling is not suited to some
kinds of topics. Failure, for example,
occurs so infrequently in many schools
that a random sampling of behavior
during the administration of tests is
likely to miss critical incidents. The
method of time sampling is also not
suited to subtle and complex types of
events, which can have varying meanings
and which are spread over time
units of varying duration. Expressions
of sympathy, for example, can take
many forms. They can include a fleeting
smile, an earnest discussion, and
assistance with work. The duration of
each form of expression is not necessarily
a good index of the strength of
sympathy: the fleeting smile may express
more sympathy than assistance
with physical labor. Tabulations within
standard units of time do not provide
a meaningful comparison of degree of
sympathy in the smile and the earnest
discussion.
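
The risk of missing infrequent incidents can be made concrete with a rough calculation. The Python sketch below assumes that each incident is momentary, that incidents fall independently and uniformly at random within the session, and that the time samples do not overlap. Under those simplifying assumptions the chance of catching any one incident equals the fraction of the session that is sampled. The figures in the example are hypothetical and are not taken from any study cited here.

```python
def prob_all_missed(sample_minutes, n_samples, session_minutes, n_incidents):
    """Probability that none of the incidents falls inside a sampled period."""
    if session_minutes <= 0:
        raise ValueError("session length must be positive")
    coverage = sample_minutes * n_samples / session_minutes
    if not 0 <= coverage <= 1:
        raise ValueError("samples cannot cover more than the whole session")
    # Each incident is missed with probability (1 - coverage), independently.
    return (1 - coverage) ** n_incidents


# Hypothetical plan: five two-minute samples in a 300-minute school day,
# during which two critical incidents occur.
p_miss = prob_all_missed(2, 5, 300, 2)
```

With only one-thirtieth of the day sampled, both incidents go unobserved on roughly 93 days in 100, which is why event-based records may serve better than time samples for rare behavior.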


Records of Professional Agencies


Quite a variety of records have been 
consulted by investigators interested 
in the origins and development of 
motives and affects. An interest in 
the jealousy of young children led 
Sewall (1930) to search the records of 
a clinic and a nursery school for infor- 
mation about competition among sib- 
lings. Bernstein (1955) consulted medi- 
cal records about the early feeding of 
his subjects, and Page (1941) perused 
the records of social agencies to learn 
about the early frustrations of children 
who were later classified as aggressive 
and withdrawn. School records provided
Sanford et al. (1943) with data
about emotionality and needs, such as
those for recognition, play, and nurturance.


Teachers and Professional 
Personnel 


INTERVIEWS, QUESTIONNAIRES, 
AND REPORTS 


Spitz and Wolf (1949) consulted 
nursing personnel concerning the fre- 
quencies of rocking and of genital play 
at different times during the first year. 
Sanford et al. (1943) conducted 
monthly interviews with teachers to 
obtain information about the needs of 
children in elementary school, and Hur- 
lock and Klein (1934) administered 
questionnaires to principals, teachers, 
and counselors about the “crushes” of 
boys and girls in high school. 

A Structured Interview with Special
Informants. Some investigators have
devised special interviewing methods,
which are structured to elicit very concrete
examples of the traits being investigated.
Flanagan’s critical-incident
technique (1954) is such a method. By
means of it, an interviewer can inter- 
rogate an observer about incidents that 
are important to the understanding of 
a particular topic. First, the observer 
is asked to make several judgments in 
order to establish the meaning of the 
topic. The relevance of different actions 
to a particular motive, for example, 
can be explored by such judgments. 
The informant is then asked to describe 
the actions in detail: the times and con- 
ditions, the persons involved, their lo- 
cations, their behaviors, and so forth. 

Turner (1948) used a similar method 
to obtain information about the altruism 
of children between the ages of 9 and 
16 years. The items of a questionnaire 
on altruism provided the basis for in- 
terviews with a special group of in- 
formants. These people were selected 
because they knew the subjects well, 
did not have any need to protect them, 
and were able to communicate the re- 
quired information. Items in the inter- 
view referred to seven classes of situa- 
tion, among which were competition 
with others, frustration, and personal 
threat. The aim of the interview was 
to obtain descriptions of concrete sit- 
uations encompassed by the definition 
of each item. On the basis of each de- 
scription, Turner classified the actions 
as egoistic, of doubtful meaning, or al- 
truistic with respect to the underlying 
motive. Scores on the test correlated
.55 and .47 with ratings of emotional
stability and .47 and .42 with freedom
from antisocial tendencies.


RATINGS 


Ascendance. It is not at all difficult
to train teachers to make reliable ratings,
since they are accustomed to making
comparisons among their pupils.
Jack (1934) obtained teachers’ ratings
of the ascendance of preschool children.
The subjects were rated on five-point
scales for such traits as insisting
on one’s own right to material, directing
the activities of companions, and
helping with the enforcement of the
group’s rules. The ratings correlated
.81 with the ascendance manifested in
an experimental situation.


Dependence and Aggression. Beller
(1955) asked teachers to rate preschool
children on dependence, independence,
and aggression. Most of the individual
scales had reliabilities above .80, a fact
that is attributable to the care with
which the categories were defined.
Each scale was introduced by a question
specifying the instrumental activity
believed to be an aspect of a particular
motive. The question was followed by
a paragraph describing examples of activity,
actions not in the category, and
other clarifying information. The scales
had seven points, the extremes of which
were very often and very rarely. Subsumed
under striving for dependence
were scales for such variables as seeking
help, seeking recognition, seeking
attention, and seeking to be near others.
For independence and achievement,
ratings were obtained of such reactions
as the derivation of satisfaction from
work, attempting to overcome an environmental
obstacle by oneself, and
itive in carrying out one’s 


States. To measure the insecurity of
fourth graders, Broida and
Thompson (1954) asked teachers to fill
out 12-point scales for such variables
as dependence, fear of things, reaction
to criticism, and responsiveness to emotional
stimulation. In a study of fear-arousing
communications Janis and
Feshbach (1954) first asked the teachers
of high-school students to select
subjects who were hyperemotional or
excitable. The investigators then compared
the responsiveness to different
communications of this extreme group
with the responsiveness of the less emotional
subjects.


CHECK LISTS 


Aggression and Dependence. Livson 
and Mussen (1957) had teachers fill out 
a daily check list and then independ- 
ently rank preschool children at the end 
of the week. Among the items on the 
check list for aggressive behavior were 
interference with others’ activities, 
teasing or insulting, quarreling, and 
physical attacks. Typical items on the 
check list for dependence were the 
seeking of physical contact, wanting to 
go home, “helpless” crying, and asking 
permission unnecessarily. The median 
of the reliability coefficients for ag- 
gression was .80, and the median for 
dependence was .48. To increase the 
low reliability of the pooled ranks for 
dependence, the investigators excluded 
those cases about which the observers 
showed most disagreement. 


Parents 


INTERVIEWS 


Early Training. Parents are often in- 
terviewed about their children’s behav- 
ior at home and about their important 
experiences in earlier years. Interview- 
ers have tended to concentrate on the 
parents’ techniques of child rearing, 
such as the timing of weaning, preferred 
methods of discipline, and methods of 
teaching responsibility for various 
household tasks. In a number of studies 
these methods have been traced to in- 
dices of social background, such as so- 
cial class, bureaucratization of father’s 
job, race, religion, and ethnicity (Davis 
and Havighurst, 1942; Miller and Swan- 
son, 1958; Sears, Maccoby, and Levin, 
1957). In some investigations parents’ 
methods have been related to the sub- 
sequent characteristics of the children. 
On questioning a group of mothers 
about the early history of their off- 
spring, Levy (1928) found a significant 
association between reported depriva- 
tion of normal sucking opportunities 
in babyhood and the later incidence of 
thumb sucking. Allinsmith (in Miller 
and Swanson, 1960) found a number 
of significant relations between the 
harshness of weaning and toilet training 
reported by mothers and their children’s 
moral standards in early adolescence. 
In Bernstein’s research (1955) mothers’ 
reports of coercive toilet training were 
significantly related to the anxiety that 
the children later manifested when 
separated from their mothers, to signs 
of immaturity in drawings of a person, 
and to the negativism expressed when 
asked to take some tests. 

Motives and Affects. Dameron (1955)
obtained reports from mothers about
their methods of teaching emotional
self-restraint, and Sewall (1930) sought
information about competition with
siblings. Sanford et al. (1943) conducted
free interviews with mothers
and also asked direct questions about
their children’s needs. Sears, Maccoby,
and Levin (1957) conducted interviews
with mothers and made inferences about
their children’s dependence, sex, aggression,
and conscience.

Pathological Traits. Mothers can
usually give reliable reports about the
presence of certain discernible symptoms.
Moncur (1955) asked mothers of
stuttering children to provide evidence 
of such difficulties as weeping, emo- 
tional vomiting, and nightmares. Long 
(1941) requested parents to fill out an 
anonymous questionnaire concerning 
such undesirable behavior as fears, 
dawdling, and boastfulness. Then he 
analyzed the age distributions of the 
reported reactions. Michaels and Good- 
man (1934) questioned parents of chil- 
dren between 6 and 16 years about the 
frequencies of such pathological traits 
as enuresis, thumb sucking, tantrums, 
and sleep disturbances.


PARENTS’ RECORDS 


Anger. In a study of the expression 
of anger in preschool children, Good- 
enough (1931) asked the mothers of 
her subjects to keep records of the fre- 
quency, duration, and causes of out- 
bursts. To get reliable data for each 
child, the investigator requested the 
mothers to keep records over a four-month
period. The outbursts were most
frequent at meals and during illnesses
and became less explosive and more focused
with increasing age.


Fear. Jersild and Holmes (1935) obtained
parents’ records of their children’s
expressions of fear. The range
of subjects’ ages varied between 1 and
8 years, and the records were kept
over a period of 21 days. In the description
of each incident a parent included
the place and time, the child’s
activities, the persons present, the apparent
causes of fear, and the child’s
reactions. Two judges agreed in 90 per
cent of their ratings of fear.
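
Agreement between two judges of the kind reported here is commonly expressed as simple percent agreement: the share of incidents to which both judges assigned the same rating. The Python sketch below shows that calculation. The source does not say how the original figure was computed, so this is an assumed method, and the category labels and data are invented for illustration.

```python
def percent_agreement(judge_a, judge_b):
    """Percentage of items on which two judges gave the same rating."""
    if len(judge_a) != len(judge_b):
        raise ValueError("both judges must rate the same items")
    if not judge_a:
        raise ValueError("no ratings supplied")
    matches = sum(1 for a, b in zip(judge_a, judge_b) if a == b)
    return 100.0 * matches / len(judge_a)


# Hypothetical ratings of ten recorded incidents by two judges.
judge_a = ["fear", "fear", "none", "fear", "none",
           "fear", "fear", "none", "fear", "fear"]
judge_b = ["fear", "fear", "none", "fear", "none",
           "fear", "none", "none", "fear", "fear"]
agreement = percent_agreement(judge_a, judge_b)
```

Percent agreement makes no allowance for matches that would occur by chance, so it tends to look more favorable when one rating category dominates the records.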

Children’s Questions. Davis (1933)
asked mothers to keep records of the
questions asked by children between the
ages of 3 and 12 years. Most of the 
questions referred to actions, but many 
suggested different kinds of motives. 
The records were analyzed in terms of 
the categories of people to whom the 
questions were addressed. The cate- 
gories included mothers, fathers, all 
other adults, children of the same age, 
older children, younger children, ani- 
mals, dolls, and no one in particular. 
Still needed is an exploration of the 
overt and symbolic content of ques- 
tions. Judging from the clinical litera- 
ture, such content offers a mine of in- 
formation about children’s motivational 
and affective states. 

Letters. In her study of social development
Isaacs (1933) exploited an unusual
source of information about children.
She analyzed letters from mothers
and nurses, who described various
kinds of behavior in children and asked
for practical advice. The letters provided
interesting evidence of the kinds
of motives and affects that create anxieties
in adults and that may provide
foci of difficulty in children’s relationships
with authorities.


Peers


INTERVIEWS


How can one get the most valid information
about a subject from other
children who know him? The interview
has a number of serious drawbacks
with juvenile informants, especially
when the research pertains to
affects and motives. These are loaded
topics, and the average child lacks the
objectivity to converse about them. His
anxiety may be considerable if the questions
pertain to his friends or even his
schoolmates. Children who become
anxious often become blocked or incapable
of visualizing the interviewer’s
purpose. For such reasons, the interview
has been used with decreasing
frequency in recent years, and primarily 
in studies of adolescents. 


SOCIOMETRIC CHOICES 


The sociometric method is probably 
the commonest current method of ob- 
taining information about children from 
their peers. The assets of this method 
are best illustrated by examples of its 
use in research.

Pictures. When children are very 
young, they cannot be depended on to 
remember all the people in a social 
group. Lists are no help as reminders 
when subjects have not yet learned to 
read. To overcome these difficulties, 
Biehler (1954) used pictures. Each subject was asked to look at three pictures,
each of which portrayed two children at play. In every picture the children
lacked heads. After putting a picture 
of the subject’s head on one of the fig- 
ures, the investigator asked him to se- 
lect a picture of another child’s head 
for the other figure. Choices by chil- 
dren showed agreements of 72 and 76 
per cent with time samples of the 
choices made during free play. 
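
Agreement figures of this kind are simple percentages. The following is a minimal sketch of how such a figure could be tallied; the function name, the data layout, and the sample records are hypothetical, and the source does not describe the exact computation that was used.

```python
def percent_agreement(picture_choices, observed_choices):
    """Percentage of children whose picture choice matches the companion
    observed in free play. Both arguments map a child's name to the name
    of the chosen companion (a hypothetical data layout)."""
    shared = [child for child in picture_choices if child in observed_choices]
    if not shared:
        raise ValueError("no children appear in both records")
    matches = sum(1 for child in shared
                  if picture_choices[child] == observed_choices[child])
    return 100.0 * matches / len(shared)


# Invented example records, for illustration only.
picture = {"Ann": "Bob", "Bob": "Ann", "Cal": "Dee", "Dee": "Ann"}
observed = {"Ann": "Bob", "Bob": "Ann", "Cal": "Dee", "Dee": "Cal"}
print(percent_agreement(picture, observed))  # 3 of 4 match -> 75.0
```

A percentage of this sort makes no allowance for chance agreement, which grows as the group becomes smaller.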

Verbal Requests. Gray (1957) did 
not need to use pictures with her chil- 
dren, who were in the sixth and seventh 
grades. She obtained information about 
aggression and withdrawal by means of 
a verbal sociometric technique. To 
identify the most aggressive children, 
she asked each subject to name the 
people who broke the rules of the 
school and of games. To identify with- 
drawing children, she asked about the 
boys and girls who stayed out of games. 
As measured by repetitions of the 
choices, the reliability of aggression was 
significant for the total group. 

Verbal Portraits. To obtain a socio- 
metric rating, the examiner sometimes 
presents a verbal portrait of a particu- 
lar type of child and then asks the sub- 
ject to name people in his group who 
fit the portrait. Lesser (1957) employed 
this technique to study direct and in- 
direct aggression in children between 
the ages of 10 and 13 years. Havighurst 
and Taba (1949) presented their 16- 
year-olds with descriptions approxi- 
mately a paragraph in length and asked 
the subjects to name the portrayed 
persons. 


ASSETS OF SOCIOMETRIC 
CHOICES 


Sufficient examples of the sociometric 
method have been presented to illustrate the reasons for its popularity.
First, the descriptions of traits are pro- 
vided by the investigator and not the 
respondent. This procedure relieves the 
respondent of the burden of verbaliza- 
tion. All he has to do is match each 
description with one or more persons. 
Next, the respondent is asked not to 
discuss a specific child but to pick the 
people in a group with certain attrib- 
utes. Hence he is not likely to feel that 
he is telling tales about a particular per- 
son. He is also providing information, 
which, when combined with the reac- 
tions of others in the group, will enable 
the investigator to compare a subject’s 
ranking with those of other children. 
The data consist of choices or of rat- 
ings, which make them easier to ana- 
lyze than the more varied content of an 
interview. In many cases the child is 
motivated to give a valid description of 
others because his responses may deter- 
mine who will sit next to him in class 
or who his partner will be in play ac- 
tivities. 
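
The ease of analysis can be illustrated with a short sketch. It assumes that each respondent names the peers who fit one description and that a child's score is the number of respondents naming him; the function name, the data layout, and the sample names are hypothetical and are not taken from any of the studies cited.

```python
from collections import Counter


def nomination_ranking(nominations, group):
    """Count how often each child is named and rank the group, most-named first.

    nominations: dict mapping each respondent to the list of peers he named
                 for one description (hypothetical layout).
    group: list of all children, so that unnamed children receive zero.
    """
    counts = Counter({child: 0 for child in group})
    for respondent, named in nominations.items():
        for child in set(named):        # a respondent counts once per child
            if child != respondent:     # ignore self-nominations
                counts[child] += 1
    # Sort by descending count, then alphabetically to break ties.
    return sorted(counts.items(), key=lambda item: (-item[1], item[0]))


# Invented nominations for one description, for illustration only.
group = ["Ann", "Bob", "Cal", "Dee"]
nominations = {
    "Ann": ["Bob", "Cal"],
    "Bob": ["Cal"],
    "Cal": ["Bob"],
    "Dee": ["Bob"],
}
print(nomination_ranking(nominations, group))
# [('Bob', 3), ('Cal', 2), ('Ann', 0), ('Dee', 0)]
```

Because every child is scored on the same count, a subject's standing can be compared directly with that of any other member of the group.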


SOME GENERAL IMPRESSIONS OF 
RESEARCH ON MOTIVATION 
AND AFFECT 


A survey of methods would be incomplete if it did not list common pitfalls
and sources of confusion. Foremost among the difficulties is a lack of
consensus about conceptions of motivation and affect. Another vexing problem
concerns the methods of validating tests of complex phenomena, which can
be manifested in many different kinds of behavior. Finally, there are questions
about the choice of subjects; particularly, the extent to which characteristics
such as age, sex, social class, and availability of subjects should determine the
selection of a sample.


Problems of Definition 


Although this is a volume on methods, it is necessary to raise certain
theoretical issues that are pertinent to the design of investigations and
interpretation of results. Some of the most perplexing problems in studies of
motivation and affect arise because of a lack of consensus about definitions.
Some conceptions of motivation are so broad that it is difficult to tell what
they exclude. A psychologist who cannot understand a subject’s behavior, for
example, often assumes that more information about his past experiences or
his current unconscious reactions would reveal an underlying motive. Such a
definition of motivation covers virtually all behavior but reflex action.

The problem of definition is not lim- 
ited to the generic meaning of motiva- 
tion. Names of specific motives seem 
as numerous and as arbitrary as the 
names of stars. In line with differing 
theoretical assumptions about compo- 
nents and origins of motives, they have 
been identified in psychological, behav- 
ioral, and phenomenological terms. 
Various concepts refer to such aspects 
of motives as hereditary characteristics, 
affects, incentives, objects, acts, and 
goals. Some concepts are as specific as 
needs for money and for a friend's 
company; others are as general as the 
need for consistency and the death in- 
stinct. 


DEFINITIONS AND CODING
OF AGGRESSION


The lack of consensus about nomenclature and theoretical premises creates
some practical difficulties in the planning and interpretation of research,
difficulties that are all too evident in the literature on aggression. Sometimes
the same term has been applied to concepts that were obviously intended to
have different meanings. In the Fels Scales (Richards and Simons, 1941), for
instance, the reactions subsumed under aggression are closer in meaning to
what is ordinarily understood as initiative. Sometimes different terms have
been applied to the same concept. Many of the items in Jack’s code for
ascendance (1934) would ordinarily be assigned to the motive of aggression.
Often a form of behavior, which different psychologists are ready to define
in the same way, is given different underlying interpretations. Negativism
has been interpreted as an expression of aggression by Fisher (1934), of
shyness by Goodenough (1929), and of anxiety by Reynolds (1928).

Investigators have shown some agreement in their connotative definitions of
aggression but not enough to establish 
a common position with respect to 
denotative definitions and operations. 
The latter have tended to vary with 
the level of abstraction, the scope of 
the investigator’s conception, and the 
extent to which it was conceived in 
terms of observable behavior rather 
than an underlying disposition. Frenkel-Brunswik’s definition (1942)
encompassed one of the broadest ranges of acts. She viewed aggression in terms of
a goal, the desire to deprive others. Ag- 
gression, she thought, refers not to be- 
havior but to an underlying predisposi- 
tion, which can be expressed in many 
kinds of acts. She did not think that 
the motive could be measured validly 
by ratings of overt behavior. Her esti- 
mates of the motive consisted of intui- 
tive judgments based on many kinds of 
acts in a variety of situations. 

Frenkel-Brunswik considered a 
broader range of acts than did Allin- 
smith (in Miller and Swanson, 1960), 
who concentrated on the directness with 
which aggression is expressed. Her coding system consisted of categories of
directness, each of which was defined 
in terms of specific acts. Still more re- 
stricted was the definition of Mussen 
and Naylor (1954), whose code gave 
more weight to acts calculated to in- 
flict injury on an organism or its sur- 
rogate. The instrumental acts used in 
Beller’s definition (1957) represented 
the most direct forms of aggression, 
although he provided for the possi- 
bility of displacements. 

Disagreements about the definition 
of a motive have also led to differences 
in the codes for what was purported to 
be the same concept. The code of ag- 
gression employed by Graham, Char- 
wat, Honig, and Weltz (1951) did not 
include hurting oneself; Mussen and 
Naylor’s did. Mussen and Naylor did 
not include reactive helpfulness in their 
code; Allinsmith did. Only the most 
speculative comparisons of the studies 
of aggression are possible when they 
have differed in their operational defi- 
nitions. 
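
The practical consequence of such differences can be shown with a small sketch. The two sets of categories below are hypothetical stand-ins for a code that omits self-injury and one that includes it; they are not the actual codes of the studies cited, and the observation record is invented.

```python
# Hypothetical category labels; the published codes were more detailed
# than these two illustrative sets.
CODE_WITHOUT_SELF_INJURY = {"hitting", "verbal attack"}
CODE_WITH_SELF_INJURY = {"hitting", "verbal attack", "hurting oneself"}


def aggression_score(observed_acts, code):
    """Count the observed acts that a given code treats as aggression."""
    return sum(1 for act in observed_acts if act in code)


# Invented observation record for one child.
acts = ["hitting", "hurting oneself", "hurting oneself", "sharing"]

narrow = aggression_score(acts, CODE_WITHOUT_SELF_INJURY)
broad = aggression_score(acts, CODE_WITH_SELF_INJURY)
print(narrow, broad)  # 1 3
```

The same record yields a score of 1 under one code and 3 under the other, so that scores from the two hypothetical studies could not be compared without recoding.
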

Development of Criteria. When psychologists disagree about definitions,
there can be no generally acceptable 
criteria for validating tests. Even if 
there were agreement, validation would 
still be hampered by the very nature 
of motives. They are conceived not as 
acts but as dispositions, which can be 
expressed in many ways. Frequency of 
fighting would be a poor criterion for 
a test of aggression, a motive that can 
also take overt unbelligerent forms 
such as excessive friendliness and depression. A better criterion would
encompass the different manifestations of
the motive, indirect as well as direct. 

Arousal. Some investigators have at- 
tributed face validity to tests of motives 
if they were given before and after the 
motives were actually aroused. The data 
consisted of discrepancies between two 
scores. Such a technique also has its 
problems, some of which can be illus- 
trated by a hypothetical study of ag- 
gression. The investigator must create 
conflict about the aroused anger or it 
will be turned against him instead of be- 
ing expressed on the test. If the guilt is 
very strong, the aggression may be dis- 
torted defensively so that there are few 
direct manifestations of it on the test. It 
is not easy to succeed in the delicate task 
of arousing just the right amount of 
anger and guilt. A second difficulty is 
created by unanticipated reactions to 
the provocation. Some children may 
not become angry but may welcome 
the punishment as atonement for guilt, 
and some may react with masochistic 
pleasure. 

Construct Validity. Because of the difficulty of devising good independent
criteria, investigators have been showing an increasing tendency to justify
tests by construct validity. The fact that different sets of empirical results
derived with an instrument are consistent with an underlying theoretical
system is taken as evidence not only in support of the theory but also of the
probable validity of the instrument. Of course, there are pitfalls in the use of
such evidence. A few findings may be consistent with some of the investigator’s
theoretical premises but may have meanings that are related less to his
theoretical scheme than to others that he did not consider. This lack of a
necessary connection may help to account for the difference in frequencies
of significant Rorschach validities obtained by the different methods. A
survey of the Rorschach literature by Levy
and Orr (1959) reveals that many more 
significant results were reported for 
construct than for predictive validity. 
The chances were better than two to 
one that the hypotheses of the study 
would be supported by evidence in 
favor of construct validity. With an 
independent criterion, however, the 
odds were almost two to one against 
significant results. The less favorable 
results with predictive validity may 
also reflect the difficulty of establishing 
good criteria for complex reactions to 
the Rorschach Test, such as color shock 
(Baughman, 1958) and extratension. 


VALIDITY AND THE RELATION- 
SHIP BETWEEN SUBJECT 
AND EXAMINER 


Ideally, an investigation should be experienced in the same way by all
subjects. Careful planning is needed to approximate that ideal in studies of
motives and affect, since there are many sources of variance. The interaction
between subject and examiner is a frequent source of uncontrolled error. If
the subject is ashamed of some of his associations, for example, he may be
tempted to modify them in the direction that will win the investigator’s
respect. If the subject has been removed from a cherished activity, such as
swimming, he may become covertly uncooperative.

Even subtler interpersonal influences may affect a subject’s responses. A
number of investigations indicate that 
results can differ as a function of such 
variables as the sex of examiner, his 
overt and covert personality traits, the 
subject’s private definition of the task, 
and his involvement in it (Miller, 1953). 

Validity of Methods of Manipula- 
tion. Despite the many possible inter- 
personal sources of error, very few of 
the experimenters who have manipu- 
lated their subjects have attempted to 
find out whether the methods had the 
anticipated effects. Attempts to frustrate subjects or to vary their
involvement in different tasks, for example,
have seldom been followed by inquiries 
into degrees of resultant frustration or 
involvement. When the hypotheses 
about the effects of frustration have 
been confirmed, the validity of the 
techniques of manipulation has usually 
been taken for granted. There are some 
notable exceptions to this practice. To 
judge the effects of their technique, Barker, Dembo, and Lewin (1941)
compared regressive reactions with amounts of frustrated behavior elicited by two
degrees of frustration. To judge the effects of two sets of instructions
intended to produce different degrees of
ego involvement, Calden and Cohen 
(1953) obtained reports of involvement 
from subjects who had been given the 
different instructions. 


Selection of Subjects 


AVAILABILITY AND REPRE- 
SENTATIVENESS OF SPECIAL 
POPULATIONS 


Often studies that have been care- 
fully planned in other respects have 
produced questionable results because 
too little time was devoted to consid- 
eration of the sample. A review of de- 
velopmental studies of motives and af- 
fects conveys the impression that some 
populations were selected primarily be- 
cause they were available. Staffs of 
psychology departments have been in- 
clined to employ easily obtained cap- 
tive populations, usually college sopho- 
mores or children from nursery schools 
in college towns. Such groups have 
tended to have high IQ’s and to come 
from the middle class. 

Availability of a sample is no small 
advantage in conducting research.
When an investigator does not have 
to spend much time in finding children, 
he can devote most of his efforts to the 
planning of the study and the analysis 
of data. But availability can be given 
undue weight as a selective criterion. 
Subjects in a college nursery school are 
not representative of the total popula- 
tion of children, and it is often hazard- 
ous to generalize results obtained from 
such samples. It is probable, for example, that their average need for
achievement will be quite high. It is also probable that they will favor indirect forms
of aggression more than a group of 
children from the working class will. 
The acts that one observes in a college 
nursery often have a different signifi- 
cance than they would in a nursery for 
children from the working class. The 
son of a white-collar worker who ha- 
bitually resorts to direct forms of ag- 
gression is likely to be more pathologi- 
cal than a child from the working class 
who behaves in the same way. 


HOMOGENEITY OF SPECIAL 
POPULATIONS 


The college sophomore and the child 
in the college nursery have an addi- 
tional disadvantage in research—their 
homogeneity in certain traits. Because 
of the narrow ranges, the probability 
is small that such traits will be related 
significantly to antecedent variables. It 
is not surprising, for example, that so 
many studies of the levels of aspiration 
of college sophomores reveal few sig- 
nificant associations with antecedent 
variables. The narrow range of ambi- 
tions in such a sample may be imagined 
from the fact that people in the second 
year of undergraduate school have been 
ambitious enough to complete elemen- 
tary school, get good enough grades to 
be admitted to academic secondary 
schools, do well enough there to be 
graduated and to be admitted to uni- 
versities, and, finally, to pass enough 
courses in their first year to attain the 
status of sophomore. In sum, the in- 
vestigator increases the probability of 
nonsignificant results when he restricts his subjects to those in a particular
social class or within a narrow range of intelligence.
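
The effect of a narrow range on correlations can be demonstrated with simulated scores. The sketch below is an illustration under stated assumptions, not an analysis of real data: it assumes a normally distributed trait, an outcome equal to the trait plus independent error, and a "selected" sample made up of the top tenth of the trait distribution. All names and numbers are hypothetical.

```python
import random


def pearson_r(xs, ys):
    """Product-moment correlation between two equally long lists of scores."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / (var_x * var_y) ** 0.5


rng = random.Random(0)   # fixed seed so that the run is repeatable
n = 5000

# Simulated population: the outcome depends on the trait plus random error.
trait = [rng.gauss(0, 1) for _ in range(n)]
outcome = [t + rng.gauss(0, 1) for t in trait]

# Homogeneous sample: only the top tenth of the trait distribution.
cutoff = sorted(trait)[int(0.9 * n)]
selected = [(t, o) for t, o in zip(trait, outcome) if t >= cutoff]
sel_trait = [t for t, _ in selected]
sel_outcome = [o for _, o in selected]

r_full = pearson_r(trait, outcome)
r_restricted = pearson_r(sel_trait, sel_outcome)
print(round(r_full, 2), round(r_restricted, 2))
```

Under these assumptions the correlation in the selected sample is markedly lower than in the full population, although the underlying relationship is identical in both.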


AGE LEVELS 


How old should one’s subjects be? Obviously, the answer depends on one’s
hypotheses. In selecting children to test his hypotheses, the investigator would
do well to consider some of the characteristics of children at different stages
of development. Subjects at each stage have their advantages and
disadvantages. Children in the preschool and even early school years, for example,
have the assets of being relatively naive and credulous. As a result, they accept
at face value many instructions and situations that might make older children
skeptical or uncooperative. It is also relatively easy to establish relationships
with young children and to stimulate their interest in activities from which
motives and affects can be inferred. In the nursery school and the initial grades
of the elementary school the curricula are flexible, and most teachers are not
reluctant to spare children for an experiment during school hours. Finally,
young subjects are often easy to obtain because they are members of captive
populations.

Young children also have their limi- 
tations. First, they cannot read. Hence 
they cannot be given paper-and-pencil 
tests in a group, and research with them 
is often more time consuming than research with older groups. Also
contributing to the complication of obtaining data from young subjects is their
limited ability to concentrate on a topic. The investigator has to be sensitive to
the duration of each child’s span of at- 
tention and to the periods when he is 
attending. Considerable time must be 
allowed, therefore, for the collection of 
certain kinds of information that re- 
quire the subject’s participation. Third, 
young children are not accustomed to 
describing their feelings verbally, par- 
ticularly to strangers. A few shy chil- 
dren cannot ever relax with a stranger. 
Most subjects can be cooperative, even 
spontaneous, if they have a few sessions with the observer. Finally, their
limitations in the ability to communicate their feelings verbally make it
necessary to study their affects and motives indirectly.

Adolescent subjects can read and can 
concentrate on a task for some time, 
and many of them are accustomed to 
introspecting and expressing themselves 
verbally. Hence they can often be seen 
in groups, given written tests, and asked 
questions about complex experiences. 

It is somewhat difficult to obtain ade- 
quate samples of older adolescents in 
the working class because they have 
left school. A difficulty that arises in 
the testing of adolescent subjects is 
their occasional lack of cooperation. 
Their negativism may take such forms 
as ambivalence, rebelliousness, cynicism, and sophisticated boredom. The
negativism is only one manifestation of a general increase in certain conflicts. In
adolescence children are trying to develop new identities. They cannot
express their needs and affects as children, nor have they mastered the more
mature forms of behavior. Sex and ambition and aggression represent
perplexing problems and are connected with emotional disturbances. Subjects in
their teens are often understandably 
reticent about discussing certain affects 
and motives. 

Preadolescent children, between 9 
and 13 years, deserve more attention 
from psychologists than they have re- 
ceived thus far. Most children of these 
ages still have the naiveté and the im- 
maturity of more youthful subjects. 
The writer has found them very credu- 
lous, and very cooperative after receiv- 
ing chocolate bars. Since attendance at 
school is still compulsory, the popula- 
tions of children in these intermediate 
years contain adequate proportions of 
potential subjects from all social classes. 
Children in this age group read and 
possess a fair amount of skill in verbal 
communication. Hence many of them 
can be given paper-and-pencil tests in
groups.


The Current State of 
Developmental Research 


The literature on motivation and af- 
fect leaves one in a far from pessimistic 
frame of mind. Of the more recent 
developments, the most striking is the 
accumulation of a great variety of 
tested methods. Also impressive are the 
manifestations that each generation of 
psychologists has benefited from the 
achievements and errors of their pred- 
ecessors. Many of the techniques de- 
scribed in the last decade, for instance, 
represent the achievement of a double 
goal that was seldom attained by earlier investigators. Most of the
techniques have permitted objective measurements of results without
oversimplifications of the motives and affective states that were being studied.

American investigators have revealed 
increasing tendencies to define concepts 
with some rigor and to justify operations in terms of theoretical and
developmental considerations. A number of studies have begun to unravel some of
the current tangle of theoretical terms 
and premises. It is now apparent, for 
example, that independence and de- 
pendence are not opposites (Beller, 
1955; Heathers, 1955). The quality of 
current developmental research on mo- 
tivation and affect reveals a growing 
competence with both theory and 
methodology. It is a competence that 
promises a steady progress toward the 
goal of establishing empirically tested 
theoretical systems. 


This review of methods for the appraisal of personality characteristics in
children is distinguished by three fea- 
tures: it is variable-oriented, child- 
oriented, and theory-oriented. The first 
feature refers to the fact that major 
consideration is given primarily to those 
procedures that have been explicitly 
designed for measuring particular vari- 
ables. Generalized techniques for the 
appraisal of personality in toto, such as 
the CAT, the Rorschach, and the 
World Test, are not treated here, espe- 
cially since they are dealt with else- 
where in this book. The review is child- 
oriented in that we have restricted our- 
selves only to those methods that have 
been specifically devised or adapted for 
use with children. Thus we make only 
passing references to procedures that
were designed primarily for adults but
happen also to be applicable to adoles- 
cents or older youngsters. Finally, our 
approach is theory-oriented out of ne- 
cessity. We found that we could not 
evaluate techniques for appraising per- 
sonality in children without first ana- 
lyzing conceptually both the kinds of 
variables that investigators wished to 
assess and those that they were in fact 
measuring. 

Even when one restricts oneself only 
to those methods of personality assess- 
ment that meet our criteria of being 
variable-oriented and child-oriented, 
the number of “eligible” traits and tech- 
niques is far beyond the scope of a 
single chapter. Accordingly, we have 
resorted to the usual expedient em- 
ployed by the researcher confronted 
with an impossibly large universe— 
sampling. Our sample in this instance 
is “stratified” rather than random; spe- 
cifically, we have selected for intensive 
consideration three variables—aggres- 
sion, anxiety, and motivation for 
achievement—which represent some- 
thing of the range of personality char- 
acteristics that have been measured in 
children and each of which has been 
studied by a variety of techniques. 

The chapter is organized in five ma- 
jor sections: 


1. The first presents what is intended as a theoretically neutral conceptual
analysis of types of personality characteristics.

2. Here we consider the major strategies employed for the analysis of
personality characteristics in children, the assumptions on which these strategies
are based, and the dangers inherent in these assumptions.

3. A statement of the criteria employed in selecting variables and techniques
for consideration and an outline of the steps to be followed in evaluating
each variable and the techniques employed for its measurement.

4. The main body of the chapter analyzes the major concepts and techniques
currently employed in the measurement of three representative variables:
aggression, anxiety, and motivation for achievement. This evaluation
is carried out in the light of the concepts and criteria outlined in the first
three sections. Implications for use of the techniques in research and practice
are examined.

5. An examination of the general implications suggested by the intensive
analysis of the selected variables and techniques and directions for future
development.


THE NATURE OF PERSONALITY 
CHARACTERISTICS 


The conceptual framework we have 
employed for the analysis and classi- 
fication of personality characteristics is 
by no means new; rather it is a syn- 
thesis gleaned from a variety of sources, 
the most important being the contributions of Allport (1937), Murray (1938),
Sanford et al., Goodenough (1949) and, most recently, Baldwin
(1955, 1958). We may begin by noting that the term “personality
characteristic” is used, explicitly or implicitly, to refer to a wide range of properties
of a person. Nevertheless, these diverse uses have one feature in common:
underlying all is some kind of dispositional construct, in Carnap’s (1936,
1937) sense of this term. To put the
matter more precisely, a personality 
characteristic ordinarily implies a tend- 
ency toward behavior associated with 
a particular person under a given set of 
conditions. Most commonly, this behavioral tendency refers to a disposition
to act on the part of S, the person himself. For example, Mary cries whenever
she sees another child crying; Tom avoids the company of boys his own
age. For reasons that we shall indicate
later, we also wish to consider as a 
personality characteristic any tendency 
of S to evoke a particular response in 
others under a given set of conditions. 
For example, John is picked on by boys of his own age; Tom is “mothered” by
females older than himself. Altogether,
we have found it necessary to distin- 
guish three types of personality char- 
acteristics, which we designate as re- 
sponse tendencies, motives, and stimulus 
characteristics. The first two refer to 
dispositions toward action on the part 
of S himself, the last to tendencies to 
evoke responses in others. 


RESPONSE TENDENCIES 


Here it is the specific behavior elic- 
ited that is relatively invariant. Every 
time the response tendency is activated, 
the person tends to do exactly the same 
thing. There is no modification of re- 
sponse in order to achieve a particular 
goal or end-state. For example, Baby Sam cries when not fed for several
hours; Mary reacts impulsively in new situations; John shows anxiety signs
when separated from his mother. Many response tendencies may, in fact, bring
about a particular end-state: the baby’s crying will often lead the mother to
feed it. The crucial criterion for a response tendency, then, is not whether
the behavior is adaptive or nonadaptive (it may be either), but whether the
particular behavior evoked tends to be the same under similar stimulus
conditions. The reader will recognize that
many emotional characteristics of the 
type that Allport (1937) subsumed un- 
der the term temperament, such as en- 
ergy level, speed of response, irritabil- 
ity, and mood, fall under the rubric of 
response tendencies as here defined. 


MOTIVES 


With motives, it is the goal or end-state that is invariant rather than
a specific mode of response; the person may resort to a variety of actions to
achieve the goal. For instance, Henry strives to excel in any activity in which
he engages; Mary tries to comfort anyone who is hurt or unhappy. In both
of these examples the particular acts in which the child engages may differ
from one situation to the next. On one occasion Mary may feed a hungry dog,
on another, get a Band-Aid for a hurt child, and, on a third, perhaps even
fight a bully attacking other children. The consistency manifested is with
respect to the particular end-state, that of relieving the pain or discomfort of
others. This goal or end-state may be negative as well as positive; for
example, Tom avoids the company of girls.

In the case of motives, then, the spe- 
cific behavior manifested constitutes an 
instrumental act that may vary mark- 
edly from one situation to the next. A 
motive, therefore, cannot be inferred 
from a single piece of behavior, since
both response tendencies and motives 
may lead to the same specific act. Thus 
the crying of a hurt child may reflect 
simply the disposition to cry in response 
to pain or may represent an instrumen- 
tal act, in the service of a dependency 
motive, aimed at getting help from 
others. To establish the existence of one 
or the other type of disposition requires 
additional evidence either that the child 
tends to cry in response to pain in a 
variety of situations irrespective of the 
consequences or that he consistently 
engages in a variety of behaviors hav- 
ing in common the feature that they 
elicit attention and assistance from 
others. Lacking such additional evi- 
dence, it is easy to err by inferring the 
existence of one type of disposition 
when only the other in fact exists. As 
we shall see when we begin to look at 
specific techniques, the readiness to as- 
sume motives where only response 
tendencies are actually present is par- 
ticularly common. 
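
The logic of this distinction can be put in the form of a deliberately simple decision rule. The sketch below is our own illustration rather than a procedure drawn from the literature: it assumes that each observed episode has been recorded as a pair consisting of the act and the end-state that followed, and the function name, labels, and sample episodes are hypothetical.

```python
def classify_disposition(episodes):
    """Suggest which kind of disposition a set of episodes is consistent with.

    episodes: list of (act, end_state) pairs observed in different
              situations (hypothetical layout).
    """
    if len(episodes) < 2:
        return "insufficient evidence"   # a single act decides nothing
    acts = {act for act, _ in episodes}
    end_states = {end for _, end in episodes}
    if len(acts) == 1 and len(end_states) > 1:
        return "response tendency"       # same act whatever the consequence
    if len(acts) > 1 and len(end_states) == 1:
        return "motive"                  # varied acts, one invariant end-state
    return "ambiguous"


# Invented records, for illustration only.
crying = [("cries", "receives help"), ("cries", "is ignored")]
comforting = [
    ("feeds hungry dog", "discomfort relieved"),
    ("fetches bandage", "discomfort relieved"),
    ("fights bully", "discomfort relieved"),
]
print(classify_disposition(crying))                        # response tendency
print(classify_disposition(comforting))                    # motive
print(classify_disposition([("cries", "receives help")]))  # insufficient evidence
```

Real observations are seldom so tidy; the point of the rule is only that the inference requires several episodes and attention to both the act and its outcome.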


STIMULUS CHARACTERISTICS 


The distinction between response 
tendencies and motives on the one hand 
and stimulus properties on the other 
parallels Allport’s differentiation (1937) 
between “biophysical” and “biosocial” 
characteristics. In the belief that the 
former are no less social than they are 
physical and that the latter are as much 
physical as social, we have eschewed 
Allport’s terminology and substituted 
our own. One can argue, as Allport 
does, that to view biosocial character- 
istics as aspects of personality is ille- 
gitimate, since they refer primarily not 
to attributes of S but to the effect of
these attributes on others. We never- 
theless wish to retain the concept of 
stimulus characteristics in our analysis 
for three reasons. First, there is the 
purely practical consideration that a 
number of widely used techniques of 
personality assessment do in fact meas- 
ure primarily not attributes of S but 
the effect of these attributes on others 
(for example, sociometric indices of ac- 
ceptability). Second, there is a theoreti- 
cal argument: much of human behav- 
ior, being social in character, implies 
reciprocal expectation and response. 
When, for example, we describe a child 
as influential or irritating, we are re- 
ferring not merely to characteristics of 
S but also to the response that S is 
likely to evoke from others. The prin- 
cipal basis for wishing to maintain the 
distinction, however, is methodologi- 
cal, since the failure to differentiate be- 
tween S’s own behavioral tendencies 
and the responses that he elicits from 
others can contribute to ambiguity and 
error in present-day procedures for the 
appraisal of personality characteristics, 
especially in children. 

Having described three major types 
of personality characteristics and em- 
phasized the differences between them, 
we would now call attention to cer- 
tain features that they share in com- 
mon. First, since all three are disposi- 
tions that are manifested only under a 
given set of circumstances, the fre- 
quency of overt expression depends in 
part on the number of times the person 
is subjected to these instigating condi- 
tions. Thus some response tendencies, 
such as smiling, may be evoked many 
times in the life of a person; others,
like fainting under stress, only once or
twice. The frequency with which an 
act occurs, therefore, is not exclusively 
a property of the disposition but also 
a function of the situation. A disposi- 
tion is no less a disposition if it occurs 
only once (or perhaps never) in the 
course of a lifetime. Second, although 
from a theoretical point of view a dis- 
position may remain a property of the 
person for only a short period of time 
(e.g., a matter of hours or minutes),
interest customarily focuses on dispositions
that are relatively enduring (i.e.,
they last for a period of years). Finally, 
and most important, personality char- 
acteristics, being dispositional con- 
structs, cannot be observed directly; 
their existence must be inferred from 
certain operations performed under 
specified conditions. These operations 
and inferences constitute the major 
strategies that underlie specific tech- 
niques of personality assessment. We 
turn next to a consideration of these 
strategies and the assumptions on which 
they are based. 


STRATEGIES AND UNDERLYING 
ASSUMPTIONS IN THE ASSESSMENT 
OF PERSONALITY CHARACTERISTICS 


The Problem of Construct Validity. 
In attempting to measure behavioral 
dispositions, the investigator is con- 
fronted with an issue which, though it 
has haunted psychology since its be- 
ginnings, has been given systematic 
formulation only in recent years. This 
is the problem of “construct validity,” 
of inferring “the degree to which the 
individual possesses some trait or quality
(construct) presumed to be reflected
in the test performance” (APA
Technical Recommendations, 1954). 
The concept of “construct validity” 
was first introduced formally in the 
report of the APA Committee on Psy- 
chological Tests (APA Technical Rec- 
ommendations, 1954). The most defi- 
nitive analysis, however, appears in the 
subsequent paper by Cronbach and 
Meehl (1955).¹ Echoing the Committee
report, these writers take issue with
the traditional view that a measuring 
technique is adequately validated sim- 
ply by correlating it against a single 
external criterion. Such a procedure is
appropriate, they assert, only when the 
investigator is interested solely in pre- 
dicting the criterion behavior and has 
“no concern whatsoever with the type 
of behavior exhibited in the test.” 
Whenever “the trait or quality under- 
lying the test is of central importance,” 
we are faced with the problem of “con- 
struct validity,” and the single criterion 
approach is no longer sufficient nor absolutely
necessary. Instead, the investigator is
called upon to marshal all available
evidence, both logical and empirical,
in support of the argument that
the test measures what it purports to 
measure. Correlations with external 
criteria are but one possible source of 
corroborative evidence. Cronbach and 
Meehl cite five common methods of 
construct validation. 


¹ A recent review of principles and methods
of construct validity is provided by Clark
(1959).


1. Correlational matrices and factor
analysis. If two tests are presumed to
measure the same construct, a correlation
between them is predicted. . . . A matrix
of intercorrelations often points out profitable
ways of dividing the construct into
more meaningful parts, factor analysis being
a useful computational method in such
studies.

2. Group differences. If our understand- 
ing of a construct leads us to expect two 
groups to differ on the test, this expecta- 
tion may be tested directly. . . . 

3. Studies of internal structure. For 
many constructs, evidence of homogeneity 
within the test is relevant in judging val- 
idity. . . . Negative item-test correlations 
may support construct validity provided 
that the items with negative correlations 
are believed irrelevant to the postulated 
construct and serve as suppressor variables. 

4. Studies of change over occasion. The 
stability of test scores (“retest reliability”) 
may be relevant to construct validation. 
Whether a high degree of stability is encouraging
or discouraging . . . depends
upon the theory defining the construct. . . .
More powerful than the retest
after uncontrolled intervening experiences
is the retest with experimental intervention
. . . the experimenter validates
by determining whether the test can detect
induced intraindividual differences.

5. Studies of process. One of the best 
ways of determining informally what ac- 
counts for variability on a test is the ob- 
servation of the person’s process of per- 
formance (pp. 287-289). 


The foregoing list of methods is by 
no means exhaustive. Particularly per- 
tinent to the validation of personality 
measures in children, for example, is the 
demonstration that the variable being 
measured shows developmental trends 
and antecedents that are consistent with 
the theory of the nature and genesis of 
the variable. The general strategy of 
construct validation is well summarized 
by the authors of the APA Technical 
Recommendations (1954) when they 
state, “Essentially, in studies of con- 
struct validity, we are validating the 
theory underlying the test” (p. 14).

Construct Validity of Behavioral Dispositions.
The general problems of construct
validity are further complicated
when the characteristic being assessed 
is a behavioral disposition, for this type 
of construct requires somewhat more 
involved operations and attendant as- 
sumptions. These requirements for es- 
tablishing the existence of a behavioral 
disposition are easier to state than to 
meet. Since a personality characteristic 
involves a tendency toward action associated
with S under given conditions,
it is necessary to show that the particular
behavior does in fact occur whenever
S is found or placed in specified
conditions. Ideally, this requirement is 
met by direct observation of S over a 
period of time under a variety of circumstances.
If the specified behavior is
then observed only under the specified 
conditions, the particular personality 
characteristic may be said to exist. 

These ideal criteria are obviously dif- 
ficult to establish. Not only is the pro- 
cedure of direct observation expensive 
in time and money, but there is always 
the chance, since one cannot observe 
all possible situations, that the inferred 
personality characteristic is actually 
either more general or more specific
than is believed. 

Moreover, although the method of 
direct observation over extended times 
and settings has been employed on oc- 
casion (e.g., Murphy, 1937), ordinarily 
the investigator resorts to other, less 
demanding procedures, which, though 
more practicable, inevitably entail fur- 
ther assumptions, hence scientific risks. 

Whatever method is employed for 
the assessment of personality characteristics,
two general kinds of assumptions
are made, explicitly or implicitly.
Each type has its dangers and difficul- 
ties. 


1. The assumption of specificity. To 
identify a personality characteristic, it is 
necessary to specify both a particular 
kind of behavior and a particular type of 
situation in which this behavior is evoked. 
Accordingly, in applying any method of 
personality assessment, it is necessary to 
assume that the behavior and situation 
being observed are representative of the 
particular class of behaviors and situations 
in which one is interested and not some 
broader class or mixture of classes. If this 
assumption is not warranted, then the 
measuring procedure being employed is 
not a valid one. 

Example. An index of aggressiveness 
based on the number of times a child 
attacks other children may be a function 
not only of the child’s tendency toward 
aggressive behavior but also of his general rate
of activity. (For this reason, aggressiveness 
is often measured as a percentage of all acts 
exhibited in a given time interval.) 

2. The assumption of generality. This 
assumption represents the opposite side of 
the coin from the first. In employing any 
assessment technique, one must assume that 
the behavior and situation being observed 
are generalizable to the broader class of 
behaviors and situations in which one is 
interested. In point of fact, the observed 
behavior may be highly specific to the 
particular situation in which it is elicited 
or to the psychological state of the subject 
at the time. 

Example. Level of aspiration as observed 
in a given experiment may not be a 
pervasive aspect of S's behavior but 
specific to the task employed (e.g., dart
throwing). 
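
The remedy mentioned in the first example, expressing
aggressiveness as a percentage of all acts, can be
illustrated with a short computation. The Python sketch
below is only an illustration and is not taken from any
of the studies cited; the function name and the tallies
are hypothetical.

```python
def aggression_percentage(aggressive_acts, total_acts):
    """Return aggressive acts as a percentage of all recorded acts."""
    if total_acts <= 0:
        raise ValueError("total_acts must be positive")
    if not 0 <= aggressive_acts <= total_acts:
        raise ValueError("aggressive_acts must lie between 0 and total_acts")
    return 100.0 * aggressive_acts / total_acts


# Hypothetical tallies for two children observed over the same interval.
active_child = aggression_percentage(aggressive_acts=12, total_acts=120)
quiet_child = aggression_percentage(aggressive_acts=6, total_acts=30)
```

On raw frequency the first child appears twice as
aggressive (12 acts against 6); as a proportion of all
acts the order is reversed (10 per cent against 20 per
cent), which is the confounding with activity level that
the assumption of specificity warns against.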


We shall now examine the application
of these generic assumptions in
each of the major approaches commonly
employed for inferring personality
characteristics.

Sources of Data. We shall discuss
strategies as they relate to the various 
sorts of data utilized as the bases for in- 
ference. The data are of four types. 


1. Observations of behavior in selected 
situations (typically one situation only).
This method compromises with the ideal 
in reducing the variety of situations and 
the period of time over which S is ob- 
served. The situation selected may be 
taken from life or contrived as an experi- 
ment. In either case, both major assump- 
tions apply, and the corresponding pitfalls 
increase as a direct function of the re- 
strictiveness of the situations selected in 
variety and time span. Thus, to return to 
the example of level of aspiration, the 
fewer the tasks employed and the number 
of replications, the greater the risk in 
generalizing to other situations. 

2. Reports of behavior. In the case of 
children, these are typically obtained by 
questionnaires or interviews with the sub- 
ject, his parents, teachers, or others who 
know him well. Here the assumption of 
specificity takes a special form and brings 
its special problems of validity; namely, 
the investigator assumes substantial cor- 
respondence between verbal report and 
overt behavior. The content of the report 
is assumed to be determined primarily by 
the behavior that actually occurred at 
another time rather than by immediate 
circumstances or orientations of the re- 
spondent. It is well recognized, however, 
that considerable violence is done to this 
assumption by distortions of fact. These 
may arise from a variety of sources: ig- 
norance, faulty observation, failure to 
understand the question, and deliberate 
or unconscious bias as a function of the 
respondent’s own needs and behavioral 
dispositions. A special form of this last 
source of error is the response set or 
tendency to answer questions in a par- 
ticular manner without regard to the 
specific content of the item. Response sets 
can be a serious contaminating influence 
in reports of behavior, especially when 
the reports are obtained as responses to
a structured questionnaire. The distortions
are likely to be even more marked, however,
in reports of attitude rather than
action. Accordingly, we defer systematic
consideration of response sets to the next
section, which deals with the measurement
of behavioral dispositions from statements
of attitude.

3. Reports of attitude. Inferences about
personality characteristics are frequently
made from the subject’s stated preferences 
or opinions as recorded in interviews or 
questionnaires or as reported by others. 
Except in the relatively rare instances in 
which the investigator’s interest is con- 
fined strictly to the subject’s verbal re- 
sponse per se, the use of attitude data as 
a basis for inferring personality charac- 
teristics rests on the assumption that the 
professed attitude reflects a disposition to 
act in accordance with the opinion ex- 
pressed. Apart from the requirements of 
construct validity imposed by this hy- 
pothesis, attitude measures of personality 
are, as already mentioned, highly sus- 
ceptible to the contaminating influence of 
response sets. The full extent, variety, and 
subtlety of this influence have come to be 
recognized only in recent years. For 
systematic analysis of the various types of 
response sets and of methods for taking 
them into account, the reader is referred 
to the technical papers of Cronbach (1946, 
1950), Bass (1955, 1956), Edwards (1957), 
Christie et al. (1958), Bronfenbrenner et 
al. (1958) and Bronfenbrenner (1958, 
1960b). For present purposes, it will suf- 
fice to call attention to five of the com- 
monest kinds of response set: 


a. Stereotypy. The tendency to give 
the same response to every question. 

b. Acquiescence. A special type of 
stereotypy in which the respondent con- 
sistently tends to say “yes” or to agree. 

c. Expressiveness. Willingness or inclination
to take a strong or intense position,
whatever its content or direction.

d. Favorability. The tendency to view 
one’s self or others in a favorable light. 

e. Social desirability. The tendency to
give the response that is most socially
acceptable. 


As is readily apparent from the fore- 
going examples, response sets are actually 
personality characteristics in their own 
right. Although in some instances they 
may reflect goal-oriented dispositions (e.g., 
the desire to make a good impression), 
response sets are ordinarily invariant, 
stimulus-specific forms of behavior, hence 
constitute examples of the type of disposi- 
tion we have called “response tendency.” 
In a recent article Loevinger (1959) has 
pointed to the special importance of re- 
sponse sets in the study of personality 
development in children. She argues that 
the gradual increase, between the ages of 
8 and 13, in the tendency to describe 
one’s self favorably (Getzels and Walsh, 
1958) and the subsequent decrease in this 
same tendency over the college years 
(Sanford et al., 1956) can be viewed as a 
reflection of the normal course of ego 
development in children. “At the lowest 
point there is no capacity to conceptualise 
oneself; at the midpoint there is a stereo- 
typed, usually conventional and socially 
acceptable self-conception; and at the 
highest point a differentiated and more or 
less realistic self-concept.” More typically, 
however, response sets are not of interest 
in their own right but function as con- 
founding factors giving a false stability 
and generality to measures intended to 
assess variables far more specific than a 
generalized pattern of verbal response. 
Although structured questionnaires and 
interviews are particularly susceptible to 
these contaminating effects, non-directive, 
projective-type approaches are by no
means immune to their influences.

4. Projective responses. These are ordinarily
obtained by specially designed
techniques (e.g., story-completion, doll
play, drawings), although any kind of be- 
havior, including responses to structured 
questionnaires, may be used as the basis for 
projective analysis. What distinguishes the 
projective technique from other methods 
is a special form of the assumption of
generality. Specifically, projective techniques
are assumed to reflect dispositions
that are not given free and direct expres- 
sion in everyday behavior. Accordingly, 
one cannot postulate, as one does with the 
preceding methods, a continuity in kind 
between the behavior exhibited in the 
projective test and in other situations. 
Nevertheless, a certain systematic relation- 
ship is implied, namely, the projective 
response is presumed to reflect a per- 
sonality characteristic that manifests itself 
in a different but yet theoretically con- 
sistent fashion in real-life situations. The 
validity of a projective technique as a 
measure of the personality characteristic 
depends on the demonstration of this 
systematic relationship. 

In a searching essay on “The Appraisal 
of Child Personality,” Goodenough (1949) 
employed the term “sign” to designate that 
feature of a projective response that is 
taken as an indicator of some underlying 
behavioral disposition, or as she called it, 
“conduct tendency.” 

Because the projective significance of 
his response is ordinarily not apparent to 
the respondent, it is easy to dismiss or 
overlook the possibility that distortions 
or response sets, willful and otherwise, 
may plague such techniques no less than 
they do more conventional procedures. 
It is nevertheless a fact that projective 
techniques are highly susceptible to con- 
founding influences from such factors as 
vocabulary level, verbal fluency, level of 
abstraction, and stereotypy of response. 
Specific instances of such sources of error 
will be noted as they occur in relation to 
particular techniques. 


Whatever the source of the data, 
variations occur also in the unit of 
measurement employed and the level 
of inference it implies. For example, in 
analyzing observations of behavior, the 
investigator may work from a “verba- 
tim” record of specific acts or episodes, 
as in the studies of Barker and Wright 
(1951, 1954) or, following the method 
of Bales (1950), acts may be classified,
as observed, into higher-order categories
such as “shows tension” or “shows
solidarity.” More commonly, however, 
an even more “global” unit is em- 
ployed; that is, observers are asked to 
rate on the basis of general impressions 
gleaned from an entire behavior proto- 
col or from observations over an ex- 
tended period of time. In such in- 
stances, not only is the chain of infer- 
ence longer and less explicit, but there 
is an increased danger of distortion 
from halo effects (which, incidentally, 
are good examples of stimulus charac- 
teristics). 

The same range in units of measure- 
ment and levels of inference is found 
with other sources of data as well. Thus 
measures of attitude may be based on 
responses to specific items or on self- 
ratings on highly abstract variables. 
Similarly, one may analyze projective 
responses at a very molecular level, as 
exemplified in the work of McClelland, 
Atkinson (McClelland et al., 1953; At- 
kinson, 1958) and their colleagues, or, 
following the earlier practice of Murray
(1938) and Sanford et al. (1943),
rate the projective protocol as a whole. 
As we shall indicate in our discussion 
of specific techniques, each approach 
has some advantages and some short- 
comings. 


SELECTION OF VARIABLES 
AND METHOD OF EVALUATION 


Criteria for Selection. Our aim is to 
evaluate, in the light of the conceptual 
distinctions and methodological criteria 
outlined above, examples of the most 
widely used and most promising strategies
and techniques currently available
for measuring personality characteristics
in children. Since practical considerations
preclude the examination of
all possible personality characteristics 
and available techniques and since many 
such procedures are treated extensively 
in other chapters of this book, we focus 
attention primarily on a few selected 
variables and methods meeting the fol- 
lowing criteria: 


1. The variables examined must, inso- 
far as possible, be representative of the 
major types of personality characteristics 
that have been (and are likely to continue 
to be) the focus of attention in work with 
children. 

2. The variables examined must have
been investigated through a variety of 
techniques representing the major ap- 
proaches and problems currently en- 
countered in the measurement of per- 
sonality characteristics in children. 

3. The assessment techniques themselves
must have been devised or adapted to 
measure the specific variable in question. 
Procedures designed to provide a global 
picture of personality or to yield measures 
on a wide variety of dimensions are thus 
excluded from consideration.

4. The method must have been orig- 
inally devised or specifically adapted for 
use with children. Procedures primarily 
designed for adults are not dealt with 
here, even though they may be applicable 
to adolescents or older children.

5. The method must meet conventional 
criteria of psychological measurement, e.g.,
the specific procedures for applying the 
technique must be described and con- 
sideration given to problems of validity 
and reliability. 


The Variables To Be Examined. A 
review of research studies utilizing 
methods meeting the foregoing criteria 
brought to light three general types of 
variables for which specific measuring 
techniques had been developed for use
with children. We may refer to these
characteristics as emotional orienta- 
tions, social orientations, and task ori- 
entations. The first category includes 
variables such as anxiety, guilt, orality, 
and psychological sexuality, which are 
primarily intrapsychic in nature or im- 
ply disposition toward the self. The 
second group is interpersonal and so- 
cial in focus and encompasses such 
attributes as aggressiveness, affiliation, 
authoritarianism, and identification. 
Finally, there are problem-centered 
orientations such as motivation for 
achievement, intolerance of ambiguity, 
and creativity. Although the three 
groups of variables pose many of the 
same theoretical and methodological is- 
sues, each appears also to have certain 
special features. Accordingly, in order 
that our selection might be at least in 
small measure representative of the va- 
riety of problems to be encountered, 
we have chosen one variable from each 
group. In each instance the variable 
chosen was the one that seemed to have 
been studied most intensively with the 
greatest variety of techniques and that 
lent itself best to the exposition of the 
major conceptual and technical problems
involved. The measuring procedures
to be examined, then, are those
that have been devised for assessing 
children’s aggressiveness, anxiety, and 
motivation for achievement. 

Method of Evaluation. The follow- 
ing procedure is employed in evalu- 
ating existing techniques for assessing 
the foregoing variables: 


Step 1. A brief theoretical analysis of 
the variable to delineate its status in terms 
of the concepts outlined under the heading 
The Nature of Personality Characteristics.

Step 2. A factual description of each
technique used for assessing the variable, 
together with basic data available on 
standardization, validity, reliability, etc. 

Step 3. Analysis of the explicit and 
implicit assumptions underlying the tech- 
nique with the aim of determining what 
aspects of the theoretical variable each 
technique is in fact measuring. 

Step 4. Implications of the analysis for 
application of the technique in research or 
clinical practice. 

Step 5. Suggestions for improving exist- 
ing methods for measurement of the vari- 
able. 


METHODS FOR THE ASSESSMENT 
OF SPECIFIC PERSONALITY 
CHARACTERISTICS 


AGGRESSIVENESS 


The appraisal of aggression confronts 
us with most of the theoretical and 
methodological problems discussed in 
the preceding sections; hence it repre- 
sents a particularly useful point at 
which to begin our discussion of tech- 
niques for evaluating specific person- 
ality characteristics. As ordinarily de- 
fined, the construct of aggression sub- 
sumes any action, thought, or impulse, 
the presumed aim of which is physical 
or psychological injury, either real or 
symbolic, to an individual or surrogate.
This typical conceptualization places 
primary emphasis upon aggression as 
a motivational disposition, as we have 
defined this term earlier. In common 
practice, for example, where observations
of a variety of specific aggressive
behaviors are made, principal interest
is usually centered upon the implications
of such behaviors for the individual’s
hostile intent. But an important
difficulty may arise with an uncritical
application of this approach, namely, 
that behaviors classified by an observer 
as “aggressive” may actually be re- 
sponse tendencies rather than instru- 
mental activities in the service of ag- 
gressive motivation. In very young 
children, for instance, one may see 
“destructive” behavior, such as knock- 
ing over block towers or cutting up 
clay figures, which, under some cir- 
cumstances, may actually represent 
characteristic modes of manipulating 
material objects (i.e., response tend- 
encies) rather than hostile acts reflect- 
ing an impulse to hurt or injure. In 
appraising aggressiveness in children, 
then, it is important to attempt to dif- 
ferentiate between behavior that reflects 
the presence of aggression as a motive 
and phenomenologically similar be- 
havior that is essentially a response 
tendency. The task is by no means an 
easy one, but it deserves considerably 
more attention than has typically been 
devoted to it. 

Turning to the sources of data uti- 
lized for the appraisal of aggression, we 
find that only three of the four sources 
previously described are frequently 
used: (1) systematic direct observation 
of the subject’s aggressive behavior, 
(2) reports or judgments of aggressive- 
ness by others having varying degrees 
of contact with the subject, and (3) 
the individual’s projective responses. 
Relatively little use has been made of 
reports of attitudes in the appraisal of 
aggression, probably because of the as- 
sumption that it is difficult to obtain 
valid self-descriptions of an impulse like 
aggression, which is subject to a good 
deal of social inhibition and control.

The obvious variety in the sources
of data just mentioned immediately 
raises the question of the equivalence 
of measures of aggressiveness obtained 
by these very different approaches. It 
is well known, for example, that the 
relationship between fantasy aggression 
(as revealed in projective responses) 
and overt aggression is by no means 
simple or easily predictable. This is one 
illustration of the more general problem
of the complex relationship between
motives and their expression in
behavior (Sanford et al., 1943; Lindzey, 
1952; McClelland, 1958a; Atkinson, 
1958), a problem to which we return 
repeatedly in subsequent discussion. 

In the paragraphs that follow illus- 
trations are presented of the principal 
techniques that have been used for the 
appraisal of aggression. As before, pro- 
cedures are grouped according to the 
sources of data upon which they rely. 

Systematic, Direct Observations of 
Aggressive Behavior. A number of 
promising techniques have been devel- 
oped and applied for the purpose of 
measuring individual differences in 
amount and types of aggressive behav- 
ior by systematic, direct observation in 
naturalistic settings. Typically, these 
techniques involve the determination of 
the frequency of occurrence of a va- 
riety of specific kinds of aggressive be- 
havior, as noted by trained judges 
working over a number of time pe- 
riods. The observations may consist of 
detailed, running narrative accounts, 
which are subsequently analyzed for 
the occurrence of aggressive acts (e.g.,
Jersild and Markey, 1935; Lewin, Lippitt,
and White, 1939), or, as has become
more common, the observer may
directly classify and tabulate the various
defined types of aggressive behavior
as they occur (e.g., Sears et al., 1953; 
Walters et al., 1957). The use of specific 
categories of aggressive behavior per- 
mits meaningful differentiation among 
qualitatively different forms of aggres- 
sion, for example, verbal or physical, 
and among different objects of aggression,
such as persons or inanimate objects.
The frequencies of occurrence of
such behavior units, separately and/or
in aggregate, may then be used as indices
of aggressiveness for each child.
As an illustration of an extremely 
careful systematic application of this 
approach, the reader is referred to the 
study by Sears, Whiting, Nowlis, and 
Sears (1953). These investigators clas- 
sified the aggressive behavior of nurs- 
ery-school children in six categories. 


1. Injury to person (including hurting 
someone, getting someone punished, 
damaging another’s property, derogating 
status). 

2. Discomforting another (including 
threatening gestures and language). 

3. Insuring compliance with demands.

4. Destruction of inanimate objects or
constructions (in the form of displaced
aggression or nondirected aggression).

5. Having object in own possession 
(taking things away from another). 

6. Removal of immediate or anticipated 
frustration (removing barriers, minimiz- 
ing injuries from another person, main- 
taining status as in “saving face,” taking 
back an object that has been previously 
taken away by someone else, retaliation). 


A child’s score for each category was 
the number of occurrences of the de- 
fined type of aggressive acts during 16 
fifteen-minute periods of observation, 
and a total aggression index was obtained
by summing the scores for all
categories. (It should be noted that indices
of this kind may be complicated
by differences in children’s activity 
level.) 
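
The scoring scheme just described (a frequency count for
each category and a total index formed by summing the
counts) may be sketched in Python as follows. This is an
illustrative sketch only, not the investigators' own
procedure; the shortened category labels, the function
name, and the sample record are our assumptions.

```python
from collections import Counter

# Shortened, hypothetical labels for the six categories listed above.
CATEGORIES = (
    "injury_to_person",
    "discomforting_another",
    "insuring_compliance",
    "destruction_of_objects",
    "possession_of_object",
    "removal_of_frustration",
)


def score_child(observed_acts):
    """Tally aggressive acts by category and sum them into a total index.

    observed_acts: iterable of category labels, one per aggressive act,
    pooled over all observation periods for a single child.
    """
    counts = Counter(observed_acts)
    unknown = set(counts) - set(CATEGORIES)
    if unknown:
        raise ValueError(f"unrecognized categories: {sorted(unknown)}")
    # Categories never observed receive an explicit score of zero.
    scores = {category: counts.get(category, 0) for category in CATEGORIES}
    total_index = sum(scores.values())
    return scores, total_index


# Hypothetical record for one child.
acts = [
    "injury_to_person",
    "possession_of_object",
    "possession_of_object",
    "discomforting_another",
]
scores, total_index = score_child(acts)
```

Because the total is a simple sum of frequencies, it
carries the same dependence on activity level as the
separate category scores.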

With careful definitions of the ob- 
servational categories to be used and 
with sufficient training of judges, it is 
possible to achieve a satisfactorily high 
degree of interobserver agreement with 
techniques involving direct observa- 
tion. In the study just described, for 
example, two judges simultaneously 
and independently classifying the be- 
haviors of 43 children over 74 fifteen- 
minute observations achieved percent- 
ages of agreement varying from 67 to 
100 for the six aggression categories,
with an average agreement of 86 per 
cent. 
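
A percentage of agreement of this kind can be computed in
several ways, and the formula used in the study is not
stated here. The Python sketch below assumes the simplest
definition: the proportion of jointly observed behavior
units to which two judges assign the same category. The
function name and the codings are hypothetical.

```python
def percent_agreement(judge_a, judge_b):
    """Percentage of units that two judges classify identically.

    judge_a, judge_b: sequences of category labels, aligned so that
    position i in each refers to the same observed behavior unit.
    """
    if len(judge_a) != len(judge_b):
        raise ValueError("both judges must code the same units")
    if not judge_a:
        raise ValueError("at least one coded unit is required")
    agreements = sum(a == b for a, b in zip(judge_a, judge_b))
    return 100.0 * agreements / len(judge_a)


# Hypothetical codings of five behavior units by two judges.
judge_a = ["injury", "none", "possession", "none", "injury"]
judge_b = ["injury", "none", "injury", "none", "injury"]
agreement = percent_agreement(judge_a, judge_b)
```

An index of this simple form makes no allowance for
agreement that would be expected by chance alone.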

The success with which the method 
of direct observation can be used in 
appraising any phenomenon depends 
not only upon the conceptualization 
and definitions of the categories for 
classifying behavior, and upon the in- 
terjudge reliability of the category sys- 
tem, but also upon the strategy for 
sampling of observations; that is, the 
number, duration, and spacing of the 
behavior samples to be observed. This 
fundamental problem in the method- 
ology of observational procedures is 
dealt with elsewhere in the present 
volume (Chapter 3). It is mentioned 
briefly here, however, because of the 
known variation in aggressive behavior 
in specific situations. If one wishes to 
obtain indices of broad generality, one 
needs to sample fairly large segments 
of the child’s behavior in a variety of 
situations, and it is necessary that the 
sampling of such situations be reason- 
ably comparable from child to child.

The direct classification and tabula-
tion of aggressive behaviors as they oc- 
cur by means of a prearranged system 
of categories has many methodological 
advantages. For some purposes, how- 
ever, the investigator may gain by re- 
cording relatively detailed narrative de- 
scriptions of the behavior under study 
and the setting in which it occurs, with 
the coding and tabulation of specific 
categories of behavior occurring later. 
The principal advantage of such an ap- 
proach is that it includes a description 
of the situational context in which the 
aggressive behavior takes place. More- 
over, it permits sequential analyses of 
changes in behavior, without necessary 
loss of the quantitative treatment of 
the various categories of aggressive acts 
under study (e.g., Jersild and Markey, 
1935). 

Finally, an approach of this type 
makes possible the differentiation be- 
tween aggressive dispositions as re- 
sponse tendencies and as motives. For 
example, one could evaluate each child’s 
record for the extent to which it reveals 
a variety of aggressive behaviors across 
different situations, with invariance of 
goal [i.e., hostile intent or injury to
another (aggressive motive)], as dis-
tinguished from a specific type of ag- 
gressive behavior occurring frequently 
but only in a particular situation. With- 
out such analysis, two children with 
quite different behavior protocols re- 
flecting different kinds of disposition 
might conceivably achieve the same 
over-all aggressiveness score on the 
basis of mere frequency of occurrence 
of behavioral acts. To the authors’ 
knowledge, such analyses of observa-
tional data have not yet appeared in
the research literature. 


Reports or Judgments of Aggressive-
ness from Others. One step removed
from the direct observation of aggres- 
sive behavior is the familiar method of 
asking judges to make ratings of ag- 
gressiveness. Such judgments usually 
require the rater to summarize in quan- 
titative form a variety of cumulative, 
relatively uncontrolled observations and 
impressions. 

Among the earliest attempts to set 
up carefully defined scales for judging 
various kinds of aggressive behavior (as
well as a variety of other traits) are the
Fels Child Behavior Rating Scales
(Richards and Simons, 1941) and the 
California Behavior Inventory for 
Nursery School Children (Conrad, 
1932, 1933; Read, 1940). The revised 
Fels Child Behavior Rating Scales* con-
tain four scales in the area of aggres- 
sion: one is concerned with the fre- 
quency of aggressive behavior, another 
with the success or lack of success of 
the child’s aggression against others, a 
third describes his reaction to aggres- 
sion by others, and a fourth evaluates 
the extent to which the child tends to 
elicit aggressive behavior directed to- 
ward him. The last scale constitutes an 
excellent example of aggression as a 
stimulus characteristic. Each scale has 


*There have been many extended treat- 
ments of the methodological problems in- 
volved in the construction and use of rating 
and ranking methods (e.g., Thorndike and 
Hagen, 1955; Guilford, 1954). The reader is 
referred to these for detailed consideration 
of technical issues. 

*Fels Research Institute, Yellow Springs, 
Ohio.

a general definition, as well as five de-
fined cue points, and the rater makes
his judgment at any point on a graphic 
continuum. 

The previously mentioned study of 
Sears et al. (1953) illustrates the effec- 
tive use of rating scales to obtain judg- 
ments of aggressiveness from nursery- 
school teachers. Each of 13 scales, as- 
sumed to represent various instrumen- 
tal activities subserving aggressive 
motivation, was described in detail. Rat- 
ings were made on a seven-point 
graphic scale, four points being defined 
in terms of frequency of occurrence.*


With adequate training and experience 
in the use of the scales, pairs of teachers 
were able to achieve a satisfactory de- 
gree of agreement in rating children’s 
aggressiveness (r’s from .66 to .o8). 
The final rating used was the sum of 
the judgments by the two teachers. As 
previously mentioned, children’s ag- 
gression was also appraised in this study 
by means of a direct observational tech- 
nique. The correlations between the 
observational measures and the teachers’ 
ratings of total aggressive behavior were 
.64 for boys and .48 for girls. These
correlations provide evidence support-
ing the construct validity of both tech- 
niques as methods for appraising ag-
gressive behavior.
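
The two computations mentioned in this paragraph, the correlation between two raters and the summing of their judgments into a final rating, can be sketched as follows. The ratings shown are invented for illustration, and the helper name is hypothetical; the product-moment formula itself is standard.

```python
import math


def pearson_r(x, y):
    """Product-moment correlation between two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two equal-length sequences of at least 2 values")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined when a variable is constant")
    return sxy / math.sqrt(sxx * syy)


# Hypothetical seven-point ratings of five children by two teachers.
teacher_1 = [1, 2, 3, 4, 5]
teacher_2 = [2, 1, 4, 3, 5]

interrater_r = pearson_r(teacher_1, teacher_2)
# Final rating for each child = sum of the two teachers' judgments.
final_ratings = [a + b for a, b in zip(teacher_1, teacher_2)]
```

The same function could be applied to the final ratings and an observational index to obtain a cross-method correlation of the kind reported above.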

A number of promising techniques 
have been utilized to obtain estimates 
of the aggressive behavior of older chil- 
dren. Mussen and Naylor (1954), for 
example, used daily behavior reports 
(made by attendants and a handicraft 

*The complete definitions of the scales are
not found in Sears (1953) but are available
elsewhere (Beller, 1948).

teacher over a two-week period) of the
aggressive behavior among delinquent 
boys (ages 9 to 15) living in cottage
groups. Each report consisted of a check 
list of 12 specific kinds of aggressive 
behavior, and the observer indicated 
which types of behavior had occurred 
each day. In addition, at the end of a 
week, the observer rated each boy on 
five scales representing different as- 
pects of aggression. Although no data 
are reported on interjudge reliability for 
these indices, they apparently possess 
some validity as measures of aggression, 
since they revealed consistent relation- 
ships with independent estimates of 
fantasy aggression. Thus they appear to 
represent a promising practical ap- 
proach to the problem of getting esti- 
mates of overt aggression from lay 
observers, based on day-to-day associa- 
tion. 

Several other investigators have dealt 
effectively with the problem of ob- 
taining estimates of specific types of 
aggressiveness from teachers. Thus fre- 
quency of fighting behavior shown by 
boys in the first three grades was rated 
by teachers in a study by Kagan (1956). 
A more global approach, aimed at dif- 
ferentiating socially acceptable and 
socially unacceptable forms of aggres- 
sive behavior in high-school boys (15 
to 17 years old), is illustrated in the 
recent work of Jensen (1957). He used 
a combination of nominating and rating 
techniques by teachers to identify con- 
trasting groups of high-school boys 
classifiable as “aggressive-bad,” “aggres-
sive-good,” and “passive.” Each teacher 
named five boys most closely fitting the 
descriptions of the aggression types and
also ranked the boys and filled out
descriptive check lists containing
various items of aggressive behavior.
From these data, the investigator picked 
relatively homogeneous groups of boys 
representing each of the three types of 
aggression “syndromes.” 

As we move further away from 
quantitative judgments on various de- 
fined variables, we approach the ap- 
praisal of aggression as a stimulus 
characteristic, evaluated in terms of the 
relatively long-standing impression a
child has made on other children. In 
a current research study Eron and 
others (1958) have used the familiar 
“guess who” technique in order to 
obtain criterion measures of aggressive- 
ness based on peer nominations of third- 
grade children. After considerable pre-
testing, the investigators found 26 
specific items of aggressive behavior on 
which children could agree quite well 
in making their nominations (e.g., 
“Who starts a fight over nothing?” 
“Who grabs things from other chil- 
dren?”). Further, the items could be 
grouped into clusters having median 
interjudge reliabilities in different class- 
rooms ranging from .85 to .92. For 
total aggression scores thus obtained, the 
correlation with teacher ratings of ag- 
gression was .63. Although there are 
methodological problems involved in 
the use of peer-nominating techniques, 
such as the lack of comparability of 
frequency of nomination scores ob- 
tained in groups of different size and 
composition (Bronfenbrenner, 1945), 
this approach can be used profitably 
(Beswick and Cox, 1958). 
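
The group-size problem can be illustrated with a short sketch. It assumes, purely for illustration, that every classmate may name a given child at most once per item and that children do not nominate themselves; under those assumptions a raw nomination count can be expressed as a proportion of the nominations possible. This is not the procedure of any study cited here, and it addresses only group size, not group composition.

```python
def nomination_proportion(nominations_received, class_size, n_items):
    """Nominations received as a fraction of those possible.

    Assumes each of the (class_size - 1) classmates can name the child
    at most once on each of n_items items (hypothetical scoring rule).
    """
    possible = (class_size - 1) * n_items
    if possible <= 0:
        raise ValueError("class_size must exceed 1 and n_items must be positive")
    if not 0 <= nominations_received <= possible:
        raise ValueError("nominations_received is outside the possible range")
    return nominations_received / possible


# The same raw count of 30 nominations on 3 items in classes of 21 and 31.
small_class = nomination_proportion(30, class_size=21, n_items=3)
large_class = nomination_proportion(30, class_size=31, n_items=3)
```

The identical raw score corresponds to one half of the possible nominations in the smaller class but only one third in the larger, which is the lack of comparability noted above.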

Having dealt with evaluations and 
reports of children’s aggressive be-
havior by trained observers, by teachers,
and by classmates, we now turn
to a source of information that is liter-
ally “closest to home”: the child’s
mother. Although there has always
been some skepticism with regard to the
accuracy of mothers’ reports concern- 
ing various aspects of their children’s 
behavior, some investigators have found 
this a useful approach (e.g., Macfarlane
et al., 1954; Sears, Maccoby, and Levin, 
1957), particularly if one uses careful 
interviewing and rating methods to
obtain the data. When reports from 
mother are used to supplement data 
on aggression obtained from other 
sources, they may contribute additional 
information of considerable importance. 
Taken by themselves, however, they 
would generally appear to represent 
material of uncertain objectivity and 
validity. 

Projective Responses. It is probably 
safe to say that this source of data has 
been by far the most widely used by 
investigators concerned with the ap- 
praisal of aggression. Such usage reflects 
the view of aggression as a motive sub- 
jected to considerable social control and 
inhibition; hence it is not readily ac- 
cessible to more direct methods of 
study. In the paragraphs that follow 
representative projective approaches are 
discussed under the headings of picture- 
interpretation methods, sentence-com- 
pletion methods, and play methods. 

Picture-Interpretation Methods. 
Since the classical work of Murray and 
his collaborators (1938), the Thematic 
Apperception Test (1943) and a variety 
of similar picture-interpretation tech- 
niques (Hartwell et al., 1951; Symonds,
1949; Bellak, 1954) have been widely
used as methods for the global appraisal
of personality in children. In this con- 
text, specific assessments of the strength 
and quality of aggressive motivation 
have sometimes been made by having 
skilled judges make subjective ratings 
on the basis of their interpretation of 
the total protocol (e.g., Sanford et al., 
1943; Frenkel-Brunswik, 1942). 
However, there has been an increas- 
ing tendency for investigators to utilize 
picture-interpretation methods in a 
more focused manner. Mussen and 
Naylor (1954), for example, in the
study of 9-to-15-year-old boys pre-
viously mentioned, obtained a measure
of fantasy aggression by counting the
number of times aggressive acts by the
hero or leading character occurred in
the subject’s stories on ten selected
TAT pictures.


The following were considered acts of 
aggression: fighting, killing, criminally
assaulting; getting angry, hating, quarrel-
ing, cursing, criticising, blaming, ridicul-
ing; breaking and smashing objects; escap-
ing restraint, running away, resisting
coercion; being negativistic, resisting
authority, lying, cheating, stealing, gam-
bling; forcing someone to change his be-
havior or ideas; domineering or restraining
someone; suicide, self-injury, self-deprecia-
tion (p. 236). 


The measures of aggressive motiva-
tion thus obtained were significantly
related to the indices of overt aggression 
obtained from observers’ reports, as 
described previously (p. 783), and the 
relationship was even stronger when 
joint consideration was given to fantasy 
aggression and to anticipated punish- 
ment of aggression expressed in the 
TAT stories. Subjects with high fantasy 
aggression and low fear of punishment
showed more overt aggressive behavior
than those with little fantasy aggres- 
sion accompanied by high fear of pun- 
ishment [see also discussion of Hollen- 
berg and Sperry study (1951) on p. 
790]. The authors rightly point out that 
their findings must be regarded in 
part as a function of the lower-class 
sample with which they deliberately 
elected to work, since 

. . . earlier studies have shown that in
other social classes and age groups the 
relationship between covert and overt 
aggression may be negligible or even 
negative. These findings may not be at 
all applicable to members of a social class 
which has rather rigid taboos against the 
expression of aggression, or to older people 
whose internal controls of aggressive ex- 
pression may be more firmly established
(p. 239).

This study provides convincing evi- 
dence that fantasy measures of aggres- 
sive motivation have promising validity 
for predicting overt aggression. At the 
same time, it should be pointed out 
that such measures may be valid under 
some circumstances and not under 
others because of the complex relation- 
ship between motives and their expres- 
sion in behavior, in part as a function 
of the situational context. For example, 
a more recent study by Jensen (1957) 
reports no relationship between amount 
of fantasy aggression revealed in ten 
TAT pictures and overt aggressive be- 
havior judged by teachers (see pp. 783- 
784). In contrast with Mussen and Nay- 
lor’s research, however, Jensen’s study 
involved older subjects (15 to 17 years), 
presumably representing a different 
social class, a different set of TAT pic- 
tures (which were group-adminis- 
tered), a different scoring method
(each story being rated from 0
to 10 for amount of aggression), and 
different criterion measures of overt 
aggression. Despite the lack of relation- 
ship between amount of fantasy and 
overt aggression, Jensen did find that 
boys who showed aggressive behavior 
of a socially unacceptable sort had more 
stories without punishment or defense 
against aggression than boys expressing 
socially acceptable forms of aggression. 
At the same time, the “aggressive-bad” 
group showed more freedom in ex- 
pressing themselves by stories contain- 
ing socially disapproved themes and 
language. This study again underscores 
the importance of examining picture- 
interpretation material specifically for 
content involving punishment or con- 
trol of aggression. At the same time, it 
points to the potential value of analyz- 
ing stories for certain expressive qual- 
ities viewed as “overt” behavior
samples (e.g., tabooed language and 
themes). 

Kagan’s study (1956) is representa- 
tive of the current interest in the use 
of pictures specifically designed for 
appraising a particular motive. Work- 
ing with 118 boys in grades one to 
three, he presented 13 pictures, each 
containing a boy in situations varying 
in ambiguity with respect to the sug- 
gestion of a fighting theme (none 
actually showed a fight in progress). 
Arguing that we need to emphasize 
greater precision in our analyses of ag- 
gressive fantasy and behavior, Kagan 
found a significant relationship between 
a simple count of the fighting themes 
in the boys’ stories and ratings of fre- 
quency of fighting behavior obtained 
from teachers. He also reports evidence
that other categories of aggressive
fantasy (e.g., stealing) were not related 
to overt aggression as manifested by 
fighting behavior. This study demon- 
strates the desirability of going beyond 
indices of aggression based on general 
categories (such as those used by 
Mussen and Naylor) to deal more
systematically with specific kinds of 
aggression, both in fantasy and in overt 
behavior. In this way, we may hope 
to increase our understanding of the 
processes interconnecting behavior in 
these two spheres. 

The findings reported by Kagan also 
suggest that pictures with more obvious 
aggressive content are preferable for 
differentiating between groups varying 
in amount of overt aggression. This 
work is illustrative of the increasing 
attention investigators are directing
toward the general methodological 
problem of the relation between picture 
cues and the content of the fantasies 
they evoke (e.g., Atkinson, 1958, pp.
596-683). 

Another recent development that 
points toward more refined quantitative 
methods for appraising aggression, as 
well as other motives, is represented by 
Lesser’s attempt (1958) to apply Gutt- 
man’s scaling method (1950) in evaluat- 
ing children’s fantasy aggression. Using 
ten specially constructed pictures ad- 
ministered to 72 boys, ages 10 to 13, 
Lesser was able to construct a reason- 
ably satisfactory Guttman scale of 
fantasy aggression. These pictures had 
previously been used to obtain a gross 
measure of fantasy aggression that had
shown promising relationships to in- 
dices of aggression in overt behavior. 
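
How well a set of items approximates a Guttman scale is usually summarized by the coefficient of reproducibility, defined as one minus the proportion of responses that deviate from a perfect scale pattern. The sketch below is a minimal illustration under stated assumptions: responses are coded 0 or 1, items are ordered by how often they are endorsed, and errors are counted as deviations from the ideal pattern implied by each respondent's total score. The source does not say how errors were counted in the study cited, so this convention and the sample data are assumptions.

```python
def coefficient_of_reproducibility(responses):
    """1 - (scale errors / total responses) for 0/1 item data.

    responses: list of rows, one per respondent, each a list of 0/1
    scores on the same k items (assumed format).
    """
    n = len(responses)
    if n == 0:
        raise ValueError("at least one respondent is required")
    k = len(responses[0])
    if k == 0 or any(len(row) != k for row in responses):
        raise ValueError("every respondent needs scores on the same items")
    # Order items from most to least frequently endorsed.
    totals = [sum(row[j] for row in responses) for j in range(k)]
    order = sorted(range(k), key=lambda j: -totals[j])
    errors = 0
    for row in responses:
        score = sum(row)
        # Ideal pattern: the `score` most popular items endorsed, the rest not.
        ideal = [1] * score + [0] * (k - score)
        actual = [row[j] for j in order]
        errors += sum(a != b for a, b in zip(actual, ideal))
    return 1 - errors / (n * k)


# Hypothetical data: four children scored on three fantasy-aggression items.
perfect = [[1, 1, 1], [1, 1, 0], [1, 0, 0], [0, 0, 0]]
imperfect = [[1, 1, 1], [1, 1, 0], [1, 0, 0], [0, 0, 1]]
rep_perfect = coefficient_of_reproducibility(perfect)
rep_imperfect = coefficient_of_reproducibility(imperfect)
```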

A method employing picture cues of
a highly structured type is the Chil-
dren’s Form of the Rosenzweig Pic-
ture-Frustration Study (Rosenzweig
et al., 1948). This instrument is an
adaptation of the adult form of the
test (Rosenzweig, 1945) and is designed
for children from 4 to 13. It contains
24 cartoonlike drawings depicting a
child in a mildly threatening situation
involving another person, and the sub-
ject is asked to indicate what the child
might say. Each response is scored for
direction of aggression (outward, in- 
ward, or minimized and glossed over) 
and kind of reaction to frustration. The 
P-F technique is more amenable to 
quantitative scoring and statistical treat- 
ment than many projective methods, 
and initial norms are available for 131
boys and 125 girls from 4 to 13. Evi- 
dence of the test’s construct validity, 
however, is still unclear and is limited 
for the most part to the results of 
comparisons between grossly contrasted 
groups, such as clinic and nonclinic 
cases (Rosenzweig and Rosenzweig, 
1952) and delinquents and nondelin- 
quents (Norman and Kleinfeld, 1958). 
If supported by further evidence of 
construct validity, the P-F technique 
would appear to be a useful research 
instrument for the appraisal of aggres- 
sive reactions to frustration. 

Sentence-Completion Methods. The
method of story or sentence completion 
has seen rather wide clinical use as a 
projective technique for global per- 
sonality assessment (e.g., Rohde, 1946; 
Rotter and Rafferty, 1950; Sargent, 
1953). One of the important advantages 
of this method is the possibility it offers 
for structuring the incomplete sentence 
or story to elicit responses that are
relevant to a particular area of interest.
For example, Graham et al. (1951)
constructed 50 incomplete sentences,
each of which portrayed a person as 
an object of aggression. The subject 
was to complete each statement by in-
dicating the most likely way a person 
might act in such a situation. The ag- 
gressive situations were systematically 
varied with regard to the role of the 
one who is being aggressive (e.g., 
parent, sibling, friend or classmate, 
authority, and inferior) and the degree 
of aggressiveness shown (hitting, bawl- 
ing-out or making dirty cracks, talk- 
ing-about behind back, giving nasty 
looks, and not-liking). For instance, 
“When John’s mother hit him, he 
...”; “The colored kid called him 
names, so Ricky .. .”; “When her 
mother bawled her out like that, 
Thelma . . .” (p. 513). 

The 50 incomplete statements just 
described were administered to about 
100 boys and girls in the seventh, eighth, 
and ninth grades, and all responses 
judged to be aggressive (i.e., involving 
injurious effect) were scored for 
strength of aggression with a high de- 
gree of interjudge agreement. The in- 
vestigators were primarily concerned 
with the relationship between strength 
of aggressive response and the type and 
strength of instigation, within the ver- 
bal material used. Consequently, they 
provide no data on the relationship 
between the projective measures of ag- 
gression and independent indices of 
aggressive behavior. At the same time, 
however, the theoretical consistency 
of the results obtained suggests that a
carefully constructed sentence-com-
pletion test of this type may possess
considerable construct validity and
warrants further exploration. 

Play Methods. For a good many 
years the play interview has been one 
of the most widely used clinical meth- 
ods for the appraisal of young children’s 
personality (Levy, 1936; Erikson, 1940;
Murphy et al., 1956). In this technique, 
which commonly combines diagnostic 
and therapeutic functions, the child is 
encouraged to play with a variety of 
materials (family dolls, clay, paints, 
toys, etc.) by an adult examiner who 
assumes a generally permissive, under- 
standing, nonjudgmental role. The adult 
may enter into the play in a variety of 
ways, by making encouraging com- 
ments, asking questions, or interpreting 
the child’s feelings. 

More recently, investigators have at- 
tempted to develop and exploit the 
play interview as a controlled, system- 
atic research instrument for appraising 
child personality (Bach, 1945; Pintler, 
1945; Sears, 1947; Lynn and Lynn, 
1959). These more structured ap- 
proaches usually involve uniform pre- 
sentation of standard sets of play 
materials, most often family dolls, and 
systematic procedures for categorizing 
and evaluating the child’s behavior in 
quantitative terms. (For an unusual 
and provocative quantitative treatment 
of play-interview-behavior items, the 
reader is referred to the study by 
Carroll and Levin, 1956.) 

The controlled doll-play interview 
has had particularly wide application 
in the study of aggression in children. 
A good illustration is the structured 
doll-play technique described by Sears 
(1951) and utilized in a number of 
recent studies on aggression (e.g., Sears,
Whiting, Nowlis, and Sears, 1953; Levin
and Sears, 1956). The material consists 
of a miniature six-room roofless “house” 
containing realistic furniture that is set 
up on the floor and five dolls that are 
easily recognizable as father, mother, 
boy, girl, and baby. The examiner urges 
the child to play with the dolls and the
house they live in and interacts with the
child by means of encouraging com- 
ments, continued attention and interest, 
etc., to stimulate continued play with 
the dolls. At the same time, the ex- 
aminer makes an observational record of 
the child’s aggressive play, which per-
mits coding of aggressive acts with 
respect to (1) frequency or amount, (2) 
direction (i.e., agent and object of ag- 
gression), (3) type or content, and (4) 
latency (time elapsing before occur- 
rence of the first aggressive act). Each 
subject receives two twenty-minute 
sessions of play, usually on consecutive 
days. 
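
The four coding dimensions listed above can be sketched as a simple record structure. The field names, the representation of direction as an (agent, object) pair, and the sample acts are all hypothetical; the fragment is meant only to show how one observational record could yield frequency, direction, type, and latency, not to reproduce the original coding scheme.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass
class AggressiveAct:
    seconds_from_start: float  # time at which the act occurred
    agent: str                 # doll (or child) performing the act
    target: str                # doll or object receiving the act
    kind: str                  # type or content of the aggression


def summarize_session(acts):
    """Summarize one play session on the four coding dimensions."""
    ordered = sorted(acts, key=lambda act: act.seconds_from_start)
    return {
        "frequency": len(ordered),
        "direction": Counter((act.agent, act.target) for act in ordered),
        "type": Counter(act.kind for act in ordered),
        # Latency: time elapsed before the first aggressive act, if any.
        "latency": ordered[0].seconds_from_start if ordered else None,
    }


# Hypothetical record from a single twenty-minute session.
session = [
    AggressiveAct(310.0, "boy", "baby", "hitting"),
    AggressiveAct(95.0, "father", "boy", "scolding"),
    AggressiveAct(640.0, "boy", "baby", "hitting"),
]
summary = summarize_session(session)
```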

Sears (1951) conducted an extensive 
methodological and normative study 
with the technique just described, 
which was administered to 150 boys and 
girls 3 to 5 years of age. She found 
that independent observers could cate- 
gorize the occurrence of various types 
of aggressive acts with a high degree of 
agreement (82 per cent) and that chil- 
dren maintained reasonably similar rel- 
ative positions with regard to the 
amount of aggression shown in the two 
sessions (r’s = .73 for boys, .49 for
girls). The doll-play technique that
Sears utilized is thus seen to have ap-
preciable interjudge as well as test-
retest reliability, and it represents an
extremely useful technique for the ap-
praisal of aggression as expressed in
play.*

Other types of play materials and
procedures have also been used in con-
trolled or semicontrolled techniques for 
eliciting and evaluating aggression. For 
example, Siegel (1957) observed the 
aggressive play of pairs of preschool 
children who were left alone for four- 
teen minutes with instructions to play 
freely until the examiner returned. A 
standard set of play materials was 
provided, including rubber daggers, 
clay, toy telephones, rubber sponges, 
small toy vehicles, a doll bed and a 
doll, and inflated balloons. During each 
twenty seconds of play, two observers 
independently rated each child on a 
four-point scale for level or intensity 
of aggression and level of anxiety and 
guilt over aggression. Total score for 
the child was simply the sum of these 
ratings for the entire observation. Inter- 
observer agreement for these total 
scores was very high (r = .97 for level 
of aggression and .77 for anxiety and 
guilt). Siegel’s method is of particular 
interest, since it permits expression of 
fantasy as well as overt aggression with 
a greater variety of play materials than 
is typically provided in standardized 
doll play and without the presence of 
an adult. At the same time, it utilizes
ratings of the intensity of behavioral 
units of aggression, which may even- 
tually provide more valid measures of 
strength of aggressive motivation than


*The Levin and Sears study (1956) pro-
vides additional normative data on 379 5-
year-olds and generally confirms the evidence 
just cited in regard to the reliability of the 
doll-play technique. 


are obtainable from indices relying on 
frequency alone. 
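
The contrast between an intensity-weighted total and a simple frequency count can be made concrete with a short sketch. A fourteen-minute session divided into twenty-second units gives 42 ratings per child. The 0-to-3 coding of the four-point scale and the two sample records are assumptions introduced for illustration; the source does not give the numerical values of the scale points.

```python
SESSION_SECONDS = 14 * 60
INTERVAL_SECONDS = 20
N_INTERVALS = SESSION_SECONDS // INTERVAL_SECONDS  # 42 rating units


def total_score(interval_ratings):
    """Sum of the interval ratings over the whole observation."""
    if len(interval_ratings) != N_INTERVALS:
        raise ValueError(f"expected {N_INTERVALS} interval ratings")
    return sum(interval_ratings)


def frequency_only(interval_ratings):
    """Number of intervals in which any aggression was rated (rating > 0)."""
    return sum(1 for rating in interval_ratings if rating > 0)


# Two hypothetical children, each aggressive in six intervals,
# one at the mildest level (1) and one at the most intense level (3).
child_a = [1] * 6 + [0] * 36
child_b = [3] * 6 + [0] * 36

totals = (total_score(child_a), total_score(child_b))
frequencies = (frequency_only(child_a), frequency_only(child_b))
```

The two records are indistinguishable on frequency but differ threefold on the summed ratings, which is the distinction the paragraph above draws.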

Another interesting variant of the 
play method is represented in the tech- 
nique devised by Stone (1956), in which 
children are provided with an op- 
portunity to break inflated balloons 
under varying conditions of adult en- 
couragement or permissiveness. Al- 
though not yet highly developed as a 
standardized method, this technique 
appears to represent a potentially 
promising semistructured, miniature- 
situation approach to the analysis of 
children’s aggressive behavior. One of 
the difficulties with techniques of this 
sort, as well as the more structured play 
methods, is the previously discussed 
problem of determining whether be- 
havioral acts classed as “aggressive” re- 
flect response tendencies rather than 
motives implying hostile intent. 

Although doll play and related tech- 
niques are typically classified under the 
rubric of “projective techniques,” it 
should be noted that in contrast to the 
predominantly verbal responses usually 
elicited by most projective methods, doll
play evokes both verbal fantasy and 
fantasy behavior. The child can actively 
make the dolls play imaginary roles of 
various sorts, while at the same time he 
can directly express his own aggression 
against the dolls, toy furniture, the ex- 
aminer, etc. Because the doll-play tech- 
nique can elicit samples of fantasy and 
even overt aggression so directly, it has 
usually been regarded as having a high 
degree of intrinsic validity in its own 
right; thus doll-play aggression has 
served as the major dependent variable 
of interest in a number of studies (e.g., 
Siegel, 1956; Levin and Sears, 1956;
Sears, Pintler, and Sears, 1946). For the
most part, such studies have yielded 
theoretically congruent results that lend 
support to the construct validity of the 
doll-play measures. For example, Hol- 
lenberg and Sperry (1951) found that 
the frequency of doll-play aggres- 
sion was greatest for children experienc- 
ing high punishment for aggression and 
high frustration in the home, whereas 
it was lowest for children experiencing 
little punishment and frustration. Sim- 
ilarly, Sears et al. (1953) found a posi- 
tive relationship between severity of 
punishment for aggression in the home 
and amount of children’s doll-play ag- 
gression, although in a later study 
(Levin and Sears, 1956) this relationship 
was true for girls only. 

Although these findings support the 
construct validity of doll-play measures, 
it should be recognized that the rela- 
tionship between aggression in doll play 
and in natural social situations has not 
yet been widely investigated. Hence 
the question of the generality or 
specificity of the indices of aggressive 
motivation revealed in play remains a 
problem for further study. All in all, 
however, structured play techniques 
for appraising aggression in young chil- 
dren represent valuable research meth- 
ods that have already been subjected to 
considerable technical study as research 
tools and have demonstrated their use- 
fulness in theoretical studies of fantasy 
aggression. 


Evaluative Summary 


It is apparent from the foregoing 
discussion that although many promis-
ing techniques for the appraisal of
aggression in children have been de-
veloped, the research worker will find
relatively few refined psychometric 
instruments available. To some extent, 
this state of events reflects the over-all 
status of psychological measurement in 
the field of personality study. In addi- 
tion, however, the paucity of standard- 
ized techniques is partly due to the
complex nature of aggression as a psy-
chological variable. 

Viewed as a motive, aggression is 
typically subject to a variety of social 
inhibitions and controls from many
sources. Consequently, aggressive im-
pulse may manifest itself in many dif-
ferent forms, some obvious, some 
subtle, and its overt expression will vary 
greatly, depending upon the nature of 
the situational conditions and the rep- 
ertoire of control mechanisms that the 
child has learned to invoke on the basis 
of previous experience. Some children 
with considerable aggressive motivation 
may reveal aggressive behavior very 
rarely, and then only under special cir- 
cumstances, so that virtual absence of 
overt aggression does not necessarily 
imply absence of aggressive impulse. 

When we consider overt behavior 
that is phenomenologically classifiable 
as “aggressive,” we are faced with the 
problems of deciding whether such be- 
havior reflects the presence of aggres- 
sive motivation, with explicit or im-
plicit hostile intent, or whether it should 
be regarded as a nonmotivational re- 
sponse tendency. We sometimes have 
the additional complication of “aggres- 
sive” behavior, which subserves some 
other motive such as the need for 
dominance or mastery.

Problems such as these pose obvious
and formidable difficulties for the in-
vestigator insofar as the methodological 
assumptions of generality and specifi- 
city are concerned. He needs to be 
aware, for example, that if he relies only
on behavioral measures of aggression
his measures may reflect the specific 
situational conditions to a high degree. 
If he bases his estimates on projective 
responses primarily, these may tell him 
very little about the person’s aggressive 
behavior. These and similar difficulties
suggest that the most theoretically
meaningful and practically useful ap- 
praisals of aggression are likely to be 
obtained in a multifaceted approach, 
which combines observations and re- 
ports of behavior and projective re- 
sponses obtained across a variety of 
situational contexts. Such “triangula- 
tion” would enable the investigator to 
evaluate not only the strength of ag- 
gressive motivation but also the types 
of stimuli most likely to instigate ag- 
gression, the kinds of controls a child 
imposes under different circumstances, 
and variations in overt aggression that 
occur as a function of all of the preced- 
ing factors. It is recognized that the 
approach just described is methodologi- 
cally expensive in terms of time, money, 
and technical skills. When practical 
considerations preclude such strategy, 
it is suggested that the investigator em- 
ploy at least two measures of aggression, 
one based on observations or reports
of behavior, the other on projective 
responses. 

Research workers may find some of 
the specific techniques discussed in this 
chapter usable in the manner in which 
they were originally employed (e.g.,
the observational method used by Sears
et al., 1953, and the TAT measures 
of Mussen and Naylor, 1954). In other 
instances the investigator may wish to 
adapt a technique to fit his particular 
research needs. In either event the re- 
searcher should take appropriate meas- 
ures to insure adequate reliability of 
administration and scoring before 
undertaking operational use of any of 
the techniques mentioned. Most of the 
methods described in the chapter have 
either been shown to have adequate 
interjudge reliability or appear to be 
reasonably capable of meeting this 
requirement. 

Finally, it should now be apparent 
that one cannot evaluate adequately the 
validity of any techniques here de- 
scribed in terms of a relationship to a 
single empirical criterion. Rather one 
needs to look for different kinds of 
evidence to indicate that the method
possesses some construct validity (see
earlier discussion of this concept). 
Whenever such evidence has existed 
for the techniques reviewed, it has been 
cited in our discussion. It should be 
remembered, however, that even a tech- 
nique possessing promising construct 
validity may yield measures that do not 
have a high degree of generality across 
situations. Once again, then, we are led 
to the conviction that whenever possible 
the appraisal of aggression in children 
should be undertaken with the simul- 
taneous application of measures based 
on observations and reports of behavior 
as well as projective responses, all ex- 
amined across a variety of different 
situations. This variation might be supplied
by nature, by contrived experiment,
or preferably by both. In short,
we see the methodological gains of the
future lying in a multifaceted approach
that fuses experimental, observational,
and testing methods.


ANXIETY 


The term “anxiety” has been widely 
used with a rather large variety of 
connotative and denotative meanings. 
In what appears to be its most general 
meaning and commonest usage, anxiety 
refers to an affective state of the in- 
dividual, characterized by fearfulness, 
apprehension, or dread. Following the 
original suggestion of Freud (1935), 
many workers have found it both theo- 
retically and practically useful to dis- 
tinguish between a generalized state of 
apprehensiveness, in which the individ- 
ual’s anxiety is not restricted to any 
particular object or class of objects 
(Freud’s “free-floating” anxiety), and
fearfulness or apprehension attached to 
particular objects or situations. 

Although some investigators em- 
phasize this distinction, the custom is 
by no means universal, and the terms 
are often used interchangeably. Thus 
even a brief review of the research 
literature reveals the variety and am- 
biguity of conceptualizations and opera- 
tional techniques used by different in- 
vestigators. Although research workers 
are tending increasingly to designate 
the specific types of anxiety that
their instruments presumably measure 
(test anxiety, audience anxiety, sex 
anxiety, etc.), mere absence of such 
a specific modifier does not necessarily 
mean that the instrument measures 
generalized anxiety; rather it may only 
reflect the fact that the nature of what
is being measured is not exactly clear.

This lack of conceptual and opera- 
tional consistency is eloquently noted 
and condemned by Cattell and Scheier 
(1958), who are particularly critical 
of the use of single measures of anxiety.


. . . each test in splendid operational insularity
spawns its own possibly unique
species of “anxiety,” while the researcher
whose “pet” test it is often [led] to
believe it is the only species of anxiety,
and soon proceeds to assume that all other
“anxieties” behave just as it does (p. 352).

Although one might argue with these
authors’ recommendation of factorial
studies as the optimal approach for
achieving conceptual clarity in the field,
their concern with the multiplicity of
concepts and instruments in anxiety research
is one in which all investigators
in this area must share.

To turn directly to this problem, according
to our conceptual analysis,
anxiety appears to have been regarded 
primarily as a response tendency and a 
motive rather than as a stimulus char- 
acteristic. It is our impression, more- 
over, that a great many measuring tech- 
niques deal with both aspects of anxiety 
at the same time, and, although it is 
difficult to separate the two concep- 
tually and operationally, we feel that it 
is important to attempt to do so. Thus 
a person characterized by anxiety as a 
response tendency would be one who 
brings to all situations as a dispositional 
quality an affective state of fearful or 
anxious expectation. Similarly, an in- 
dividual who reacts only to specific 
types of objects or situations with fear- 
fulness may be regarded as manifesting 
a specific type of anxiety as a response
tendency. In either case, anxiety is
reflected in part in the affective state of
the individual and in part in a variety 
of physiological and motor responses, all 
of which may affect the individual's 
performance in various activities or 
tasks. 

In contrast, when one deals with 
various patterns of response that serve 
the function of reducing, eliminating,
or avoiding anxiety, then, it seems to
us, one is viewing anxiety as a motive
rather than as a response tendency.
Many important kinds of adaptive as
well as maladaptive behavior are known 
to function in this manner, from simple 
flight or avoidance through subtler 
devices of inhibition and control to the 
more complex mechanisms of defense. 
Obviously, anxiety could not operate 
in this way as a motive unless it had 
also been present as a disturbing response
tendency. Nevertheless, two persons 
having similar levels of anxiety, operat- 
ing as a response tendency, may still 
be quite different in terms of the ex- 
tent to which anxiety operates as a 
motivating factor in their behavior. In 
our appraisal of anxiety in children, 
then, we would do well to ask not 
only “how anxious or fearful is this 
child?” but also “to what extent and in 
what way does his anxiety serve a 
motivating function in his behavior?” 
Most of our assessment techniques have 
tended to mix these two aspects of 
anxiety in a single measure, with more 
weight usually being given to the first 
than to the second, particularly in ob- 
jective tests. 

Once one accepts the notion that it is 
important to evaluate anxiety both as 
a motive and as a response tendency, 
one is faced with the rather formidable
prospect not only of evaluating the level
and type of anxiety present but also of 
appraising the adaptive mechanisms, 
controls, and defenses with which the 
individual handles his anxieties or fears. 
How essential this broader approach 
may be depends in large part upon the 
degree to which measures of anxiety 
as response tendency, expressed with- 
out reference to motivational com- 
ponents, do in fact show appropriate 
relationships to psychologically relevant 
criteria. 

Before turning to such questions, we 
must first examine the techniques and 
sources of data available for measuring 
individual differences, both in general 
and in focused types of anxiety. As be- 
fore, we shall confine ourselves pri- 
marily to those techniques that have 
been specifically designed to measure 
anxiety. Thus we eliminate from con- 
sideration more global measures dealing 
with such broad concepts as neu- 
roticism, security-insecurity, and ad- 
justment, which may reflect anxiety 
along with many other variables. 

The sources of data most commonly 
utilized for the measurement of anxiety 
in children seem to have been reports of 
behavior or attitude from the child him- 
self, or sometimes from an adult, and 
projective response behavior. Relatively 
little use has been made of direct ob- 
servations of behavior for arriving at 
quantitative indices of anxiety, although 
some techniques that utilize this source 
of information appear to have some 
merit for future work on the problem. 

Self-Reports of Behavior and Atti- 
tude. A great deal of concerted research 
effort has been directed toward the development
of questionnaire measures of
anxiety in children. Sarason and his
associates have been involved in a long 
program of research on test anxiety and 
its correlates in college students and, 
more recently, in children. As their 
principal measure of “test” anxiety, 
these investigators developed a 30-item 
“Yes-No” questionnaire, which is “con- 
cerned with attitudes toward and ex- 
periences in test and test-like situations” 
(Sarason et al., 1958, p. 105). For the 
most part, the questions focus upon 
anxiety as response tendency, since they 
ask about affective, physiological, and 
motoric manifestations of anxiety (e.g., 
Do you worry a lot before you take a 
test? When the teacher says she is 
going to find out how much you have 
learned, does your heart begin to beat 
faster? When . . . , do you sometimes 
feel that the hand you write with shakes 
a little?). The questions are read aloud 
by an examiner, and the test can be 
group-administered effectively to chil- 
dren as young as first graders. 

Sarason and his colleagues have ad- 
ministered this questionnaire to large 
numbers of school children from grades 
one to five, and they have diligently 
explored the relationships between their 
test-anxiety measures and a wide variety 
of psychologically relevant criteria, 
based upon both field and experimental 
studies. The results of these studies are 
summarized briefly in one of the more 
recent publications of this group 
(Sarnoff et al., 1959). The test has very 
satisfactory reliability (both split-half 
and test-retest), and the over-all results 
of the research done with it suggest, 
for the most part, that the questionnaire 
is tapping a psychologically meaningful
anxiety manifestation in elementary-school
children, although the relationships
with appropriate criteria are rarely
very large and not always as expected.
The test-anxiety measures (TA) show 
a low positive relationship to teachers’
anxiety ratings, and they tend to be 
correlated negatively with both measured
IQ and school achievement (Sarason
et al., 1958) as well as with performance
in experimental situations requiring
new learning (Waite et al.,
1958). In samples of both English and
American children, TA scores tend to 
increase from the first to the fourth 
grade. Also, English children had higher 
mean TA scores than American children,
although their general anxiety
levels (as measured with a short general
anxiety questionnaire) were equivalent, 
as predicted on the basis of the pre- 
sumably greater importance of school 
examinations for the young school child 
in England (Sarnoff et al., 1958). 

Some of the research on the Test- 
Anxiety scale illustrates rather complex 
problems of a general sort that tend to 
plague the investigator concerned with 
the appraisal of personality charac- 
teristics, such as anxiety, even with 
structured questionnaires. In a recently
reported study (Sarnoff et al., 1959) 
the test-anxiety scale was given to 
English children at the end of the 
fourth-grade year and then again the 
following February, a week before a 
presumably highly anxiety-arousing 
standard classroom examination (the 
“11+”). Instead of finding a rise in
test anxiety as the examination became 
imminent, the investigators found that 
the scores of both boys and girls 
dropped significantly. In their interpretation
of this puzzling result the
authors suggest that the drop in scores
might reflect an increase in the
“normal” defensiveness that ordinarily
tends to produce a drop in mean scores
upon retesting (Lighthall et al., 1960). 
If this interpretation is correct, the 
authors rightly point out, “we are 
obliged to acknowledge that, under 
certain circumstances, the TA scale may 
be measuring defensiveness as well as 
test anxiety.” 

It will be recalled that this is pre- 
cisely the kind of problem we referred 
to in our discussion of the importance 
of trying to deal separately with anxiety 
considered as response tendency and as 
motive. The experience of the Sarason 
group suggests that even when an instrument
is intended to deal primarily
with anxiety as response tendency (in 
the sense that it asks about feelings 
under various hypothetical conditions) 
the test responses may at the same time 
reflect various adaptive mechanisms that 
subjects invoke in coping with anxiety. 
This problem may have been one of the 
factors responsible in part, at least, for 
the absence of a significant relationship 
between TA and scores on the 11+ 
examination in England, in contrast with 
results obtained with American chil- 
dren. 

Problems such as response set and 
item characteristics, as factors determin- 
ing TA questionnaire responses, are 
reportedly being thoroughly analyzed 
by the Sarason group. One of their most 
consistent findings, which may be more 
thoroughly explained when these 
analyses are completed, is that girls get 
higher test-anxiety scores than boys 
in both England and America. The 
authors suggest that this may be due to
the greater social acceptability of expressions
of worry and concern from
girls than from boys. 

In summary, the bulk of the findings 
obtained with the test-anxiety ques- 
tionnaire suggest that this is a useful 
research instrument with reasonably 
good construct validity, dealing with a 
relatively specific and important type 
of anxiety manifestation in young 
school children. (Correlations between 
test anxiety and general anxiety are 
reported as about .55.) One of the most 
impressive features of the work of the 
Sarason group is the broad variety of 
psychologically relevant criteria against 
which the validity of the instrument is 
continuously being evaluated. One 
wonders, however, whether an exten- 
sive and productive research program 
of this sort might not have been even 
more fruitful had it been possible simul- 
taneously to study and validate several 
appraisal techniques representing differ- 
ent approaches to the measurement of 
anxiety, rather than placing major em- 
phasis upon a single approach repre- 
sented by the short questionnaire. 

Another rather widely used ques- 
tionnaire measure of children’s anxiety 
is the Children’s Manifest Anxiety Scale 
(CMAS), developed by Castaneda, 
McCandless, and Palermo (1956) for 
use primarily with fourth-, fifth-, and 
sixth-grade children. This 42-item ques- 
tionnaire represents a children’s form 
of the Taylor Manifest Anxiety Scale 
(Taylor, 1953), which in turn was de- 
rived from the Minnesota Multiphasic 
Personality Inventory. The questions 
deal with reported physiological and 
psychological concomitants of generalized
anxiety, worrisomeness, etc.
(e.g., I blush easily, My hands feel
sweaty, I get angry easily, I often do 
things I wish I had never done). It is 
of interest that this scale was de- 
veloped primarily to provide a measure 
of general drive level or strength of 
motivation in experimental studies on 
the relationship between drive strength 
and performance in children. 

Numerous studies have indicated that 
the CMAS has some interesting relations 
to criteria of various sorts. For example, 
as predicted on the basis of similar 
studies with adults, anxious children 
were found to perform less well than 
nonanxious children on relatively dif- 
ficult and complex learning tasks, al- 
though anxiety seems to facilitate certain 
kinds of simple learning (Castaneda, 
Palermo, and McCandless, 1956; 
Palermo et al., 1956). High anxious 
children tend to show more perceptual 
rigidity and greater speed of perceptual 
closure on experimental tasks than do 
low anxious children (Smock, 1958). 
Test scores also tend to be negatively 
correlated with measures of both IQ 
and school achievement, as well as 
sociometric standing (McCandless et
al., 1956a,b).

The test appears to have satisfactory 
test-retest reliability, correlations rang- 
ing between .70 and .94 over a period 
of one week (Castaneda, McCandless, 
and Palermo, 1956), and between .59 
and .91 over a one-month period 
(Palermo, 1959). As in the case of the 
test-anxiety scale developed by the 
Sarason group, girls tend to achieve 
higher CMAS scores than boys, but 
no increases in score with age over the 
fourth- to sixth-grade period were 
found. Although the authors have incorporated
a brief lie scale in the questionnaire,
problems of response set still
remain for further study. 
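
The reliability coefficients cited for these questionnaires are ordinarily product-moment correlations. The following is a minimal sketch, in Python, of how test-retest and split-half coefficients of this kind might be computed. The function names and the small data set are hypothetical and serve only for illustration; the odd-even split and the Spearman-Brown step-up are assumed here as common conventions and are not taken from the studies cited.

```python
from math import sqrt


def pearson_r(x, y):
    """Product-moment correlation between two equally long score lists."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two score lists of equal length (at least 2)")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined when a score list has no variance")
    return sxy / sqrt(sxx * syy)


def retest_reliability(first_scores, second_scores):
    """Test-retest reliability: correlate total scores from two administrations."""
    return pearson_r(first_scores, second_scores)


def spearman_brown(r_half):
    """Step a half-test correlation up to an estimate for the full-length test."""
    return 2 * r_half / (1 + r_half)


def split_half_reliability(item_responses):
    """Odd-even split-half reliability (assumed convention).

    item_responses holds one list per child of item scores (1 = "yes", 0 = "no").
    """
    odd_totals = [sum(row[0::2]) for row in item_responses]
    even_totals = [sum(row[1::2]) for row in item_responses]
    return spearman_brown(pearson_r(odd_totals, even_totals))


# Hypothetical total scores for four children tested twice.
print(round(retest_reliability([1, 2, 3, 4], [1, 3, 2, 4]), 2))  # prints 0.8
```

With real questionnaire data the investigator would of course need far more than four children before a coefficient of this kind could be interpreted.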

In general, the results obtained with 
the CMAS suggest that it is tapping 
meaningful anxiety manifestations in 
children with some validity, and it may 
therefore be used with profit as a research
instrument. Perhaps because of
the original interest in the scores as
measures of drive strength in experimental
studies, the criteria against which
the test has been validated do not
appear to have represented a very broad
range of theoretical constructs about 
anxiety as anxiety. Consequently, as is 
true of any instrument of this type, 
continuing construct validation of the 
test as a measure of anxiety is highly 
desirable. 

A final illustration of a promising 
questionnaire measure of a specific form
of anxiety is the Audience-Anxiety
Scale in the Children’s Audience-Sen- 
sitivity Inventory, described by Paivio, 
Baldwin, and Berger (1959). The 
Audience-Anxiety (AA) Scale consists 
of 16 questions designed specifically to 
evaluate the child’s predisposition to 
react with anxiety in audience situations
in which his performance is observable
by other individuals. When administered
to several hundred third-,
fourth-, and fifth-grade children, the
AA scale had a split-half reliability of
.80, whereas a six-month test-retest
comparison yielded a reliability of .54.
Encouraging support for the construct 
validity of this instrument comes from 
the significant negative relationships 
found between audience-anxiety meas- 
ures and ratings of children’s willingness 
to volunteer for public performance
in a summer camp “skit-night” program
(rs = —.35 for boys, —.45 for girls). 
It is of further interest to note that in 
this same study the test-anxiety ques- 
tionnaire of the Sarason group yielded 
correlations very similar to those just 
mentioned (—.36 and —.39), whereas 
the Children’s Manifest Anxiety Scale, 
which is a measure of generalized 
anxiety manifestations, had similar but 
somewhat lower correlations (—.29 
and —.26) with this particular criterion. 


These findings support the notion that 
relationships tend to be higher when 
both test and criterion measures are 
specifically designed or selected to deal 
with a designated type of behavior. 
That these three measures of anxiety 
have a good deal of common overlap, 
however, despite the correlational dif- 
ferences just mentioned, is revealed in 
the intercorrelations among them, based 
on 421 children in the third to tenth 
grades (AA correlates .74 with TA and 
.62 with CMAS; the latter two correlate 
.70).

The three questionnaires just dis- 
cussed appear to be promising instru- 
ments for obtaining measures of self- 
reported anxiety as response tendency. 
They have the advantage of being objectively
scorable and group administerable.
However, they share the problems
of response set and are dependent
upon the capacity and willingness of 
the subject to report accurately feel- 
ings and experiences presumably indica- 
tive of anxiety as an affective state with 
its physiological and motor concomi- 
tants. 

Reports of Behavior from Others. 
Turning from the child’s self-reports 
about anxiety reactions, we find that
investigators have sometimes been able
to obtain useful information about a
child’s anxieties from observations and
impressions of teachers, parents, or
other adults. Effective use of parental
reports on children’s specific fears is 
illustrated in the familiar work of 
Jersild and Holmes (1935), who asked 
parents to record the frequency of 
occurrence of various fear reactions 
during a period of twenty-one days. 
This type of approach, which asks for 
parental reports of specified behaviors 
over a period of time, might well be 
more fully explored as a means of 
providing measures of individual dif- 
ferences in anxiety level and manifesta- 
tion, although the investigator would 
need to be a bit wary about the re- 
liability and validity of such parental 
reports. 

The use of carefully constructed and 
administered rating scales frequently 
makes it possible to obtain valuable 
judgments on children’s personality 
characteristics from teachers or other 
adults who have opportunity for 
lengthy periods of observation and con- 
tact with the subjects. A good illustra- 
tion of the construction and applica- 
tion of such scales in the evaluation of 
the behavior of nursery-school children 
may be found in the previously men- 
tioned work of Conrad (1932, 1933) 
and Read (1940). Among the rating 
scales used by these investigators are 
several that deal with anxious behavior: 
(1) suspiciousness of people, (2) ap- 
prehensiveness, (3) fear in response to 
bullying, threatening commands, etc., 
(4) fear in nonsocial situations, and (5) 
elaboration or indirectness of response 
to fear. The last-named scale is of particular
interest, since it deals with a
motivational aspect of anxiety, that is, 
how the child adapts to fear, rather than 
with anxiety as a response tendency, 
which is essentially the object of the 
first four scales. The two extremes of 
this scale are defined as follows: 


1. Reacts to fears by a relatively indirect 
(compensatory or “sublimated”) response; 
e.g., by showing off in other directions;
by bravado; or by other similar subterfuge 
designed to conceal the fear from the 
child itself, and from others. . . . 7. Re- 
acts to fears by a simple, frank, direct 
response. Child either cries, whimpers, 
etc., or else directly resists the fear. Does 
not resort to attempts at concealment or 
“sublimation” of response (Read, p. 481). 


The scales are carefully defined in be- 
havioral terms and are accompanied 
by very detailed instructions to the 
teachers to provide a common basis for 
making judgments, factors which un- 
doubtedly contributed to the demon- 
strated effectiveness with which they 
can be used. These rating scales thus 
offer both specific techniques, which the 
investigator may find useful for ap- 
praising anxious behavior in nursery- 
school children, and suggestive models 
for the development and use of similar 
scales applicable to older children. 

Less elaborate scales than those just 
described have also been used with some 
effectiveness in evaluating anxious behavior
of second-to-fifth-grade children.
In the work of Sarason et al.
(1958), the investigators compared 
children’s test-anxiety scores with rat- 
ings made by teachers on 17 items 
describing various specific kinds of 
anxious behavior in the classroom situation
(e.g., “does the child exhibit
unwarranted fidgeting . . . squirming,
restless behavior . . . when called upon 
to recite in class”; “does the child work 
better in a situation in which he can 
take his time than in one in which there 
is time pressure?”). Although there is 
no report given of the reliability of the 
ratings thus obtained, the fact that they 
correlated significantly, although not 
substantially, with the questionnaire 
measures suggests the potential usefulness
of such judgments.

A final illustration of the use of re- 
ports about children’s anxiety from 
adults is found in the study of the 
audience-anxiety questionnaire (Paivio 
et al., 1959). These investigators took 
advantage of a naturally occurring 
situation in which boys and girls at a 
summer camp were asked to volunteer 
to perform publicly on a “skit night.” 
Camp counselors were asked to rate the 
children in terms of their willingness to 
volunteer for this public performance, 
and the measures thus obtained, even 
though relatively crude, correlated 
meaningfully with the questionnaire 
scores. 

In general, it is our feeling that the 
research worker would do well to 
exploit more fully, as a source of data 
on children’s anxious behavior, the con- 
sidered judgments and reports of ob- 
servers who have the opportunity of 
fairly long-term contact with the sub- 
jects over a variety of situations. Al- 
though there are important problems 
involved in eliciting reliable and valid 
information from this source, with 
proper training of judges and with ap- 
propriate instruments it should be pos- 
sible to utilize this source of information 
profitably to obtain data on anxiety
considered as both response tendency
and motive. 

Projective Responses. Among the 
most widely used techniques for ap- 
praising anxiety in children are the 
various global projective methods, 
which evolved as diagnostic testing pro- 
cedures in the clinical evaluation of in- 
dividual children and adults. Although 
such techniques involve a high degree 
of subjectivity and for the most part 
cannot yet be regarded as well-validated
quantitative measuring devices, they
do offer two potential advantages. First, 
because of the indirect nature of the 
approach, they presumably permit the 
child to express various kinds of anxiety 
manifestations that he might not reveal 
directly when questioned or observed. 
Second, the projective methods tend to 
focus attention upon anxiety as motive; 
that is, they are frequently concerned 
not only with the level and nature of 
the subject’s anxieties but also with the 
various ways in which the subject tends 
to cope with or respond to anxiety. 
Because of these characteristics, it is to 
be hoped that the gradual refinement of 
some of the projective techniques in 
the direction of greater objectivity and 
quantification, along with systematic 
validation studies, may eventually pro- 
duce highly useful methods for the 
measurement of anxiety as motive and 
as response tendency. 

There have already been some 
promising attempts to structure and
systematize projective techniques along 
the lines just indicated. Two of the 
studies already mentioned in the dis- 
cussion of aggression (Mussen and 
Naylor, 1954; Siegel, 1957) illustrate 
how one can derive from projective
techniques measures reflecting the degree
to which anxiety serves as an inhibiting
or control factor in the expression
of aggressive impulses. Mussen
and Naylor simply counted the number 
of times the “hero” in ten TAT stories 
was subjected to punishment “press” 
and used this as a measure of the sub- 
ject’s fear of punishment. It will be 
recalled that their prediction of overt 
aggressive behavior was improved when 
they considered not only the subject's 
level of aggressive motivation but also 
his anxiety about punishment. 
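
A frequency count of this kind is simple to automate once the stories have been coded. The following is a minimal sketch in Python, assuming that a scorer has already assigned press codes to the hero of each story; the function name, the code labels, and the sample codings are hypothetical and are not taken from Mussen and Naylor's scoring manual.

```python
def punishment_press_count(stories, code="punishment"):
    """Count how often the hero is subjected to punishment press.

    stories is a list with one entry per TAT story; each entry lists the
    press codes (hypothetical labels) a scorer assigned to the hero.
    """
    return sum(story.count(code) for story in stories)


# Hypothetical codings for three of a child's stories.
coded_stories = [
    ["punishment", "rejection"],
    [],
    ["punishment", "punishment"],
]
print(punishment_press_count(coded_stories))  # prints 3
```

The count is only as dependable as the coding that precedes it, so interjudge agreement on the press codes would need to be established before such totals are used.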

Siegel used the controlled-play situa- 
tion to obtain measures of aggression 
and indices of anxiety and guilt during 
the play of pairs of children. Observers
made ratings every twenty seconds in 
which their judgments of anxiety, in- 
hibition, or guilt were guided by a list 
of specific behavioral signs adapted 
from the work of Ryder (1954). The 
list included “escape behavior; tense, 
stiff, or awkward movements; incom- 
pleted or blocked aggressive acts; self- 
directed hostility; moralistic comments; 
nervous mannerisms, tics, thumb-suck- 
ing; tense facial expression . . . halting 
play at sound of footsteps at door” 
(p. 375). Siegel was primarily interested 
in changes in children’s play from a 
first- to a second-play session a week 
later and found significantly less aggres- 
sion as well as less anxiety in the second 
session than in the first, which suggests 
that her anxiety measures were reflect- 
ing concern with expressed aggression. 
It is our feeling that methods of this 
sort, which concentrate upon specified 
behavioral signs of anxiety in a con- 
trolled situation, are of potentially great 
promise and should be more fully exploited
as techniques for the measurement
of anxiety.

Apart from technical considerations 
of method as such, the two studies just 
mentioned illustrate rather nicely, we 
think, the kind of concern with anxiety 
as motive that we have been stressing
as an important conceptual approach to 
the measurement of anxiety. Both 
studies are concerned with the anxiety 
that is associated with the expression 
of a particular motive (i.e., aggression) 
and the possible controls or inhibitions 
that such anxiety tends to produce in 
the behavior of the individual. It is this 
sort of information about the manner in 
which anxiety motivates or influences 
the adaptive responses of the individual 
that we believe must be assessed more 
effectively, along with the individual’s 
anxiety level or anxiety proneness. 

A final illustration of a projective 
technique that was specifically struc- 
tured to yield measures of anxiety in 
young children may be found in the 
interesting pictorial method described 
by Dorkay and Amen (1947) and 
previously investigated by Temple and 
Amen (1944). The test consists of 14 
pictures portraying a child in familiar 
situations of various kinds, either with 
another child or an adult. The face 
of the child in the picture is left blank, 
and at the top of the picture two heads 
are drawn, one with a happy and one 
with a sad expression. The subject is 
asked in each case, “What kind of a 
face do you suppose this child will have, 
a happy face or a sad face?” Although 
anxiety scores based on nursery-school 
children’s choices had low test-retest 
reliabilities over a three-month period, 
correlations with teachers’ ratings of
anxiety were promising (.28 for original
and .59 for retest scores). Structured 
projective techniques of this sort seem 
worth exploring further for appraising 
anxiety in very young children. 

Direct Observations of Behavior. It 
is interesting to note that very little 
use has been made of systematic ob- 
servations of behavior in natural settings 
as a basis for measuring anxiety in chil- 
dren. In part, this reflects the nature of 
the anxiety construct, which is difficult 
to operationalize in terms of a narrow 
range of behavioral manifestations easily
amenable to direct observation. It is
obvious that if one child in the class-
room manifests anxiety by withdrawal,
another by hyperactivity, and another
by immature attention-seeking, it is
difficult to make up a set of categories
that would apply equally to all children.

Observation of children’s behavior in 
contrived situations is a source of data 
that has been used with considerable 
profit. Siegel’s study of aggression and
anxiety by means of a controlled play 
technique (1957), which we discussed 
as a projective method, may be regarded 
as a good illustration of direct be- 
havioral observation in a contrived 
setting. A considerably greater degree 
of situational structuring is found in 
the direct confrontation techniques 
used by Jersild and Holmes (1935) in 
their familiar studies of children’s fears. 
Each child was exposed to a variety of 
potentially anxiety-arousing stimuli
(e.g., loud sound, height, snake), and 
his reactions were carefully observed. 
Although these methods were not de- 
veloped to provide measures of in- 
dividual differences in anxiety-prone- 
ness and although formidable difficulties 
are involved in contriving such po-
tentially fear-producing situations that
are appropriate for various age levels
and can be presented to children un-
objectionably, it appears to us that a
more intensive exploitation of this kind 
of approach might well be worthwhile. 
Even though such techniques may not 
be practical for obtaining routine meas- 
ures of anxiety, they might well serve 
as supplementary measures or as criteria 
against which to evaluate the construct 
validity of operationally more con- 
venient appraisal techniques. Methods 
of this sort may have the added ad-
vantage of permitting more systematic
study of the motivational implications
of anxiety, in terms of the nature of the
individual’s adaptive responses to
anxiety-arousing stimuli.

An illustration of a contrived situa- 
tion of the kind just discussed may be 
found in the work of Levin, Baldwin, 
Gallwey, and Paivio (1959). These 
investigators, who were interested in 
“public” and “private” story-telling 
performances by 10-to-12-year-old chil- 
dren, used the number of errors in the 
child’s speech when performing before 
an audience of six adults as one measure 
of “audience stress” produced by the 
situation. The relationships between 
these behavioral indices of a specific 
type of anxiety and questionnaire meas- 
ures of “self-consciousness” (appre- 
hensiveness about one’s evaluation by 
others) and “exhibitionism” (positive 
attitude toward public exposure of the 
self or a self product) are such as to 
encourage further development of con- 
trived situations to provide behavioral 
measures of particular anxiety mani- 
festations. 

Closely related to behavioral observa-
tions of anxiety in contrived situations
is the use of performance on “objective” 
tests of various sorts. The work of 
Cattell and his associates (1955, 1956) 
provides one illustration of this ap- 
proach, which has potential but not yet 
demonstrated validity. These workers 
have made available for research use 
experimental batteries of objective tests 
to measure a variety of personality 
factors in children (11 to 16) and adults, 
including a set of ten intended to 
provide a measure of “anxiety to 
achieve.” The anxiety tests are pri- 
marily paper-and-pencil techniques, 
yielding scores such as susceptibility to 
annoyance, honesty in admitting com- 
mon frailties, acceptance of good 
aphorisms, and emotionality of com- 
ment. Although the appropriateness of 
these particular tests for measuring 
anxiety in children is open to serious 
question, in part because they are based 
almost entirely on factor analyses with 
adult subjects, the methodological goal 
toward which the tests were aimed is 
an important one, that is, the measure- 
ment of anxiety by means of objective 
instruments requiring neither deliberate 
self-evaluation nor projective responses. 
We believe that the likelihood of this 
goal being attained will depend on the 
degree to which the objective tests 
are based upon significant theoretical 
constructs and hypotheses about the 
nature of anxiety and its manifestations. 


Evaluative Summary 


Our review of techniques for the 
measurement of anxiety in children 
leaves us in much the same position as
we found ourselves after evaluating
procedures for the appraisal of aggres-
sion. Although some promising tech-
niques have been and are being de-
veloped, there is relatively little avail- 
able in the way of standardized measur- 
ing instruments possessing a high de- 
gree of construct validity. To a large 
extent, the measurement difficulties 
reflect the conceptual and theoretical 
problems that face anyone attempting to 
deal systematically with the objective 
appraisal of anxiety. There is, for ex-
ample, the important question of dis-
tinguishing between generalized, or 
“free-floating,” anxiety and more 
specific anxieties or fears. Another 
fundamental problem is that of dif- 
ferentiating between level of anxiety 
(viewed as a response tendency) and 
typical modes of coping with anxiety 
or characteristic effects of anxiety on 
adaptive behavior (anxiety viewed as 
motive). It is virtually impossible, we 
believe, to assess level of anxiety with- 
out at the same time dealing with be- 
havior that to some extent reflects the 
nature of the person’s controls over 
anxiety. Even if it were possible to 
measure pure level of anxiety, which 
some of our questionnaires attempt to 
do, one wonders how meaningful such 
measures can be unless one also knows 
something about the individual’s modes 
of adaptation or inhibition.

It is our belief that separate evalua- 
tions providing both kinds of informa- 
tion will be needed in order to make 
valid predictions about behavior. For
example, some questionnaire measures
of anxiety as response tendency cor-
relate negatively with examination
grades, yet certain very anxious chil-
dren do very well on examinations. The
prediction of behavior for these chil- 
dren might be considerably improved 
if we knew how the child coped with 
anxiety in stressful situations. Such in- 
formation might be provided by ob- 
servations of the child in contrived or 
natural situations, by carefully obtained 
ratings from teachers, etc. Similarly,
as the complex relationships between 
anxiety and performance on experi- 
mental tasks of learning, perception, and 
problem solving become increasingly 
clarified, it may well be that such tasks 
may themselves provide valuable sup- 
plementary indices of anxiety. 

The task that we have posed is ob- 
viously not a simple one. We feel, 
however, that the problem can be 
approached fruitfully by means of a 
strategy that involves the simultaneous 
application of a selected variety of the 
kinds of methods previously discussed, 
with the aim of evaluating the level and 
specific nature of the child’s anxiety as 
well as how he adapts to it and how it 
influences his performance. 


MOTIVATION FOR ACHIEVEMENT


The last decade has seen a resurgence 
in American psychology in the study of 
task-oriented behavior and motivation. 
This resurgence has been spearheaded 
primarily by the work of McClelland 
and his colleagues (McClelland et al., 
1953; Atkinson, 1958) in the develop- 
ment of methods for the assessment of 
human motives by the analysis of 
fantasy productions. The motive studied 
most intensively by such methods, and 
the one for which special techniques
have been devised for use with children, 
is need achievement. 

Any theoretical analysis of this vari- 
able is complicated by the studied 
avoidance in the most recent publica-
tions of McClelland and his colleagues
of any attempt to define the concept of 
motive in a priori terms. We quote 
McClelland: 

But what is a motive? Is it not desirable
to define what is being measured before
trying to discuss ways of measuring it?
There are theorists who have proceeded in
this way. . . . The approach to be fol-
lowed here is more empirical, less prejudi-
cial in advance as to what the charac-
teristics of a motive will turn out to be.
At the outset it takes no position as to
whether, for example . . . motives drive
or direct, or whether they are temporary
states or enduring dispositions of the or-
ganism. To a certain extent, such questions
can be answered only after one has a
measure of motivation, not before (1958a,
p. 8).


As one of the present authors has 
written elsewhere (Bronfenbrenner, 
1960a), this view placed McClelland in 
a position analogous to that of psy- 
chologists who define intelligence as 
“that which is measured by an intel- 
ligence test”; a motive thus becomes 
whatever is measured by the methods 
that McClelland and his colleagues em- 
ploy. Accordingly, before we can 
attempt a theoretical analysis, we must 
reverse our usual procedure and ex- 
amine the techniques utilized for meas-
uring n achievement in children. Three
such procedures, utilizing projective 
materials, have been developed: the first 
employs verbal cues instead of pictures 
as stimuli (Lowell, 1950; Winterbottom, 
1953; 1958), the second relies on the
analysis of graphic expression (Aronson, 
1958; McClelland, 1958), and the third 
involves an adaptation of a doll-play 
technique (Melville, 1959). 

The Method of Verbal Story Cues.
Faced with the fact that young children 
often fail to respond satisfactorily when 
pictures are used to elicit thematic ap- 
perception (as in the standard McClel- 
land procedure), Lowell (1950) in- 
troduced the use of verbal cues as 
stimuli for evoking imaginative stories 
from a sample of Navaho ninth-graders. 
The same procedure was adopted, with 
minor modifications, by Winterbottom 
(1953; 1958) in what to date is the 
most comprehensive investigation of 
n achievement in young children. The 
subjects, 29 8-year-old, middle-class 
boys, were asked to make up stories in 
response to an “idea” presented orally 
by the examiner and consisting of such 
brief verbal cues as “a mother and her 
son,” “a boy who has just left his 
house,” “brothers and sisters playing, 
one is a little ahead,” and “a young man 
with his head resting on his hands.”
Following the practice commonly em- 
ployed for validation by the McClelland 
group, Winterbottom obtained stories 
under both “relaxed” and “achieve- 
ment-oriented” conditions. In the first 
the task was presented as a game, with 
every effort being made to put the 
subject at ease and to reduce as much 
as possible any “test atmosphere.” Fol- 
lowing the “relaxed” stories, the child 
was given a “puzzle test” (a form 
board), which would “tell how smart 
he was”; he was told further that he 
should try his best because he was to be 
compared with others in his class. After 
three minutes of work on the test, a
second set of stories was collected. Both 
sets were scored according to the 
achievement criteria developed by 
McClelland et al. (1953) and Atkinson
(1958). The fundamental requirement
is evidence in thematic context of 
“concern over competition with a 
standard of excellence.” Such concern 
may be reflected in references to needs, 
goals, obstacles, positive or negative 
affective states, instrumental activity, 
etc. Each of these features receives unit 
weight in each story, with a possible 
range of 13 points per story. 

Results. The interjudge reliability for 
total score based on stories written by 
the 8-year-old boys was highly satis-
factory (r = .92). McClelland and his
co-workers regard an increase in score 
from the relaxed to the aroused condi- 
tion as primary evidence for the validity 
of the method. This criterion is met in 
the present instance, although, as 
Winterbottom acknowledges, some 
question can be raised, since the two 
sets of verbal cues employed had not 
been shown to be equivalent. Further 
evidence for construct validity is found 
in correlations with presumed ante- 
cedent and consequent measures. Thus, 
when cases were divided at the median 
into high and low groups, mothers of 
high need achievers reported signifi- 
cantly more demands for independent 
accomplishment before the age of 8 than 
did mothers of low need achievers; the 
former also evaluated their children’s 
accomplishments more favorably and 
used physical expressions of affection to 
reward achievement. As for the chil- 
dren themselves, high need achievers 
worked more independently and per-
sistently in the testing situation and
were rated by teachers as trying harder,
taking more pleasure in success, more
independent, and more popular. Similar
correlates of high n achievement are
reported in a recent study by Rosen and
D’Andrade (1959) with a sample of
9-to-11-year-old boys. The fact that
these investigators successfully used the
usual TAT procedures rather than
verbal story cues suggests that the
picture method for measuring n
achievement may be applicable with
boys at this and perhaps even younger
age levels.

The impact of these impressive find- 
ings is tempered somewhat by the re- 
sults of a follow-up study recently com- 
pleted by Feld (1959), who tracked 
down and retested 17 of Winter- 
bottom’s original cases six to eight years 
later. Unfortunately, the retesting was 
done not with the method of verbal 
cues employed originally but with TAT 
pictures as stimuli. The correlation with 
the boys’ n achievement scores obtained
earlier was significant only under the 
achievement-oriented conditions. The 
obtained r of .38 “compares favorably 
with test-retest reliabilities previously 
reported for much shorter time in- 
tervals” (Chapter 3, p. 4). The measures 
of need achievement at adolescence, 
however, failed to show significant re- 
lationships either with the original, 
retrospective, or current reports of 
mother’s behavior toward the child. 
That the mother’s behavior reported six
years earlier has relevance to some pres-
ent characteristics of her son is in-
dicated by the high correlation (r =
-.70) between independence training
before the age of 8 and the boys’ test
anxiety as measured in adolescence.

Problems of Validity. With these
methods and results in mind, we may 
now return to the question of just what
is being measured by McClelland’s
technique and Lowell and Winter-
bottom’s adaptation of it. The fact,
reported by Winterbottom and Rosen
and D’Andrade, that scores derived
from story protocols do predict con- 
current achievement-oriented behavior 
in preadolescent boys is reassuring. It 
is unfortunate that Feld’s study did not 
include measures of performance at 
adolescence that could serve as criteria 
for evaluating the predictive power of 
both the earlier and current measures 
of n achievement. But, in any event, 
such correlations with external criteria 
are not in themselves sufficient to es- 
tablish the validity of the method. To 
begin with, there is the question of 
specificity: to what extent may the
technique be measuring variables other
than a disposition to achieve? The ques-
tion is raised in part by work done by
McClelland and his followers subse- 
quent to the completion of Winter- 
bottom’s study. Thus papers included 
in Atkinson’s compendium (1958) call 
attention to the influence of protocol 
length, sequence of stimuli, and choice 
of pictures as factors affecting the n
achievement score. In addition, a num- 
ber of the studies point to the im- 
portance of distinguishing between fear 
of failure versus hope of success as 
components in the conventional n
achievement index. The issue of ex- 
traneous variables, however, can be 
raised on a still broader level, especially
when measures of this sort are being 
applied to children. To what extent, 
for example, are individual differences 
in score a function of variation not in 
motives but in abilities? Apart from the
influence of general intelligence, there 
are the more specific developmental 
phenomena of progressive differentia- 
tion in the capacity for time perspec- 
tive, conception of causal relationships, 
attribution of intent, etc. All of these 
abilities would appear to be involved 
in the production of thematic content 
scorable for n achievement.

Finally, even though the issue may 
not be relevant for McClelland, we must 
pose the question of the extent to which 
the n achievement score measures what,
in our terminology, are response tend- 
encies rather than motives. For example, 
as one of the present authors has pointed 
out elsewhere (Bronfenbrenner, 1960a), 
one could argue that the effect of the 
arousal procedures employed in the 
McClelland technique is to make 
certain experiences cognitively salient. 
To push the argument one step further, 
perhaps fantasy materials predict be- 
havior principally to the extent that 
they are autobiographical; e.g., the child 
who has succeeded in the past is more 
likely to tell stories about success and 
to succeed in the immediate future. 
Alternative hypotheses such as these 
once again call attention to possible pit- 
falls in the current tendency in psy- 
chological research to interpret fantasy 
materials solely or primarily in motiva- 
tional terms. 

The Method of Graphic Expression. 
Questions of this same kind can be 
raised even more forcefully with respect
to the second technique for measuring
need achievement adapted for use with
children. This is the method of graphic
expression developed by Aronson 
(1958). Using two abstract designs con- 
sisting of children’s scribble patterns, 
this investigator sought “to discover 
empirical relationships between n
achievement and various modes of 
graphic expression” (p. 252). There 
were no a priori hypotheses. The 
designs were exposed tachistoscopically 
to male college students for whom n
achievement scores had previously been 
obtained with the conventional TAT 
procedure. Immediately after exposure, 
students were asked to reproduce what 
they had seen. Cross-validation with 
four additional samples of college stu- 
dents revealed five major features of 
graphic expression distinguishing high 
from low n achievers: the high n
achievers used more discrete versus 
fuzzy, overlaid lines, left less unused 
space, preferred diagonal configurations 
to straight ones, and drew more S- 
shaped figures and fewer multiwave 
formations. 

Aronson cites as one of the primary 
virtues of his technique the fact that 
it is applicable to young “preliterate” 
children. Acknowledging that “it is a 
very large jump indeed” from male 
college students to 5-year-olds, McClel-
land (1958b) employed Aronson’s tech-
nique, with some modifications, in a
study of motivational correlates of risk- 
taking in 26 kindergarten and 32 third- 
grade children. No breakdown on sex 
is given—a surprising omission in view 
of the admitted inapplicability of Mc-
Clelland’s technique with female sub-
jects (McClelland et al., 1953, p. 173).
The stimulus card was presented for
three seconds, and children were asked
to “make scribbles like these; if you
can’t remember all of them, make up
your own.” Since children drew far 
fewer identifiable forms than adults, 
the last three of the five criteria were 
combined into a single composite “form 
score.” When applied to the drawings 
for Aronson’s original college sample, 
this new score yielded a correlation of 
.36 with the original measure of graphic
expression, a result that indicates very 
little overlap between the two measures. 
Reliability across two designs was .38 
for the 9-year-old group but only .13 
for the kindergarteners. McClelland 
believes that the latter value is “a seri- 
ous underestimate of the true correla- 
tion caused by the coarse grouping of 
results and also by the two deviant 
cases in a very small sample” (p. 313).
Although he goes to some length to 
support his argument that the true re- 
liability of the doodle measure of n 
achievement “is somewhere around .40,” 
the present authors remain uncon- 
vinced. 

The evidence offered for validity is 
little more encouraging. The argument 
that “the doodle measure is reflecting 
n achievement, since it increases with 
age” arouses more doubts than it allays, 
since it raises once again the question 
of the influence on test performance of 
differential capacities for form percep- 
tion, motor coordination, spatial organi- 
zation, and the like. The fact that the 
doodle measure of n achievement
showed a correlation of only .08 with
IQ for the third-grade sample is some-
what reassuring, although it may be at-
tenuated by the low reliability of the 
method. Unfortunately, IQ scores were 
not available for the more crucial kin- 
dergarten group. 
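
The attenuation mentioned above can be made concrete with the classical correction for attenuation, which divides an observed correlation by the square root of the product of the two reliabilities. The sketch below is illustrative only. The .08 and .38 figures are taken from the text (the .38 being the two-design reliability reported for the older group), whereas the IQ reliability of .90 and the function name are assumptions introduced here.

```python
import math


def disattenuate(r_observed, reliability_x, reliability_y):
    """Classical correction for attenuation: r_xy / sqrt(r_xx * r_yy)."""
    if not (0 < reliability_x <= 1 and 0 < reliability_y <= 1):
        raise ValueError("reliabilities must lie in the interval (0, 1]")
    return r_observed / math.sqrt(reliability_x * reliability_y)


r_doodle_iq = 0.08         # observed correlation reported in the text
doodle_reliability = 0.38  # two-design reliability reported in the text
iq_reliability = 0.90      # ASSUMED value, not given in the text

corrected = disattenuate(r_doodle_iq, doodle_reliability, iq_reliability)
print(round(corrected, 2))  # about 0.14
```

Under these assumptions the corrected value remains small, which suggests that low reliability alone would not hide a substantial relationship with IQ; a different assumed IQ reliability would change the figure only slightly.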

Similar considerations apply to what 
McClelland regards as the most impor- 
tant evidence for the validity of the 
technique, the relationship between the 
doodle measure of need achievement 
and risk taking. McClelland had hy- 
pothesized that children high in need 
achievement would prefer to take mod- 
erate risks as opposed to long shots or 
sure bets. This prediction was substan- 
tially confirmed in both age groups. 
Again the question may be raised 
whether factors other than n achieve-
ment reflected in the index of graphic
expression may account for the ob- 
tained relationship. Consider, for ex- 
ample, the substantial research evidence, 
noted by Atkinson (1958, p. 337), to 
the effect that anxious persons tend to 
show extremes in level of aspiration, 
setting their goals either very high or
very low. Atkinson explains this highly
reliable finding as a reflection of dif-
ferences in n achievement, on the as-
sumption that nonanxious “normals are
relatively more positive in their moti- 
vation in achievement-related situa- 
tions” (p. 337). An equally plausible 
alternative hypothesis explains this same 
difference as a direct function of anx- 
iety. It is noteworthy in this connec- 
tion that graphic features such as those 
scored by Aronson as uniquely relevant 
to low n achievement are interpreted in
the Bender-Gestalt test, a task similarly
involving reproduction of abstract de-
signs, as indices of anxiety level and
immaturity (Pascal and Suttell, 1951). 

The Measurement of Achievement 
Motivation through Doll Play. Mel- 
ville (1959) has recently adapted Sears’ 
method of doll-play analysis (Sears, 
1951) to provide measures of achieve-
ment fantasy. Using a doll house con-
structed to simulate the classroom of a
modern elementary school, she devised 
reliable scoring procedures for a num- 
ber of variables, two of which are pre- 
sumed to be directly related to achieve- 
ment: 

Work routine, thematic, which must be 
verbalized or clearly indicated. The 
teacher may be the agent, “The teacher 
says ‘now do your number work’”; a 
child may be the agent of such actions as 
“The boy is doing his seat work,” “This 
little girl is having her reading turn.” The 
principal or parent may come to school to 
see the children do their work. For pur- 
poses of this study with young children, 
“work” may be “choosing time,” organ- 
ized games, painting, as well as the more 
academic work of reading and number 
work.

Achievement. Someone expends effort, 
evaluates, compares school work with 
some standard of quality. “The boy is 
working hard on his seat work”; “The 
principal comes over and looks at the kids’ 
work to see how good they are doing”
(pp. 25-26).

For a sample of 48 first and second
graders, percentages of agreement by
two observers, who classified behavioral 
units into seven different categories, 
ranged from 80 to 94. To correct for 
individual differences in total produc- 
tivity, the score for each category was 
expressed as a fraction of the total num- 
ber of behavioral units produced by the 
child. Results of the technique were 
validated against systematic observa- 
tions of the children’s school behavior
over a five-day period. In each grade 
youngsters were divided into upper and 
lower thirds, on the basis of percentage 
of time spent in “industrious” behavior. 
These “high” and “low” groups were 
then compared for differences in fan-
tasy scores. Although both the work-
routine and achievement categories
from the play session showed differ- 
ences in the predicted direction, only 
the work routine was reliable. The 
achievement score also yielded positive 
but nonsignificant relationships with 
teacher judgments of motive strength 
but gave inconsistent results in predict- 
ing achievement-test scores when intel- 
ligence was held constant. 
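
The two computations described above, namely expressing each category as a fraction of the child's total behavioral units and computing percentage agreement between two observers, can be sketched as follows. This is a minimal illustration and not Melville's actual procedure; the category labels and the sample codings are hypothetical.

```python
from collections import Counter


def category_proportions(coded_units):
    """Express each category as a fraction of all units a child produced."""
    counts = Counter(coded_units)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {category: n / total for category, n in counts.items()}


def percent_agreement(codes_a, codes_b):
    """Percentage of units that two observers placed in the same category."""
    if len(codes_a) != len(codes_b) or not codes_a:
        raise ValueError("both observers must code the same non-empty units")
    matches = sum(a == b for a, b in zip(codes_a, codes_b))
    return 100.0 * matches / len(codes_a)


# Hypothetical codings of one child's doll-play units.
child_units = ["work_routine", "achievement", "other", "work_routine"]
proportions = category_proportions(child_units)

# Hypothetical codings of the same five units by two observers.
observer_1 = ["work_routine", "achievement", "other", "other", "achievement"]
observer_2 = ["work_routine", "achievement", "other", "other", "work_routine"]
agreement = percent_agreement(observer_1, observer_2)
```

Dividing by the child's own total keeps a talkative child from receiving a high achievement score merely by producing more units of every kind.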

In one respect, however, the high 
and low industry groups did exhibit 
reliable differences associated with the 
achievement-fantasy score. Over the 
three play sessions, the high group 
showed a significant increase in achieve- 
ment fantasy, whereas the low group 
did not. Also, when in the last session 
the children were given achievement- 
oriented comments (e.g., “I'll bet that 
teacher wants those children to do their 
very best work.”), only the high group 
showed a significant rise in achieve- 
ment fantasy. These findings led Mel-
ville to conclude, “. . . it is not in
mere use of the achievement category,
but rather in . . . changes in the use
of this category from session to session 
and with additional achievement stimu- 
lation that one should seek to measure 
an achievement motive in this kind of 
fantasy setting” (p. 51). 

Melville’s findings once again raise 
the question of the extent to which 
measures of the disposition to achieve
reflect what we have called response 
tendencies rather than goal-oriented 
motives. Thus, particularly in children, 
one can conceive of a characteristic 
way of behaving, involving industry 
and persistence, that is not necessarily 
oriented toward attaining a particular 
end or standard of performance. In- 
deed, to judge from the name that Mel- 
ville applies to her index of overt be-
havior (“industriousness”) and from its
operational definitions, this kind of re-
sponse tendency would seem to con-
stitute a major component of the cri-
terion against which the fantasy meas-
ures were validated. Consistent with
this view is the finding that differences 
in fantasy between high and low in- 
dustry groups were greater for the 
work-routine category, which empha- 
sized being busy, than for the achieve- 
ment category, which calls for activity 
involving comparison with “some 
standard of quality.” 

How might one distinguish a goal- 
oriented motive from a generalized re- 
sponse tendency? One possible ap- 
proach is suggested by Melville's find- 
ing that although the high and low 
groups did not differ in over-all achieve- 
ment score the high group showed 
greater increases from situation to sit- 
uation, especially in response to achieve- 
ment-oriented comments from the ex- 
perimenter. Presumably, a child with a 
strong response tendency toward hard 
work and persistence would tend to 
behave in the same way in every situa- 
tion. One who is motivated to achieve, 
however, would become diligent pri- 
marily in those situations that call for 
the attainment of a goal or a high stand-
ard of performance. Indeed, taking a
leaf from the work of both Melville and
McClelland, one might argue that the
most appropriate index of an achieve-
ment motive, as distinguished from a
response tendency, would be the in- 
crease in achievement imagery from the 
“relaxed” to the “aroused” condition. 
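
The index proposed above, the rise in achievement imagery from the relaxed to the aroused condition, might be computed along the following lines. The sketch assumes that each child has a list of per-story scores under each condition; the function name and all scores are hypothetical and serve only to show how a uniformly high scorer differs from one who responds to arousal.

```python
from statistics import fmean


def arousal_gain(relaxed_scores, aroused_scores):
    """Mean per-story score under arousal minus mean under relaxation."""
    return fmean(aroused_scores) - fmean(relaxed_scores)


# Hypothetical child who scores high in both conditions
# (the pattern expected of a generalized response tendency).
steady_gain = arousal_gain([4, 4, 4], [4, 4, 4])

# Hypothetical child whose imagery rises only when achievement is stressed
# (the pattern expected of a goal-oriented motive).
responsive_gain = arousal_gain([1, 1, 2], [4, 5, 3])

print(steady_gain, round(responsive_gain, 2))
```

On a conventional total score the first child would rank higher, whereas on the gain index the second child does, which is the distinction the argument turns on.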

This line of reasoning is strikingly 
consistent with the theoretical model 
developed in a series of papers by At- 
kinson (1958) in his attempt to recon- 
cile the paradoxical findings obtained 
in studies of n achievement (as meas-
ured from the TAT) and level of as-
piration. Atkinson argues convincingly 
that scores from level-of-aspiration ex- 
periments cannot be interpreted as 
measures of motive strength. Rather 
they are resultants of an interaction of 
three factors—motive strength, expec- 
tancy (defined as the subjective prob- 
ability of success or failure), and incen- 
tive [“the magnitude of the reward or 
potential satisfaction should the ex- 
pected consequence occur” (p. 288)]. 
He proposes further that the strength 
of the achievement motive is reflected 
in the steepness of the incentive gradi- 
ent between success and failure; per- 
sons highly motivated to achieve per- 
ceive a much steeper affective or choice 
gradient than those with low motiva-
tion. Atkinson suggests a number of
experimental procedures for measuring 
the slope of this gradient, most of these 
techniques being adaptations of meth- 
ods utilized by the McClelland group 
for validating the TAT measure of n 
achievement. The possibility that such 
direct behavioral methods might prove 
superior to the now conventional pro-
jective techniques is not considered.
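
One way to picture the three-factor interaction is a simple multiplicative sketch. The text above says only that the three factors interact; the product rule and the setting of incentive equal to one minus the probability of success are assumptions adopted here for illustration, as are the numerical motive strengths.

```python
def approach_tendency(motive, p_success):
    """Motive x expectancy x incentive, with incentive ASSUMED to be 1 - p."""
    incentive = 1.0 - p_success
    return motive * p_success * incentive


probabilities = [0.1, 0.3, 0.5, 0.7, 0.9]  # hypothetical task difficulties
high_motive = [approach_tendency(2.0, p) for p in probabilities]
low_motive = [approach_tendency(1.0, p) for p in probabilities]

# Task difficulty at which the tendency is strongest for the high scorer.
preferred = probabilities[high_motive.index(max(high_motive))]

# Range of the curve: a rough stand-in for the steepness of the gradient.
high_spread = max(high_motive) - min(high_motive)
low_spread = max(low_motive) - min(low_motive)
```

Under these assumptions the tendency is strongest for tasks of intermediate difficulty, and the curve is steeper for the person with the stronger motive, which accords with the preference for moderate risks reported earlier for children high in need achievement.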

Toward Behavioral and Experimen- 
tal Measures of Achievement Motiva- 
tion. Yet a number of recent researches 
with children suggest that this possi- 
bility offers considerable promise. We 
have already mentioned the observa- 
tional measures of “achievement motive 
strength” utilized by Melville for vali- 
dating her doll-play technique. These 
measures are being developed in a com- 
prehensive program of research on 
“achievement and self-esteem motiva- 
tion” in children being directed by 
Pauline Sears at Stanford University 
(Sears, 1957). Although a detailed ac- 
count of these procedures is yet to be 
published, preliminary data cited by 
Melville indicate highly satisfactory in-
terobserver reliability (r’s ranging from
.67 to .90 in three first- and second-
grade classrooms) for samples taken
over a five-day period. 

A less structured approach based on 
summary ratings as against direct ob- 
servation is represented by the work 
of Beller (1957), who developed a series 
of rating scales for use by teachers in 
rating “autonomous achievement striv- 
ing” in children 2 to 6 years of age. 
Specific variables rated include satis- 
faction derived from work, carrying 
out of routine tasks, overcoming ob- 
stacles, taking initiative, and comple- 
tion of activity. Interjudge reliabilities 
for the five subscales range from .67 
to .80. The ratings were substantially 
independent of similarly constructed 
ratings of dependency. 

As Sears notes in the initial publica- 
tion on her research (1957), such natu-
ralistic observational methods have
practical and theoretical limitations. 
The methods are expensive both in 
time and money and yield measures 
that tend to be multidimensional. Nev- 
ertheless, they provide an appropriate 
point of departure for the clarification 
of concepts and the subsequent devel- 
opment of more economical and rigor- 
ous experimental techniques. 

Indeed, the development of such 
techniques is already under way. To 
begin with, as one of the present au- 
thors has noted elsewhere (Bronfen- 
brenner, 1960a), a number of the ex- 
perimental procedures utilized by Mc- 
Clelland and his colleagues as validating 
criteria for fantasy measures offer 
promise as indices of motives in their 
own right. The manner in which such 
procedures can be adapted for use with 
children is well illustrated in the ex- 
periments of Sears and Levin (1957) 
on level of aspiration. These investiga- 
tors set for themselves the goal of de- 
veloping six tasks, each of which would 
involve graded levels of difficulty, be 
relatively unfamiliar, and yet be suffi- 
ciently simple, absorbing, and challeng- 
ing to yield valid and reliable results 
with preschool children. Their findings 
indicate that, by and large, the experi- 
mental procedures met these exacting 
criteria. Moreover, although devised 
for studying level of aspiration, the 
procedures lend themselves nicely to 
the kinds of theoretical and operational 
distinctions required for differentiating 
achievement-oriented motives and re- 
sponse tendencies. 

Perhaps the best example of progress 
and potential in the experimental analy- 
sis of achievement motivation and its 
antecedents is found in the work of
Rosen and D’Andrade (1959). These
investigators tested 140 boys between
the ages of 9 and 11 with the TAT
measure of n achievement in order to
select a sample of 40 extreme high and
low scorers pair-matched for age, race,
IQ, and social class. In order to study
the behavioral correlates of n achieve-
ment both in children and their parents,
Rosen and D’Andrade then devised a 
series of ingenious experimental proce- 
dures, two examples of which are 
given here: 


1. Block Stacking. The boys were asked 
to build towers out of very irregularly 
shaped blocks. They were blindfolded and 
told to use only one hand in order to 
create a situation in which the boy was 
relatively dependent upon his parents for 
help. His parents were told that this was 
a test of their son’s ability to build things, 
and that they could say anything to their 
son but could not touch the blocks. A 
performance norm was set for the experi- 
ment by telling the parents that the 
average boy could build a tower of eight 
blocks; they were asked to write down 
privately their estimate of how high they 
thought their son could build his tower. 
The purposes of this experiment were (a) 
to see how high were the parents’ aspira-
tions for and evaluations of their son, e.g.,
if they set their estimates at, above, or
below the norm; (b) to see how self- 
reliant they expected or permitted their 
son to be, e.g., how much help they would 
give him. 

2. Anagrams. In this task the boys were 
asked to make words of three letters or
more out of six prescribed letters: G, H, 
K, N, O, R. The letters, which could 
be reused after each word was made, were 
printed on wooden blocks so that they 
could be manipulated. The parents were 
given three additional lettered blocks, 
T, U, and B, and a list of words that could 
be built with each new letter. They were
informed that they could give the boy a
new letter (in the sequence T, U, B) 
whenever they wished and could say any- 
thing to him, short of telling him what 
word to build. There was a ten-minute 
time limit for this experiment. Since this 
is a familiar game, no efforts were made 
to explain the functions of the task. 

The purposes of this experiment were 
(a) to see how self-reliant the parents 
expected their son to be, e.g., how soon 
they would give him a new letter, how 
much and what kind of direction they 
would give him, if they would keep 
him working until he got all or most of 
the words on the list or “take him off 
the hook” when he got stuck, and (b)
to obtain, by scoring interaction between 
the subjects, measures of the affect gene- 
rated by the problem-solving process, 
e.g., the amount of tension shown by the
subjects, the positive and negative remarks 
directed toward one another (pp. 191-
192).


The results of these experiments 
showed consistent differences in the 
behavior of both boys and parents in 
the two n achievement groups. As in
the Winterbottom (1953) study, high 
n achievers were characterized by per- 
sistent striving and general competitive- 
ness; they built higher towers of blocks, 
worked faster, and accomplished more. 
An important added feature of the 
Rosen study is the inclusion of controls 
for IQ; hence Rosen can legitimately 
conclude that “the superior perform-
ance of high n achievement boys ap-
pears to be more a function of greater
self-reliance and zest in competitive
activity than of intelligence” (p. 199).
Similarly, parents of the high n achieve-
ment boys tended to be more competi-
tive, show more involvement, have
higher aspirations for their sons, set up
standards of excellence even where none
was given, etc. In addition to similari-
ties, important differences were found
in the behavior of the two parents.
Mothers of high n achievers differed
from those of the controls in being 
both more rewarding and more 
“pushy.” In contrast, their husbands, 
although similarly setting high stand- 
ards, gave their sons greater autonomy; 
they tended “to beckon from ahead 
rather than push from behind” (p. 216). 

In terms of our interest, the most 
important contribution of Rosen and 
D’Andrade’s research does not lie in 
its substantive findings nor even in its 
implications for the construct validity 
of McClelland’s fantasy measure of n 
achievement. In our view, the greatest 
promise of the study lies in its demon- 
stration of the applicability of experi- 
mental procedures for the measurement 
of achievement motivation in children 
under conditions simulating real-life 
situations. Noteworthy in this regard is 
the fact that Rosen administered all of 
his experiments not in the laboratory 
but in the homes of his subjects—“usu- 
ally in the kitchen.” 

While pointing to the promise of be- 
havioral and experimental measures of 
the disposition to achieve, we would 
emphasize the importance of continu- 
ing to improve and refine the instru- 
ments already available for the study of 
achievement motivation through fan- 
tasy. Much is to be gained by the 
practice employed in the major studies 
we have reviewed of using both overt 
and fantasy measures with the same 
subjects. But merely to note a low but 
significant correlation between the two 
and to go no further is to stop short 
at the very gate of scientific under-
standing. In our judgment, it is pre-
cisely from the study of the interac-
tion of behavior in fantasy and overt
action in a variety of situational con- 
texts that we stand to learn most about 
the nature and measurement of human 
motives and their expression. 


PITFALLS AND PROSPECTS 


Having examined intensively a wide 
variety of techniques employed for 
measuring three rather different types 
of personality variables in children, we 
feel some obligation, in concluding our 
discussion, to consider existing methods 
as a whole and to call attention to re- 
curring problems and possibilities. To 
begin with the debit side of the ledger, 
in the course of our review we found 
ourselves referring time and again to 
many of the same blind alleys and 
sources of confounded measurement. 
Chief among these are the following: 


1. Reliance on purely empirical ap- 
proaches to the construction and validation 
of measuring techniques without prior 
and continuing conceptual clarification of 
the precise variable or variables to be 
measured. Especially questionable in this 
regard is the practice of accepting mere 
predictive power as sufficient evidence for 
establishing the construct validity of a 
method. 

2. Failure to consider and control the 
extraneous variables that may in fact be 
measured by a particular technique. 
Common sources of such confounding in- 
clude response sets (such as acquiescence, 
social desirability, or general level of 
productivity), factors associated with 
broader social influences (such as class 
and culture), and, especially in the case 
of children, differential levels of psycho- 
logical development (e.g., capacities for 
motor coordination, form perception, and 
concept formation). In particular, the
tendency to interpret projective responses
solely in motivational terms may becloud
the operation of cognitive factors that
account appreciably for individual dif-
ferences in test performance.

3. Reliance on measurements by a single
technique applied in a single situation often
far removed from the context in which the
variable appears in the day-to-day life of
the child. Given the fact that the behavior
of children is even more likely to be
affected by immediate environmental
events than the behavior of adults, the
practice of interpreting results of ques-
tionnaires and projective techniques as
reflecting primarily enduring personality
characteristics without sufficient regard
for the influence of momentary situational
conditions or more stable social contexts
has probably done much to retard the
development of more adequate measuring
techniques in which environmental varia-
tion could be controlled, or systematically
varied, through experimental manipulation.


Although the foregoing defects are 
still to be found in many current stud- 
ies, there is evidence of increasing 
methodological sophistication and in- 
genuity among behavioral scientists 
working with children. The following
are among the most promising develop- 
ments in this respect. 


1. A trend away from traditional 
mechanistic methods of test construction, 
with their single-measure indices of re- 
liability and validity, toward multiple ap- 
proaches for establishing the construct 
validity of the theory underlying the 
measuring instrument. In this connection, 
the effort to distinguish between what we 
have called response tendencies and goal- 
oriented motives would seem to be po-
tentially fruitful.

2. Investigation of a particular variable 
simultaneously through fantasy and overt 
behavior with the aim not merely of
demonstrating a positive correlation be- 
tween the two but of exploring the 
situational conditions and processes that
appear to account both for the continuities 
and discontinuities in the two spheres of 
expression. 

3. The effort to supplement traditional 
objective and projective instruments with 
systematic behavioral observations and ex-
perimental procedures that are sufficiently
simple and attractive to be applicable to
children but permit controlled manipula-
tion of situational factors.

4. The articulation of social and cultural
variables into the procedures employed
for the appraisal of personality charac-
teristics. In many instances such articula-
tion can be accomplished only by statisti-
cal design rather than experimental ma-
nipulation.

If we are correct in our vision of 
trends to come in the measurement of
personality characteristics in children,
one fact seems clear—the world of child 
psychology, like the rest of science, will 
be requiring new and more complex 
concepts, techniques, and research de- 
signs that are at once imaginative, rig- 
orous, and appropriate to the demands 
of an expanding universe. 

The scientific study of children’s groups
has been a relatively neglected area of 
inquiry during the last ten years. It was 
brilliantly launched just prior to 1940 
by the investigations of Lewin and his 
students (Lewin, Lippitt, and White, 
1939; Lewin, 1946; Lewin, 1948). Then 
came World War II and the shifting of 
Lewin’s creative efforts toward the in- 
fluence of democratic leadership and 
social climate on the morale and ef- 
ficiency of adult groups. This marked 
the beginning of the “group dynamics” 
nucleus and cast the die in the direction 
of the many insightful investigations of 
adult groups by Bavelas, Cartwright, 
Festinger, Lippitt, and associates. The 
study of children’s groups waned in 
popularity within the group dynamics 
movement.1

1 A few of Lewin’s students maintained an
active interest in children’s groups. See Barker
and Wright, 1954; and Polansky, Lippitt, and
Redl, 1950.

The years following World War II
were also marked by another trend,
which encouraged a preference for the
study of adult rather than child groups.
Large congressional appropriations were 
made to the various branches of the 
armed services to subsidize psychologi- 
cal research on behalf of our national 
defense. It is understandable that the 
armed services were interested primar- 
ily in studies employing young adults as 
subjects. Carter’s investigations of the 
dynamics of “leaderless” groups, sup- 
ported by the Office of Naval Research, 
were conducted with young adults 
(Carter et al., 1950, 1951). The classic 
Ohio State investigations of leadership 
were heavily subsidized by military 
funds (Shartle, 1952; Hemphill, 1949). 
The “sociometric” procedures, which 
were initiated by the study of children’s 
groups (Moreno, 1951, 1953; Jennings, 
1950), were extended theoretically and 
operationally in adult groups (Gardner 
and Thompson, 1956; Tagiuri, 1952)
because of the available funds for re- 
search with military defense implica- 
tions. 

The last four or five years have seen 
some important changes in our national 
policies related to research subsidy. 
The National Science Foundation sup- 
ports “pure” research with no stated 
biases toward subjects of any species 
or whatsoever level of maturity. The
now flourishing National Institutes of 
Mental Health have no explicit policies 
that favor the study of any particular 
aged subjects. The U. S. Office of Edu- 
cation research program definitely 
favors investigations of child subjects. 
The current Zeitgeist thus augurs well 
for a revival of research interest in chil- 
dren’s groups. Indeed, with ample funds 
now available for research with child 
subjects, we may see some of the pre-
viously deflected experimenters return-
ing to the study of younger subjects
because of the alleged greater simplicity 
of the child’s behavior. 

Whether the study of children’s 
groups will again become “fashionable” 
and demand the interests of a fair share 
of our scientists remains to be seen. 
However, one thing is very clear. Much 
progress has been made during the last 
ten years in research theory and meth- 
odology by investigators using adult 
subjects. There are now many tech- 
niques to be adapted for use with 
younger subjects by the psychologist 
primarily interested in the dynamics of 
children’s groups. There are many hy- 
potheses whose generality over the de- 
velopmental span must be investigated. 
New and provocative theories of the 
dynamics of social interaction and group 
processes have been proposed, which 
may, or may not, be helpful in future 
studies of children’s groups. These con- 
ditions dictate the format and contents 
of a large part of this chapter on re- 
search procedures for the study of chil- 
dren’s groups. Any attempt to restrict 
the discussion to those methods that 
have proved their usefulness in the 
study of children’s groups would neg- 
lect the most fruitful aspects of pres- 
ently available methods for group 
study. 


RESEARCH IS NEEDED 


To the impassive and disinterested 
scientist, “research is its own reason 
for being.” The curiosity of the in- 
quiring mind is insatiable and leads man 
toward the observation and manipula- 
tion of almost every discriminable as- 
pect of his surroundings. He has an 
intense need to “understand” as well as
a strong need to manipulate his en-
vironment toward conditions most fa-
vorable to his “happiness.” The study
of children’s groups is related to both
these needs.

Theoretical Problems. The develop- 
ing abilities of the human organism for 
social interaction are obvious, even to 
the casual observer. The young infant
lacks the necessary perceptual and
motor skills for anything but the most
primitive of symbiotic relationships 
with his mother—the social authentic- 
ity of which remains in doubt. The 
preschool years are marked by an ever 
widening array of social skills. The
phonemes appear spontaneously as the
infant’s neuromusculature matures and
become differentially patterned through
learning experiences into a functional 
communication system. How much of 
a language repertory is necessary before 
young children become responsive to 
each other in a group situation? Studies 
of 2-year-old children show that they 
are largely involved in solitary and/or 
parallel activities within a social group- 
ing (Parten, 1932). Is this because they 
lack the necessary language skills to in-
fluence and be influenced by each
other? Or is it because they lack the 
developing skills of socioempathy? 
Murphy has shown that the perception 
of a companion being in distress, which 
is necessary for the social response of 
sympathy, does not make its appearance 
until around the fourth year (Murphy, 
1937). This is about the same age when 
fairly sustained cooperative undertak- 
ings are observed among preschool 
children (Parten, 1932). What percep-
tual and motor skills are required be-
fore the “group properties” of cohesion
(French, 1941), contagion (Polansky, 
Lippitt, and Redl, 1950), “fight and 
flight” (Bion, 1952), and so on, make 
their appearance? When do children 
acquire the need to be in the company 
of their peers, and what social-emo- 
tional variables influence the acquisition 
of this need? Answers to these ques- 
tions, which are based on solid research 
findings, are not as yet available. Ob- 
servations of young children in the 
Kibbutz of Israel (Faigin, 1958; Rabin, 
1957) suggest that they may be respon- 
sive to the “social climate” of their 
peers at a very early age. However, the 
relevant variables need a systematic 
examination with appropriate control 
groups before one’s scientific curiosity 
can be reasonably satisfied. 

The foregoing unanswered questions 
could be raised in analogous fashion 
throughout the later periods of the de- 
velopmental years. With the increasing 
maturity of the growing child, even 
more questions about the dynamics of 
children’s groups become important. 
The following are only a small sample 
of those that pique the curiosity of an 
observer of children’s social relations 
when they are in the company of their 
peers. What developmental and social 
variables initiate the sex cleavage in 
children’s spontaneous play groups dur- 
ing the preadolescent years? Do the 
same variables function to modify the 
social climates of those formal groups 
(e.g., the classroom) in which children 
of this age are forced into social inter- 
action? What perceptual, motor, and 
other abilities and interests promote 
the greater “social maturity” of the 
female in our culture as contrasted with
her male peer? Could this usual state
of affairs be modified experimentally? 
What developmental and social vari- 
ables are related to the adolescent’s 
withdrawing from formally organized 
social groupings (e.g., Boy Scouts, Girl
Scouts, etc.) in favor of less “formally 
organized” groupings (e.g., the gang)? 
What social variables in group function- 
ing predispose adolescents toward the 
invention and use of new verbal con- 
cepts? 

The many dimensions of group life 
during childhood deserve study in their 
own right. The questions raised repre- 
sent only a very few of the unknowns. 
The unknown is always tantalizing to 
the scientist, so there can be little doubt 
that children’s groups will eventually 
be studied as intensively, and with at 
least an equal profit to man’s knowl- 
edge, as the complex social interactions 
of ants, bees, chickens, and dogs. 


METHODOLOGICAL 
ADVANTAGES 


There is a hard core of American 
psychologists who believe that research 
progress can best be expedited by con- 
centrating our efforts first on the be- 
havior of infrahuman subjects by con- 
structing an explanatory model for 
“simple” organisms and then revising 
and expanding this model, as subsequent 
research may dictate, to explain the 
more “complex” behavior of man. These
psychologists have traditionally selected 
the white rat as an appropriate experi- 
mental subject. If one accepts the valid- 
ity of this approach, he can make an 
equally strong case for studying the 
dynamics of children’s groups prior to 
the presumably more difficult task of
identifying and measuring the dimen- 
sions of social functioning within groups 
composed of adults. 

An argument can be made that the 
psychological defenses of children will 
have been less frequently practiced 
than those of adults. Hence the available 
behaviors from which the scientist must 
make his inferences will be more direct 
and spontaneous. Although this may in
fact be a defensible argument, it is prac- 
tically and operationally not demon- 
strable at our present state of ignorance 
about the functioning of social groups 
at any level of maturity. 

Another methodological advantage 
that is sometimes claimed for the study 
of children’s groups is their readier 
availability and a greater ease in manip- 
ulating the conditions important to their 
functioning. There seems to be some 
justification for the first part of this 
claim, although college sophomores 
may still be preferred because of their 
greater proximity to the typical univer- 
sity laboratory. The second alleged ad- 
vantage seems unwarranted. Children 
are generally less tolerant than adults 
about anything that bores or frustrates 
them. If group conditions become un- 
interesting or unduly frustrating, they 
are more prone than adults to withdraw 
(either overtly or covertly) from the 
group. For example, in one of Lewin’s 
authoritarian groups a boy who became 
a scapegoat for his companion’s aggres- 
sion quit the group (Lewin, Lippitt, and 
White, 1939). In addition, there are 
some ways of manipulating the social 
conditions of adult groups that no one 
as yet has been ingenious enough to 
adapt for use in the study of children’s
groups. A good example is the introduc- 
tion of a “ringer” or “stooge” into the 
adult group, either to make undetected 
observations or to play a social role im- 
portant to the group’s functioning 
(Festinger, 1953). 

In general, it may be concluded that 
there are no demonstrated advantages 
for furthering scientific knowledge by 
the study of child as contrasted with 
adult groups. The investigator who 
elects to do research with children’s 
groups may be legitimately influenced 
in this direction by any of the following 
conditions: he may be most interested in 
studying developmental variables affect-
ing social groups during the accelerated
growth period of the early years of life; 
he may have readier access to child sub- 
jects; he may find it more personally 
rewarding to do research with children; 
or he may wish to throw light on some 
pressing social problem whose solution 
would help to advance the happiness 
and adjustment of many children. As a 
group, child psychologists have been 
characteristically most highly influenced 
by the last condition—the need to help 
children.

One distinct methodological advan- 
tage in the study of children’s groups 
is the urgency with which teachers, par- 
ents, and youth counselors regard their 
practical problems in the area of social 
psychology. This sense of urgency can 
be used to advantage in securing the 
cooperation of individuals responsible 
for the supervision and guidance of 
children’s groups. However, the investi- 
gator must be able to project a sincere 
concern about the importance of the 
applied problems if he hopes to gain
full-fledged cooperation. 

Individuals who work with children 
in group situations are faced with a 
host of unsolved problems. They are 
forced to adopt rule-of-thumb proce- 
dures to maintain interest and “disci- 
pline” because no sure guide lines have 
been established in the research litera- 
ture. 

Consider the plight of the conscien- 
tious teacher who must interact daily 
with some 30 to 40 children in a group 
situation. She has some vague impres- 
sion that she should be “democratic” 
toward the group, arrange the curricu- 
lum to meet the largely unknown needs 
of each and every pupil, promote co- 
operative efforts without destroying in- 
dividual initiative, and all the while see 
that her pupils do not damage school 
property or disturb the classroom across 
the hall. A formidable assignment! 

The recreational supervisor is faced 
with similar problems, with perhaps a 
little less pressure to maintain “disci- 
pline.” However, he is confronted with 
an equally difficult task that is partially 
solved for the teacher by laws of com- 
pulsory school attendance. The recrea- 
tional leader must find ways and means 
of maintaining a high level of interest 
and individual satisfaction within his 
group or it will dwindle away. Like the 
teacher, he falls back on rule-of-thumb 
devices, such as badges, intergroup com- 
petitions, and so on. Are these ap- 
proaches the most effective ways to 
solve his problems? Perhaps, although 
they are frequently ineffective. Con- 
trolled research is necessary to isolate 
the relevant social variables and then 
to vary them systematically to see how
they influence the structure and dy- 
namics of different kinds of recreational 
groupings. 

The psychologist who does therapy 
with children in group situations is in 
a somewhat more favorable position 
than the teacher or recreational super- 
visor. He has usually been well trained 
in observational methods, so that he is 
more prone to develop and modify his 
rule-of-thumb guides with experience. 
However, he is forced to “play by ear” 
and sometimes adopts regrettable group 
procedures. He is potentially a signifi- 
cant contributor to knowledge about 
the structure and dynamics of children’s 
groups, although his atypical subjects 
may distort or conceal many of the pri- 
mary dimensions of group functioning. 

What are some of the problems whose 
solutions would have practical implica- 
tions for these teachers and counselors 
of children? Are there hints in the avail- 
able research literature on either chil- 
dren’s or adults’ groups that may pro- 
mote and expedite our groping toward 
valid and useful generalizations? Let us 
consider a small sample of the problems 
that have attracted the interests of so- 
cially oriented psychologists. 


GENERAL RESEARCH 
APPROACHES 


It has now become conventional to 
speak of three general approaches to 
the study of human behavior. This ru- 
bric serves to limit the boundaries of 
the present discussion. 

The normative-descriptive study of 
children’s groups in their naturalistic 
setting is employed as a first approxi-
mation to the identification of the most
significant variables in group function- 
ing. It has been used to describe the 
structure and dynamics of such diverse 
children’s groups as adolescent gangs 
(Thrasher, 1927), communal living 
within the Kibbutz in Israel (Faigin, 
1958), group process involving both 
children and adults in a small town 
(Barker and Wright, 1954), infant-nurse 
relationships within nurseries (Dennis 
and Najarian, 1957), and so on. Since 
this approach is usually applied far and 
wide over diverse cultural and social 
conditions and since it generally enjoys 
the “fresh-naive-look” of phenomenol- 
ogy, it always bears the potential of 
uncovering heretofore hidden or neg- 
lected variables important to group 
functioning. The special characteristics 
and techniques peculiar to the norma-
tive-descriptive or “ecological” ap-
proach are discussed at length in
Chapter 3 of this book. 

The correlational approach may, or 
may not, utilize existing children’s 
groups as research subjects. Its distin- 
guishing characteristic is the attempt to 
identify covariances among the variables 
related to group structure and dynam- 
ics or covariances between these varia- 
bles and certain manipulable disposi- 
tional conditions such as type of adult 
leadership. An example of the latter is 
the established covariance between the 
number of “dominative” overtures
made by different classroom teachers 
and the dominative responses of their 
pupils (Anderson, 1939). The infer- 
ence is then drawn that adult domina- 
tion tends to produce pupil domina- 
tion. A good example of the study of 
covariances of variables within group
functioning is Hemphill’s (1949) corre-
lational analysis of the relationships be-
tween several group dimensions (ho-
mogeneity, stability, etc.) and members’
relationships with the group (participa- 
tion, dependence, etc.). Another ex- 
ample is Cattell and associates’ (1953) 
factor analysis of 80 groups on 93 vari- 
ables (including behavior ratings, group 
judgments, and motor performances), 
which resulted in the identification of 
15 factors considered important to the 
description of group structure and dy-
namics (e.g., group elation, reckless-
ness, and intrinsic synergy). The cor-
relational approach is time-honored in
psychology, geology, astronomy, and 
under any conditions in which it is 
difficult (or impossible) to manipulate 
the conditions predispositional to the 
variance of some dimension of interest. 
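
The identification of covariances described above can be sketched in a few lines of Python. The fragment below builds a small correlation matrix from classroom-level measures. The variable names and the six classrooms of data are hypothetical and are not taken from Anderson, Hemphill, or Cattell. A matrix of this kind is also the usual starting point for a factor analysis, which is not attempted here.

```python
from math import sqrt

def correlation(x, y):
    """Product-moment correlation between two equal-length lists."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def correlation_matrix(variables):
    """Correlate every variable with every other; returns a nested dict."""
    names = list(variables)
    return {
        a: {b: correlation(variables[a], variables[b]) for b in names}
        for a in names
    }

# Hypothetical observations, one value per classroom (six classrooms).
classrooms = {
    "teacher_dominative": [12, 30, 18, 25, 8, 21],
    "pupil_dominative": [5, 14, 9, 11, 4, 10],
    "group_size": [28, 35, 30, 31, 33, 29],
}

matrix = correlation_matrix(classrooms)
for row_name, row in matrix.items():
    cells = "  ".join(f"{value:6.2f}" for value in row.values())
    print(f"{row_name:20s} {cells}")
```

A high coefficient between the two dominative measures describes only their covariation across classrooms; the direction of influence remains an inference, as the text indicates.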

A substantial number of psycholo- 
gists like to bring nature into the ex- 
perimental laboratory. They prefer to 
follow the pattern of physics and chem- 
istry rather than that of astronomy or 
geology. They attempt to control all 
but a very few of the “independent” 
variables, then systematically to vary 
these conditions in an effort to deter- 
mine the covariation of selected “de- 
pendent variables” (e.g., to vary the 
size of a group systematically and to 
observe any correlated changes in the 
formation of subgroups during free 
discussion periods). Although the con- 
trol group is not a necessary component 
of the experimental method, it is a com- 
mon adjunct, since it provides a reason- 
able estimate of the influence of the 
variables not under the experimenter’s 
control. Another method of control is
to have several groups exposed to the
same, but differentially ordered, experi- 
mental conditions (e.g., Group A is ex- 
posed to conditions X, Y, and Z in that 
order; Group B to Y, Z, and X in that 
order; and Group C to Z, X, and Y in 
that order). This is the so-called coun- 
terbalanced experiment. This method 
produces unknown interaction effects, 
the results of which may support spuri- 
ous inferences. Despite its obvious op- 
erational complexities the experimental 
method can be used to pinpoint covari- 
ances with the greatest rigor. The con- 
tributions of experimental studies to our 
knowledge of psychophysics, learning, 
and psychophysiology have been most 
impressive. Can the social psychology 
of group behavior be expected to draw 
with equal success on experimental pro- 
cedures of this type? It is too early to 
say, but it can be stated that controlled 
experimentation with children’s groups 
has been conducted with signal success. 
(See Roseborough, 1953, for an excel- 
lent summary of experimental studies.) 
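
The counterbalanced ordering described above (Group A: X, Y, Z; Group B: Y, Z, X; Group C: Z, X, Y) can be sketched in a few lines. This is only an illustration: the function name `counterbalanced_orders` is hypothetical, and the cyclic rotation it uses is one simple way of obtaining such orders, not a procedure prescribed by the text.

```python
def counterbalanced_orders(conditions):
    """Return one cyclic rotation of `conditions` per group.

    With n conditions this yields n orders in which every condition
    occupies every serial position exactly once (a cyclic Latin square).
    """
    n = len(conditions)
    return [conditions[i:] + conditions[:i] for i in range(n)]


# Three conditions and three groups, as in the example in the text.
orders = counterbalanced_orders(["X", "Y", "Z"])
for group, order in zip("ABC", orders):
    print(f"Group {group}: {', '.join(order)}")
```

A rotation of this kind balances serial position only. It does not balance which condition immediately precedes which, so it does not remove the interaction (carry-over) effects mentioned above.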

1 See Campbell (1957) for an excellent discussion
of experimental design in group
studies.

A vexing problem in each of these
research procedures, when applied to
the study of children’s groups, is estimating
the limits of the obtained generalizations.
Although this problem is
common to all of science, it seems especially
troublesome in the study of
group behavior because there are so
many parameters potentially important
to patterned relationships among process
variables. For example, to mention
only a few possibilities: the numerical
size of the groups, the sex of the members,
the ages of the members, the subculture
from which the members are
drawn, the “intrinsic” interest of group 
activities to the members, special abili- 
ties of the members, such as intelli- 
gence, motor skills or handicaps, emo- 
tional disturbances, and so on, through 
many more conditions and all of their 
possible combinations. It would be a 
Gargantuan and futile undertaking to 
conduct investigations that would ex- 
haust all of the combinations of param- 
eters potentially important to any 
generalization about the structure or 
dynamics of children’s groups. For- 
tunately, such a program is not de- 
manded. Any generalization is always 
subject to restriction by conflicting 
findings from studies involving similar 
parameters; and any “law,” “principle,” 
or “trend” remains forever hypothetical 
in its generality and adequacy. This 
state of affairs is a continuous stimulant 
to the seasoned scientist but may be a 
depressing reminder to the beginning 
investigator who hopes to “solve” his 
problem once and for all by a “crucial” 
experiment. The inductive-deductive- 
inductive processes by which generali- 
zations are drawn, hypotheses pro- 
posed, and theories constructed are 
never static in a growing science that 
promotes its own ends by the logical 
and operational tools it invents. 


SELECTING OR CREATING GROUPS


It will be assumed that the experi- 
menter has selected some important 
problem for study on the basis of a 
preferred theory or in response to a 
pressing problem in group instruction, 
management, or supervision. He knows 
what variables he wishes to study but
is relatively unfamiliar with the lore
of selecting or creating groups of children
with whom to work. How can he
best proceed?

Selecting Appropriate Groups of 
Children. If the experimenter is pri- 
marily interested in the functioning of 
some type of existing group, such as a 
delinquent gang, a classroom, or a 
formally organized recreational unit, its 
composition is largely determined for 
him. Under these conditions his prob- 
lem is to select a sample of the available 
groups in such a way that his findings 
can be maximally generalized to this 
class of children’s groups. 

There are two widely practiced pro- 
cedures among social scientists in select- 
ing the sample of groups to be studied. 
The first is based on sheer expedience. 
The investigator selects all, or part of, 
the groups that are near at hand. This 
approach will limit his generalizations 
to the groups studied unless he can 
demonstrate that those under investiga- 
tion are very similar in many ways to a 
larger and reasonably well defined 
population of groups. This demonstra- 
tion permits tentative generalization of 
findings. 

A more defensible procedure for 
selecting groups to be studied consists 
of drawing a representative sample 
stratified on those variables considered 
relevant to the dimensions of group 
structure and dynamics under investiga- 
tion. Judgments about relevance may 
be based on the findings of prior re- 
search or on theoretical assumptions. 
A good model for this type of selection 
can be found in the work of authors of 
standardized achievement tests developed
for use in our nation’s classrooms
(Kelley, 1953).
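
The stratified selection of groups described above can be sketched as follows. Everything in the example is assumed for illustration: the function name `stratified_sample`, the classroom records, the two stratifying variables (community setting and grade), and the sampling fraction. None of these comes from the studies cited.

```python
import random
from collections import defaultdict


def stratified_sample(groups, stratum_of, fraction, seed=0):
    """Draw the same fraction of groups at random from every stratum."""
    rng = random.Random(seed)  # fixed seed so the draw can be repeated
    strata = defaultdict(list)
    for group in groups:
        strata[stratum_of(group)].append(group)
    sample = []
    for key in sorted(strata):
        members = strata[key]
        # Take at least one group from each stratum.
        k = max(1, round(fraction * len(members)))
        sample.extend(rng.sample(members, k))
    return sample


# Hypothetical population: 32 classrooms in four strata of eight each.
classrooms = [
    {"id": f"{setting}-{grade}-{i}", "setting": setting, "grade": grade}
    for setting in ("rural", "urban")
    for grade in (3, 5)
    for i in range(8)
]

sample = stratified_sample(
    classrooms, lambda c: (c["setting"], c["grade"]), fraction=0.25
)
print(len(sample), "classrooms selected")
```

Sampling the same fraction from each stratum keeps the sample proportional to the population on the stratifying variables, which is what permits generalization to the larger class of groups.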

Another feature that must be considered
in the selection of existing groups
is their accessibility for study. In addition to
obtaining the necessary permission and
cooperation of the groups and their
sponsors, there is the more difficult
problem of securing observations without
destroying the properties of the
group being studied. Although this is a
problem in any investigation, it may be
especially troublesome in working with
existing groups whose members will
naturally be curious about the experimenter’s
special interest. For example,
an informally organized boys’ gang that
tolerated an experimenter’s intrusion
would probably be atypical to an extreme
degree. Ways must be found to
make the investigator’s presence acceptable
and “natural” so that the variables
being studied are not distorted by
the acts of observation.

Groups for Experimental Purposes. 
For the study of many problems in 
group structure and dynamics, there 
are advantages in creating special 
groups. This was the experimental pro- 
cedure followed in the now classic 
studies of democratic and authoritarian 
leadership (Lippitt, 1940). The boys 
for each of the clubs were selected so 
that they had a common sociometric
structure in the classrooms from which 
they were drawn. 

If the experimenter decides to create 
his own groups, he must be mindful of 
several conditions that may influence 
their functioning—or even their con- 
tinuing existence throughout the de- 
sired experimental period. There must 
be reasons for creating groups that are 
acceptable to their members in order
to avoid “artificiality.” These reasons
will vary from community to community,
for they should be contingent
on the particular mores, folkways, and
general styles of living prevalent in the
subcultures from which the subjects are
selected.

In order to maintain the group, the 
experimenter must arrange for activi- 
ties that are intrinsically interesting to 
the children involved. If this cannot be
effected, there will be disruptive absences
or drop-outs that may have serious
influences on the functioning of
the groups being studied.

TABLE 19-1. DESCRIPTION OF ADULT-LEADER’S FUNCTIONS IN THE AUTHORITARIAN,
THE DEMOCRATIC, AND THE LAISSEZ-FAIRE GROUP ATMOSPHERES
STUDIED BY LEWIN AND HIS STUDENTS

Authoritarian

1. All determination of policy by the leader
2. Techniques and activity steps dictated by the authority, one at a time, so
that future steps were always uncertain to a large degree
3. The leader usually dictated the particular work task and work companion
of each member
4. The dominator tended to be “personal” in his praise and criticism of the
work of each member; remained aloof from active group participation except
when demonstrating

Democratic

1. All policies a matter of group discussion and decision, encouraged and
assisted by the leader
2. Activity perspective gained during discussion period. General steps to
group goal sketched, and when technical advice was needed the leader
suggested two or more alternative procedures from which choice could be made
3. The members were free to work with whomever they chose, and the division
of tasks was left up to the group
4. The leader was “objective” or “fact-minded” in his praise and criticism
and tried to be a regular group member in spirit without doing too much work

Laissez-Faire

1. Complete freedom for group or individual decision, with a minimum of
leader participation
2. Various materials supplied by the leader, who made it clear that he would
supply information when asked. He took no other part in work discussion
3. Complete nonparticipation of the leader
4. Infrequent spontaneous comments on member activities unless questioned
and no attempt to appraise or regulate the course of events

From Ralph White and Ronald Lippitt, Leader behavior and member reaction in three
social climates. In Cartwright and Zander (1953) with permission of authors and Row,
Peterson and Company.

If the investigation involves the comparison
of outcomes or ongoing processes
for experimental and control
groups, the experimenter who creates
his own groups is called upon to demonstrate
the comparability of the members
on all variables judged to be relevant.
For example, this might involve
a demonstration of comparable ages,
IQ’s, socioeconomic status, sociometric
relationships, skills, and the like. This
is no easy assignment; therefore, success
in securing comparability of subjects
is usually approximate. Since it has
been shown that only one member with
strong social influence can have far-reaching
effects on group structure and
dynamics (Festinger, 1953), the importance
of member comparability in especially
created groups cannot be overestimated.
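
One way of summarizing the comparability of experimental and control members on the relevant variables is sketched here. The standardized mean difference used is an illustrative choice, not a method named in the text, and the function names, member records, and variable names are all hypothetical.

```python
from statistics import mean, stdev


def standardized_difference(a, b):
    """Difference between two group means in pooled standard deviation units."""
    pooled = ((stdev(a) ** 2 + stdev(b) ** 2) / 2) ** 0.5
    return (mean(a) - mean(b)) / pooled if pooled else 0.0


def comparability_report(experimental, control, variables):
    """Return the standardized difference for each variable of interest."""
    return {
        v: standardized_difference(
            [m[v] for m in experimental], [m[v] for m in control]
        )
        for v in variables
    }


# Hypothetical members of one experimental and one control group.
experimental = [
    {"age": 10, "iq": 100}, {"age": 11, "iq": 110},
    {"age": 10, "iq": 105}, {"age": 11, "iq": 95},
]
control = [
    {"age": 10, "iq": 104}, {"age": 11, "iq": 108},
    {"age": 11, "iq": 106}, {"age": 10, "iq": 98},
]

report = comparability_report(experimental, control, ["age", "iq"])
for variable, difference in report.items():
    print(f"{variable}: {difference:+.2f}")
```

Values near zero suggest comparable groups on that variable. A summary of means cannot reveal a single unusually influential member, so it complements rather than replaces inspection of the individual children.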


EXPERIMENTAL PROCEDURES 


The most important work in the ex- 
perimental manipulation of “group at- 
mospheres” was initiated by Lewin and 
carried on by his former students. Prior 
to Lewin’s formulations, there were 
some empirical guides, such as those 
of Thomas (1929), Jack (1934), and
Page (1936); however, these investigators
failed to conceptualize the pervasive
and dynamic social interactions that
define the broad spectrum of “group
atmosphere” or “social climate.”

Let us take a look at Lewin’s proce- 
dures for influencing group structure 
and dynamics by the induced social 
forces of democratic, laissez-faire, and 
authoritarian leadership. The adult lead- 
ers were especially trained to produce 
the group atmospheres described in 
Table 19-1. 

It was postulated and verified that
the leader’s behavior would induce social
forces that would influence the
group members’ responses to each
other and to the leader, so that the
group atmosphere would be somewhat
self-propelling once it was set in motion.
Observational procedures were
adopted to describe the social process
within the ongoing group atmospheres
(e.g., subgroupings, expressions of “we-ness,”
hostility, and dependence on
leader) and to assess the over-all effects
of each atmosphere on group functioning.
One of the types of differential
influence of autocratic and
democratic settings is presented schematically
in Fig. 19-1.

Fig. 19-1. Subgrouping and potency of the group as a whole in (a) an autocratic and (b) a
democratic setting. (From R. Lippitt. 1940. An experimental study of the effect of democratic
and authoritarian group atmospheres. Univer. Iowa Stud. Child Welf., 16, 133-135. As adapted
for use by K. Lewin, Behavior and development as a function of the total situation. In
L. Carmichael (Ed.). 1954. (2nd ed.) Manual of child psychology. With permission of the
University of Iowa and John Wiley and Sons, Inc.)

Several of Lewin’s students have successfully
influenced group processes in
a predictable way by manipulating selected
independent variables. For example,
Thompson (1940) was able to
produce an underprivileged minority
in a larger group of 10-year-old children
by prejudicial adult leadership.
The children in the privileged majority
continued to discriminate against the
minority members, even when the
leader was absent from the group. This
shows something of the self-sustaining
forces that can be induced into a group
situation. Wright (1940) showed that
adult-induced frustration may produce
an increment in the friendship between
two children if they “see themselves
opposed to the adult.” French (1944)
showed that “organized groups” (pre- 
viously organized in social structure) 
evidenced more interdependence and 
motivation than socially restrained, un- 
acquainted groups when confronted 
with a frustrating puzzle and an ex- 
perimentally produced fire. 

Many subsequent investigators have 
utilized stress conditions to influence 
social processes within a group. Bishop 
(1951) showed that experimentally 
produced negative attitudes of mothers 
toward their preschool-aged children’s 
behavior resulted in more inhibition
and noncooperation in their children.
Deutsch (1949) utilized several groups 
of five adults each to study the effects 
of intergroup and intragroup competi- 
tion on group functioning (group solu- 
tions to puzzles and human relations 
problems). Observation sheets, nine- 
point rating scales, weekly question- 
naires, and postexperimental question- 
naires were employed to assess group 
processes and outcomes. It was con- 
cluded that intergroup competition 
favored more within-group cooperative 
interrelationships and greater organiza- 
tional productivity. Grosser, Polansky, 
and Lippitt (1951) have shown that 
the impulsive initiation of acts by paid 
subjects is effective in overcoming the 
restraints of other children by a proc- 
ess that they describe as “behavioral 
contagion.” Hare (1953) has demon- 
strated that boy leaders trained for par- 
ticipatory leadership have more influ- 
ence on group functioning than boys 
trained for supervisory leadership. 
Lanzetta and associates (1954) have 
found that threats toward either the 
group as a whole or to any of its mem- 
bers had a negative effect on interper- 
sonal relations and the effectiveness of 
the group. All of these studies are illustrative
of the many ways that an
experimenter can influence the structure
and dynamics of group functioning
by experimental procedures.* The
possible techniques seem limited only
by the experimenter’s ingenuity and the
theoretical “meaningfulness” of the induced
social forces.

*See L. Festinger. 1953. Laboratory experiments.
In L. Festinger and D. Katz (Eds.),
Research methods in the behavioral sciences.
New York: Dryden. Pp. 136-172. This is an
excellent discussion of such laboratory procedures
as use of paid participants, false reporting,
and pre-experimental instruction of
leaders or members.


DIMENSIONS OF GROUP FUNCTIONING


The following is not an exhaustive 
survey of the concepts that have been 
proposed to describe group processes; 
rather it is a selection of those dimen- 
sions about which considerable empiri- 
cal research data are now available. The 
present discussion should not be inter- 
preted as implying that all of these 
variables have found a permanent niche 
in behavioral science or that the most 
important variables are now known. 
Research with groups is of recent 
origin, and there is every reason to
believe that it is still essentially explor- 
atory-descriptive in scope. 

Cohesiveness.* Cohesiveness encom- 
passes the properties of a group (such 
as goals, type of organization, and 
status in the community) in combina- 
tion with the needs of an individual, 
which may be expected to be met by 
belonging to the group (for example, 
affiliation, recognition, succorance, and 
the like). “The cohesiveness of a group,
thus, is the resultant of all the forces 
acting on all members to remain in the 
group.” (Cartwright and Zander, 1953.) 


*The definition of this concept (and of
that immediately following) is based on
D. Cartwright and A. Zander (Eds.), 1953.
Group dynamics: research and theory. Evan- 
ston, Ill.: Row, Peterson. The serious student 
of the structure and dynamics of children’s 
groups will do well to master the contents of 
this volume. 
Research findings and theoretical
analysis point toward a number of 
variables related to degrees of group 
cohesiveness. The members may be at- 
tracted by each other as individual per- 
sonalities, by the activities supported 
by the group, or by both conditions. 
Cohesiveness based on the functioning 
of different patterns of social variables 
produces different patterns of commu- 
nication and interpersonal attractions. 
Cohesiveness that is a function of the 
intrinsic needs of members results in 
different group processes than cohe- 
siveness based on social forces exerted 
by external authorities (e.g., the differences
between a classroom and an informally
organized gang). Cohesiveness
is enhanced by cooperative relationships,
by some external threats to
the group’s existence, and by some external
evaluations of a favorable type.
Cohesiveness is diminished by interpersonal
conflicts and frustrations within
the group, by unpleasant assignments,
and by some types of negative evaluations
from outsiders. Whether the concept
of cohesiveness is largely descriptive
of a large class of patterned functions
or has some “unitary” invariant
properties such as general intelligence
remains in doubt (Gross and Martin,
1952). Nevertheless, the concept has
stimulated some very fruitful investigations
(e.g., Fessenden, 1953; French,
1941; Schachter et al., 1951; Thibaut,
1950).

Group Pressures.* Although many
formally organized groups (e.g., the
classroom, the Boy Scout troop, and
the Sunday-school class) exert fairly
obvious pressures on their members by
chartered intent, informally organized
groups may exert equally strong, or
stronger, pressures on their members
for uniformity of behavior, attitudes,
standards of excellence in performance,
and so on. It seems reasonable to infer
that these social pressures serve two
general functions important to the
group: “to help the group accomplish
its purposes, and to help the group
maintain itself as a group” (Cartwright
and Zander, 1953).

* Cartwright and Zander, 1953.

Many ingenious experiments have 
been conducted to investigate the an- 
tecedents and consequences of group 
pressures. The techniques adopted by 
Asch (1951), by Festinger, Schachter, 
and Back (1950), and by Lippitt et al. 
(1952) should be especially noted. In 
a general summary Cartwright and 
Zander (1953) note that the strength 
of social forces on the individual member
toward uniformity and conformity
is probably determined by the follow- 
ing conditions: the attraction of the 
group for the member, the importance 
of the issue on which conformity is 
demanded, an awareness that the stand- 
ard for conformity is unanimously sup- 
ported by other members, and the size 
of the group exerting the forces to- 
ward conformity. 

The special characteristics of those 
social pressures that fail to effect unani- 
mous conformity or that result in the 
formation of “splinter” subgroups are 
largely unknown. However, such vari- 
ables as differential perceptions, attrac- 
tiveness of the group, differential pun- 
ishments for nonconformity, differen- 
tial rewards for yielding, and so on, 
have been proposed as possible explanations.
This aspect of the functioning
of group pressures offers an especially 
attractive area for further research— 
indeed, exploratory research in chil- 
dren’s groups. 

Leadership. Considerable research 
has been conducted over the years on 
the influence of different kinds of adult 
leadership on the structure and dy- 
namics of children’s groups; however, 
relatively little has been done to inves- 
tigate the dimensions of a child mem- 
ber’s leadership in children’s groups 
(Parten, 1933; Hanfmann, 1935). The 
developmental aspects of the leadership 
function have been grossly neglected 
in the last decade, a period during 
which psychologists have exerted sub- 
stantial amounts of energy to study 
leadership in adult groups. The find- 
ings of studies with adults should, how- 
ever, provide a rough blueprint for in- 
vestigations that would have a devel- 
opmental perspective. 

The evidence for a general “trait” of 
leadership is not convincing. An indi- 
vidual who is a leader in one group 
may not be a leader in a group with 
different goals. For example, Carter and 
associates (1950) found, from their 
factor analysis, two different kinds of 
leadership ability: an “intellectual leadership”
factor and a “doing things with
one’s hands leadership” factor. It seems
probable that there is even a greater
variety of leadership abilities. Hence
the most fruitful exploratory research 
in children’s groups might be the study 
of variables related to different types 
of leadership. 

Group Structure. Group structure 
defines the patterning of social relations 
within a group. Who prefers the companionship
of whom? Who has the
greatest influence or “power” over 
whom? Who communicates most with 
whom? Which members are most fre- 
quently found within a common sub- 
grouping? Information related to the 
answers to these and similar questions 
falls under the general rubric of “group 
structure.” 

Group structure is the one dimen- 
sion in group processes for which rea- 
sonably reliable and useful scales have 
been developed (Jennings, 1950; North- 
way, 1952; Gardner and Thompson, 
19592). Advances in sociometry have 
stimulated a great deal of research with 
which the reader may become familiar 
by reading the various issues of the 
professional journal Sociometry. The 
properties of several instruments for 
measuring group structure are discussed 
in a later section of this chapter. Suffice 
to say at this point that social-relations 
scales have been a boon to research on 
the social structure of groups. 
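
The questions that define group structure (who prefers whom, and which members choose each other) can be tabulated from simple choice data. The sketch below is a generic tally and does not reproduce any of the published scales cited above. The children's names, their choices, and the function names are hypothetical.

```python
def choices_received(choices):
    """Count how many times each member is chosen by the others."""
    counts = {name: 0 for name in choices}
    for chosen in choices.values():
        for name in chosen:
            counts[name] = counts.get(name, 0) + 1
    return counts


def mutual_pairs(choices):
    """Return the pairs of members who chose each other."""
    pairs = set()
    for chooser, chosen in choices.items():
        for name in chosen:
            if chooser in choices.get(name, []):
                pairs.add(tuple(sorted((chooser, name))))
    return sorted(pairs)


# Hypothetical answers to "With whom would you most like to work?"
choices = {
    "Ann": ["Bea", "Cal"],
    "Bea": ["Ann"],
    "Cal": ["Ann", "Dee"],
    "Dee": ["Cal"],
    "Eli": ["Ann"],
}

received = choices_received(choices)
pairs = mutual_pairs(choices)
print(received)
print(pairs)
```

In these data Ann receives the most choices and Eli receives none, while the mutual pairs mark the reciprocated ties around which subgroupings would be expected.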

Dimensions of Group Structure and 
Dynamics. Space does not permit a dis- 
cussion of the many identifiable prop- 
erties of group functioning. Social 
communication (Back, 1951; Festinger, 
1950, 1951), socioempathy (Barker, 
1942; Borgatta, 1954), communication 
patterns (Bavelas, 1950; Heise and 
Miller, 1951; Leavitt, 1951), and social 
perception (Tagiuri, 1952) are among 
the many dimensions that have at- 
tracted the attention of investigators. 

The thoughtful analysis by Hemp- 
hill and Westie of some of the salient 
dimensions of group structure and dy- 
namics is a rough chart of the variables 
that merit immediate study. They have 
developed scales for evaluating the following
dimensions of group functioning:*


1. Autonomy is the degree to which 
a group functions independently of other 
groups and occupies an independent posi- 
tion in society. It is reflected by the de- 
gree to which a group determines its own 
activities, by its absence of allegiance, 
deference and/or dependence relative to 
other groups. 

2. Control is the degree to which a 
group regulates the behavior of individ- 
uals while they are functioning as group 
members. It is reflected by the modifica- 
tions which group membership imposes on 
complete freedom of individual behavior 
and by the amount or intensity of group- 
derived government. 

3. Flexibility is the degree to which a 
group’s activities are marked by informal 
procedures rather than by adherence to 
established procedures. It is reflected by 
the extent to which duties of members are 
free from specification through custom, 
tradition, written rules, regulations, codes 
of procedure, or even unwritten but 
clearly prescribed ways of behaving. 

4. Hedonic Tone is the degree to 
which group membership is accompanied 
by a general feeling of pleasantness or 
agreeableness. It is reflected by the fre- 
quency of laughter, conviviality, pleasant 
anticipation of group meetings, and by the 
absence of griping and complaining. 

5. Homogeneity is the degree to which 
members of a group are similar with re- 
spect to socially relevant characteristics. 
It is reflected by relative uniformity of 
members with respect to age, sex, race, 
socio-economic status, interests, attitudes 
and habits.

6. Intimacy is the degree to which
members of a group are mutually acquainted
with one another and are familiar
with the most personal details of one another’s
lives. It is reflected by the nature
of topics discussed by members, by modes
of greeting, forms of address, and by interactions
which presuppose a knowledge
of the probable reaction of others under
widely differing circumstances, as well as
by the extent and type of knowledge each
member has about other members of the
group.

*From Hemphill and Westie (1950) with
permission of authors and The Journal Press.

7. Participation is the degree to which 
members of a group apply time and effort 
to group activities. It is reflected by the 
number and kinds of duties members per- 
form, by voluntary assumption of non- 
assigned duties and by the amount of time 
spent in group activities. 

8. Permeability is the degree to which 
a group permits ready access to member- 
ship. It is reflected by absence of entrance 
requirements of any kind, and by the de- 
gree to which membership is solicited. 

9. Polarization is the degree to which 
a group is oriented and works toward a 
single goal which is clear and specific to 
all members. 

10. Potency is the degree to which a 
group has primary significance for its 
members. It is reflected by the kind of 
needs which a group is satisfying or has 
the potentiality of satisfying, by the ex- 
tent of readjustment which would be re- 
quired of members should the group fail, 
and by the degree to which a group has 
meaning to the members with reference 
to their central values. 

11. Size is the number of members re- 
garded as being in the group. 

12. Stability is the degree to which a 
group persists over a period of time with 
essentially the same characteristics. It is 
reflected by the rate of membership turn- 
over, by frequency of reorganizations and 
by constancy of group size. 

13. Stratification is the degree to which 
a group orders its members into status 
hierarchies. It is reflected by differential 
distribution of power, privileges, obliga- 
tions, and duties and by asymmetrical pat- 
terns of differential behavior among mem- 
bers. 

14. Viscidity is the degree to which 
members of the group function as a unit. 
It is reflected by absence of dissension and 
personal conflict among members, by absence
of activities serving to advance only
the interests of individual group members, 
by the ability of the group to resist dis- 
rupting forces, and by the belief on the 
part of the members that the group does 
function as a unit. 


Each of these dimensions is repre- 
sented by a series of items (e.g., for 
Control: “Activities of the group are 
supervised,” and “Members fear to ex- 
press their real opinions”), which are 
rated by an observer of the group on 
a five-point scale ranging from “definitely
true” to “definitely false.”


The scales have adequate reliability, 
yield scores showing a high degree of 
agreement between respondents describing 
the same group, and with a few exceptions 
have low intercorrelations among them- 
selves. (Hemphill and Westie, 1950.) 
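
The rating procedure just described can be illustrated by a small scoring sketch. Several things in it are assumptions made for the example: the numerical coding of the five scale points, the wording of the three intermediate points, the use of a simple mean as the dimension score, and the Hedonic Tone item. Only the two Control items are quoted from the text.

```python
from statistics import mean

# Assumed coding; the source names only the two end points of the scale.
SCALE = {
    "definitely true": 5,
    "mostly true": 4,
    "undecided": 3,
    "mostly false": 2,
    "definitely false": 1,
}

ITEMS = {
    "Control": [
        "Activities of the group are supervised",
        "Members fear to express their real opinions",
    ],
    # Hypothetical item, written for this example only.
    "Hedonic Tone": ["Members laugh often during meetings"],
}


def dimension_scores(items_by_dimension, ratings):
    """Average one observer's coded ratings over the items of each dimension."""
    return {
        dimension: mean([SCALE[ratings[item]] for item in items])
        for dimension, items in items_by_dimension.items()
    }


# One observer's ratings of a single group.
ratings = {
    "Activities of the group are supervised": "definitely true",
    "Members fear to express their real opinions": "mostly false",
    "Members laugh often during meetings": "mostly true",
}

scores = dimension_scores(ITEMS, ratings)
print(scores)
```

An actual scoring key might weight items or reverse some of them. The sketch shows only the general step from item ratings to one score per dimension.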


GUIDE TO THE OBSERVATION OF GROUP BEHAVIOR


This important topic has been so ably
discussed by Heyns and Zander that
many of their comments are presented
here in only slightly abridged and
adapted form:*

*From R. W. Heyns and A. F. Zander,
Observation of group behavior. In L. Festinger
and D. Katz (Eds.), 1953. Research
methods in the behavioral sciences. New
York: Dryden. Pp. 398-415. (With permission
of authors and publisher.) The investigator
who has had no previous experience in the
use of rating and category approaches to the
description of group processes may read the
original report in its entirety with profit. The
excerpts reported here will serve as a helpful
review and summary for all investigators interested
in the measurement of group processes.


The importance of theory in the design
of experiments is often neglected. We
need only point out here that the theoretical
framework plays a central role in determining
the decision which the experimenter
must make in developing his
methodology. This is particularly true 
with respect to selection of the behavior 
to be observed or rated and the definition 
of the categories into which these be- 
haviors are to be placed. Without a 
knowledge of the purpose of the experi- 
ment or research and the theoretical set- 
ting in which the experiment takes place, 
no one can prescribe for an experimenter 
the dimensions with which he ought to 
be concerned, or the amount of inference 
he ought to require of his observers. The 
experimental design and the specific hy- 
potheses to be tested will dictate the level 
of reliability of observed data which will 
be necessary. Many other decision impli- 
cations of the theoretical framework could 
be stated; several of these will be made 
explicit in the discussion of decision areas 
which follows. 


THE FRAME OF REFERENCE 


In the interests of reliability, observers 
must be clear about the dimension to be 
observed and about the vantage point they 
are to use in observing, recording, coding, 
or rating it. Observers may be instructed, 
for example, to observe the social inter- 
action along the dimension of interper- 
sonal affect—i.e., the extent to which the 
members of the group are personally fond 
of each other. As the dimension is fairly 
clear, the question arises whether the ob- 
servers are to react to the observed be- 
havior as if they are participants or 
whether they are to identify with the 
actor, making inferences about his mo- 
tives. These are different frames of ref- 
erence. Many others are possible. 


SIZE OF UNIT 


After the experimenter has determined 
the dimensions along which he wants rat- 
ings or categorization, he faces the prob- 
lem of determining the size of the unit 
of behavior to be rated or categorized. 
Failure to state clearly to the observers
the precise limits of the unit of observa- 
tion is a frequent source of unreliability. 
The reliability of the coding depends in 
part on the reliability of deciding what 
constitutes a unit of behavior. 

Roughly speaking, the size of the unit 
may vary from a single act—for example, 
the contribution of a single participant— 
to a total period of interaction. Time sam- 
pling (categorizing or rating units of time 
distributed randomly over the total action 
period) is an intermediate technique. This 
can obviously include a single act or all 
acts. 

In categorization, the selection of the 
size of unit is not independent of the 
category system itself. This has led to a 
fairly common practice of defining units 
in some such fashion as this: “any act 
which is classifiable into a single cate- 
gory.” Failure to consider this aspect in 
the definition of units will perforce affect 
reliability. Thus, in practice, experimenters 
dealing with verbal behavior of members 
of problem-solving groups have often 
found it impossible to deal with a total 
simple participation as the unit. A lengthy 
verbal statement too frequently contains 
elements which permit classification in 
several categories. 


SAMPLING METHODS IN 
OBSERVATION 


It often happens that one cannot, or 
does not want to, record all of the be- 
havior that takes place in a given situa- 
tion. This may occur because the meeting 
lasts too long to observe continuously, too 
many persons are present, interactions 
come too rapidly to record, or for any of 
a variety of reasons. In such a case we 
are forced to develop a method of obtain- 
ing a representative sample of the behavior 
being observed. This can be done in a 
number of ways: (1) Attention can be 
concentrated on the behavior of a few of 
the members, ignoring the others present. 
(2) Attention can be directed to each 
person, or to a number of persons in the 
group, each for a given length of time. 
(3) The whole observation instrument can 
be divided into parts and the social setting 
can be observed in terms of each part of 
the observation schedule for a standard- 
ized length of time. (4) Observations can 
be made only when certain key behaviors 
have been introduced into the meeting. 
(5) The most frequently used system for 
obtaining representative samples of the be- 
havior being observed is the time-sampling 
system, in which a standardized time unit 
is selected during which observation takes 
place. The assumption is that these parts 
will be an adequate description of all of 
the events. 

In general, it is risky to use sampling 
procedures unless one has an adequate 
theory to guide the selection of the sam- 
ple. As an illustration, let us assume that 
we wish to observe a number of behaviors 
in a group but for some reason it is neces- 
sary to use a time-sampling procedure. 
We make the decision to observe in terms 
of a certain set of categories for five min- 
utes, skip ten minutes (during which we 
may be observing with a different set of 
“spectacles”) and return for another five 
minutes with the original categories, and 
so on. Any social situation, however, is a 
changing set of activities, and we may 
discover that certain events occur during 
the ten minutes we are not observing 
which distort the representativeness of the 
records made during our time samples. 
Thus, we would rightly be suspicious of 
the adequacy of our data. We could have 
avoided this difficulty if we had had some 
notion of the range of possibilities of cru- 
cial changes in this group which could 
affect the adequacy of our time samples 
and had arranged our sampling procedure 
in such a way that it covered all of the 
crucial differences which may have dis- 
torted the representativeness of our time 
sample. 
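
The five-minutes-on, ten-minutes-off plan described above can be sketched in a few lines. This is only an illustration under stated assumptions: the function names, the 60-minute meeting, and the minutes at which "crucial" events occur are all hypothetical and are not taken from any study cited here.

```python
def time_sample_schedule(total_minutes, observe=5, skip=10):
    """Return (start, end) minute pairs for the observation windows."""
    windows = []
    start = 0
    while start < total_minutes:
        end = min(start + observe, total_minutes)
        windows.append((start, end))
        start += observe + skip  # jump over the unobserved stretch
    return windows


def missed_events(event_minutes, windows):
    """Return the events that fall outside every observation window."""
    return [t for t in event_minutes
            if not any(lo <= t < hi for lo, hi in windows)]


# Hypothetical 60-minute meeting, observed 5 minutes out of every 15.
schedule = time_sample_schedule(60)

# Hypothetical minutes at which a crucial change in the group occurs.
crucial = [3, 12, 31, 48]
unseen = missed_events(crucial, schedule)
```

Any event listed in `unseen` took place while the observer was not recording with the original categories, which is exactly the circumstance that makes the time samples suspect.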

Another problem one may encounter 
while using time sampling is that the psy- 
chological meaningfulness of the behavior 
being observed may be destroyed. For 
example, rating a group’s reactions to a 
given act of a leader may not give a mean- 
ingful picture; that is, it may be atomistic 
or incomplete. When one observes the 
complete sequence of events in a group’s 
reaction to a leader’s act before recording 
anything, however, it is possible to rate 
or otherwise record the behavior in a way 
that includes all of the group’s reactions 
and all of the implications of these actions. 


TRAINING OBSERVERS 


The use of group observers means that 
we are using people as measuring instru- 
ments. A good measuring instrument is 
one which will accurately measure at va- 
rious times what it is supposed to measure. 
If a person is to become an “observing 
instrument,” he must be trained to see 
what it is required that he see. This may 
be a simple or a complex task, depending 
upon the data needed. It is an important 
phase of the preparation of a study using 
group observers and cannot receive too 
much attention. A well-developed obser- 
vation schedule will be only as good as 
the skill of the persons who are asked to 
use it. 

In general, one can expect that the ob- 
servers will have the greatest problems on 
those categories which require integration 
or collation of complex phenomena. They 
will have the least difficulty, in contrast, 
with those events which are simple ob- 
jective occurrences which require little 
insight or sensitivity on the part of the 
observer. Studies of coder reliability 
(Guetzkow, 1950) have found that there 
is most disagreement on data which are 
complicated and demand much inference. 
Since an observer functions as a highly 
trained coder, it is quite likely that he 
will have similar problems. 

This suggests that the skills required of 
an observer cannot be performed by all 
persons equally well, quite aside from the 
academic training they may have had. One 
may find that a group of observers has 
comparable ability to understand the phe- 
nomena involved in a study and to discuss 
them intelligently but that some of them 
may not be able to “see” these things in 
a group of interacting people. A study by 
Luszki (1950) provides some insights 
which are relevant. She found that there 
is a positive relation between sensitivity 
to the feelings and emotions of others 
(empathic ability) and the ability to “see” 
what is happening in a social situation. 
Persons with good empathic ability (1) 
are better able to see what is happening 
in the role performance of others, (2) 
have good personal adjustment, as meas- 
ured by the instruments used in the study, 
(3) have insights into themselves which 
are similar to the evaluations made of them 
by others, (4) have more stable, positive, 
and secure feelings about the self and 
somewhat favorable perceptions of others, 
(5) are perceived more consistently and 
more favorably by others. 


RELIABILITY 


The reliability of observational instru- 
ments has been much more a matter of 
concern to investigators of social behavior 
than has validity. For purposes of exposi- 
tion it may be useful to make a distinction 
that is not often made. In assessing the 
reliability of a system of social observa- 
tion, it is necessary to differentiate the 
reliability of the behavior being observed 
from the reliability of the categorization 
or rating which is made of that behavior. 
In other words, the reliability of the ob- 
server and of the behavior are separate 
problems. It is clear, of course, from this 
distinction that the consistency of behavior 
is a substantive problem, whereas the con- 
sistency of an observer is a methodologi- 
cal problem. Once the consistency of an 
observer has been established, it becomes 
possible to tackle the problem of the con- 
sistency of the behavior. 

The task of assessing degree of agree- 
ment among observers, in the case of a 
category system, breaks down into at least 
two separate reliability problems. The first 
is the extent of agreement among observers 
with respect to the number of units coded. 
This is essentially the extent to which the 
observers agree as to the boundaries of 
the unit which is to be categorized. Guetz- 
kow (1950) has pointed out, as has Bales 
(1950), that unreliability in this area is an 
important factor affecting the reliability 
of categorization itself. It seems clear that 
until there is rater agreement as to the 
boundaries of units to be categorized, 
there is little purpose in assessing rater 
agreement on categorizing. The second 
principal task in assessing reliability is to 
determine the extent to which observers 
agree on the category or rating they as- 
sign to a specific unit of behavior. 

The most frequently used statistic in 
appraising degree of agreement between 
observers has been the correlation coeffi- 
cient. This is especially useful, provided 
the assumptions underlying its use can be 
met, because the extent of agreement that 
is being obtained can be evaluated in terms 
of a fixed standard. It is a separate ques- 
tion, however, whether in a given case 
one needs a correlation of 0.7, 0.8, or 0.9 
to be certain that one’s degree of agree- 
ment is satisfactory. It is apparent that 
the question immediately becomes: satis- 
factory for what? In other words, the 
experimenter or investigator must ask him- 
self the purpose of his observational scores. 
His theory will indicate the extent to 
which large or small differences are to be 
expected. It is a truism that where fine 
differentiation is necessary the scores must 
be more reliable than where gross differ- 
ences can be expected. It is impossible to 
state categorically that observational scores 
should be at such and such a level of re- 
liability to be useful, for the usefulness 
of a score depends on the use to which 
it is to be put. 
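
As a minimal sketch of the correlational approach, the product-moment correlation between two observers' ratings of the same units can be computed as follows. The ratings are hypothetical, and the code takes for granted that the assumptions underlying the coefficient are met.

```python
from math import sqrt


def pearson_r(x, y):
    """Product-moment correlation between two equally long lists of ratings."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two rating lists of equal length (at least 2)")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


# Hypothetical ratings of six units of behavior by two observers.
observer_a = [2, 4, 3, 5, 1, 4]
observer_b = [3, 4, 3, 5, 2, 5]
r = pearson_r(observer_a, observer_b)
```

Whether the resulting coefficient is high enough remains, as noted above, a question of the use to which the scores are to be put.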

The second principal statistical device 
which has been utilized in characterizing 
amount of agreement between observers 
has been a percent-agreement score. This 
is essentially a matter of computing the 
percentage of the total number of items 
which were classified in the same way by 
the two observers or by all the observers 
combined. Guetzkow (1950) has provided 
statistical devices for evaluating the level 
of agreement. 
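
A percent-agreement score for two observers can be sketched as follows. The category labels and codings are hypothetical; the sketch makes no correction for chance agreement and does not reproduce the evaluative devices Guetzkow provides.

```python
def percent_agreement(codes_a, codes_b):
    """Percentage of units that two observers placed in the same category."""
    if len(codes_a) != len(codes_b) or not codes_a:
        raise ValueError("need two non-empty codings of the same units")
    same = sum(1 for a, b in zip(codes_a, codes_b) if a == b)
    return 100.0 * same / len(codes_a)


# Hypothetical codings of ten units of behavior.
coder_1 = ["ask", "give", "give", "ask", "agree",
           "give", "ask", "agree", "give", "ask"]
coder_2 = ["ask", "give", "ask", "ask", "agree",
           "give", "ask", "give", "give", "ask"]
agreement = percent_agreement(coder_1, coder_2)
```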

Bales (1950) illustrates the use of the 
chi square to evaluate degree of observer 
agreement. He points out that his use of 
chi square is different from the more con- 
ventional applications, since his use applies 
it to a situation which does not represent 
random sampling. The principal advantage 
of chi square is that it does not require 
the assumptions of the usual parametric 
techniques. 

It seems useful to point out that the in- 
vestigator ought to be concerned with the 
error of the measure actually used in 
the analysis of the data. For example, it 
is a matter of relative unimportance 
whether observers agree with respect to 
the number of units of behavior assigned 
to a specific individual if the score with 
which the investigator is concerned is the 
number of units in each category made 
by the group as a whole. In that case the 
reliability of individual scores is not of 
concern as long as high group reliability 
is present. Another illustration is the re- 
liability of the categorization of each unit 
as opposed to the reliability of a category 
score for an individual. That is, the agree- 
ment between observers as to the percent- 
age of responses categorized in each of 
the categories in the system may be very 
high even though the observers show rela- 
tively low agreement on the categoriza- 
tion of each item. This condition occurs, 
of course, only when the errors of cate- 
gorization are fairly random. The point 
is, however, that the investigator must ask 
himself what score he is going to use and 
must then assess the reliability of that 
score. 
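
The last illustration can be made concrete with a small sketch. The two codings below are hypothetical and were chosen so that the observers' disagreements cancel out: they agree on only half of the individual units, yet they arrive at identical percentages in each category.

```python
from collections import Counter


def unit_agreement(codes_a, codes_b):
    """Percentage of units coded identically by the two observers."""
    same = sum(1 for a, b in zip(codes_a, codes_b) if a == b)
    return 100.0 * same / len(codes_a)


def category_percentages(codes):
    """Percentage of all units that one observer placed in each category."""
    counts = Counter(codes)
    return {cat: 100.0 * n / len(codes) for cat, n in counts.items()}


# Hypothetical codings of eight units by two observers.
obs_a = ["ask", "give", "ask", "give", "ask", "give", "ask", "give"]
obs_b = ["give", "ask", "ask", "give", "give", "ask", "ask", "give"]

per_unit = unit_agreement(obs_a, obs_b)
profile_a = category_percentages(obs_a)
profile_b = category_percentages(obs_b)
```

If the category percentages are the scores to be analyzed, it is their agreement, not `per_unit`, that bears on the study.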


THE RELATION OF THE 
OBSERVER TO THE GROUP 
BEING OBSERVED 


One of the most frequent questions 
addressed to the investigator who reports 
a study which used observers is: “Did the 
presence of the observers influence the 
behavior of the group?” Usually he must 
answer that he has no evidence that the 
observers influenced the results in any 
way, but that they might have. It is quite 
conceivable that the presence of an ob- 
server may be an important variable. This 
depends upon the nature of the group, the 
type of observation, the nature of the 
group’s activity, the nature of the varia- 
bles being observed, and a number of other 
things. Arsenian (1943) found that the 
simple presence of an adult sitting near 
the door seemed to lend assurance to a 
group of nursery-school children. Yet the 
presence of observers was a threat to 
young boys at a summer camp, according 
to Polansky (1949). The influence of ob- 
servers is a methodological problem which 
needs more careful study. 

Deutsch (1949) found that the members 
of small groups which met frequently over 
a period of three weeks were aware of the 
presence of observers at the beginning of 
their work together but had become al- 
most oblivious to them by the end of that 
period. Half of these groups were con- 
structed in such a way that the members 
cooperated with one another to achieve 
a common goal; the rest of the groups 
were in a situation in which the members 
competed with one another. It is interest- 
ing to note that the competitive groups 
were much more conscious of the observ- 
ers than were the cooperative groups. 
Thus, procedural differences within a 
group may influence the way in which 
the observers are perceived. 

Polansky (1949) noted that changes 
took place in the perception by boy 
campers of the purpose and role of cruis- 
ing observers. Early in the summer, re- 
search observers were well accepted by 
the campers, who had been told that the 
strangers were neutral persons who were 
interested in learning how they could im- 
prove the camp. By the third week, how- 
ever, they had become the objects of ag- 
gression by many of the boys. The ob- 
servers decided that their role was too 
ambiguous and represented a threat to 
these boys, many of whom were rebellious 
against adult authority. It was felt that 
the campers were projecting their feelings 
of hostility toward adults onto the ob- 
servers. After the observers changed their 
behavior to become more warm, human, 
and friendly, they found that they were 
no longer rejected by the campers. 

Naturally, we must be cautious about 
generalizing from such an experience as 
this. In a laboratory situation, for example, 
where the group members are working 
toward a decision, it would no doubt be 
quite disruptive for an observer to indi- 
cate that he feels warm and friendly to- 
ward the participants. In short, in some 
situations we may want one perception of 
the observer; in another situation we may 
need quite a different one. 

Bales (1950) has used observers in a 
wide variety of laboratory arrangements. 
They have sat with the group, or behind 
a one-way screen with the group aware 
they are there, or behind the screen with 
the group wondering whether they are 
there. He has found no difference in the 
behavior of the groups which could be 
attributed to the influence of these various 
positions of the observers. 

Thus, it appears that group observers 
sometimes influence a social situation. 
When in doubt, it is wisest to explain the 
function of the observers to the group in 
some way that does not spoil any neces- 
sary naiveté of the subjects. The observers 
will do best, barring any special conditions 
such as those met by Polansky (1949), by 
showing all of the external signs of a piece 
of furniture. If they behave like a person 
who is having no reactions to the events 
being observed, the group members will 
perceive them as such. For most situations 
that is probably best. 


SCALES FOR MEASURING THE 
SOCIAL STRUCTURE OF GROUPS 


The social structure of groups can 
be described and evaluated by general 
observational procedures. However, it 
is often convenient to use the verbal 
responses of the children themselves to 
make inferences about the structure of 
the groups in which they hold mem- 
bership. “Sociometric” instruments are 
convenient and useful for a number of 
reasons. They provide data that have 
been shown to be related to several 
other important dimensions of group 
processes, such as morale, group effec- 
tiveness, and patterns of communica- 
tion. They tap a residue of social-rela- 
tions tendencies that are usually diffi- 
cult to sample in direct observations of 
social behavior. They provide reason- 
ably reliable measures of what has been 
shown to be a fairly stable component 
of human-response tendencies. They 
can be employed in comparable form 
over a wide age-range of subjects; 
hence they have been useful in de- 
scribing developmental trends in social 
relations (Thompson and Horrocks, 
1947; Horrocks and Buker, 1951). 

“Nominating” or Partial-Rank-Order 
Procedures. Although the “social dis- 
tance” scales of Bogardus (1933) had 
priority in this category, the socio- 
metric techniques of Moreno (1951, 
1953) have been by far the most in- 
fluential in stimulating a mass of studies 
about the social structure of groups 
(Criswell, 1943; Loomis and Pepinsky, 
1948; Lindzey and Borgatta, 1954). 
The partial-rank-order approach pro- 
posed by Moreno is simple and flexi- 
ble. It consists of any desired number 
of questions such as the following: 
“Whom would you most like to have 
sit next to you in this classroom (or 
other group)?” “Whom would you 
most like to have go with you on a 
picnic (or on a dangerous military mis- 
sion, or to a party, and so on)?” “To 
whom do you talk most frequently?” 
“Who is your best friend?” And so on 
through the gamut of possible social 
relationships.* 

Answers to questions of the forego- 
ing types may be solicited by inter- 
views or by written forms similar to 
the one presented in Table 19-2. The 
respondent may be requested to give 
his first, second, third, fourth, and fifth 
choices (or some smaller, or larger, or- 
dering), or he may be asked to list as 
many individuals in a preferred order 
as he wishes.* 

The obtained data are usually tabu- 
lated in a matrix similar to the one pre- 
sented in Fig. 19-2. They are also often 
plotted in a sociogram like that shown 
in Fig. 19-3. 


*The directions for administering socio- 
metric tests of the partial-rank-order type and 
a convenient form for recording the obtained 
nominations are now commercially available 
(Bonney and Fessenden, 1955). 

*It should be noted that these questions may 
also be phrased in a negative way; e.g., 
“Whom would you least like to have sit next 
to you?” 

*Lippitt and associates (1952) have used 
photographs to help subjects identify other 
members of the group in the early stages of 
camping experience. McCandless and Marshall 
(1957) have also used photographs to help 
offset the “out of sight, out of mind” tend- 
encies among preschool-aged subjects. 


TABLE 19-2. THE GENERAL FORMAT OF A TYPICAL SOCIOMETRIC QUESTIONNAIRE 
OF THE PARTIAL-RANK-ORDER TYPE 


A. Whom would you most like to have 
sit next to you in this classroom? 

1st choice .......... 
2nd choice .......... 
3rd choice .......... 

B. If you were going on a picnic, whom 
would you most like to have go with 
you? 

1st choice .......... 
2nd choice .......... 
3rd choice .......... 


Fig. 19-2. A matrix of nominations in a Boy Scout group of 10 members, with three 
choices permitted each member in response to, “Whom would you most like to have as 
your buddy on an all-day hiking trip?” [Matrix: rows are the members making nominations, 
columns the nominations received. Total number of nominations received: Ralph A., 3; 
Carl C., 8; John H., 3; Bill K., 2; Ray L., 8; Jim M., 0; George P., 3; Art R., 2; 
Sam S., 0; Frank T., 1.] 


Fig. 19-3. The social structure of one classroom. The data for this sociogram were obtained 
by asking each pupil to make first, second, and third choices among his classmates for a 
particular social activity. From Jennings (1948). With permission of the American Council 
on Education. 
Legend: arrow, one-way choice; crossed line, mutual choice; 1, 2, or 3 = order of choice. 
Note: For an absent boy or girl, use the respective symbol dashed, leaving any choice line 
open-ended. (See the case of Joe Brown in the sociogram.) If rejections are obtained, the 
choice line may be made in dashes or in a different color. Whenever a direct line from 
chooser to chosen cannot be drawn without crossing through the symbol for another 
individual, the line should be drawn with an elbow, as in the case of Bill Lane to Paula King. 

The obtained data may be manipu- 
lated in a number of different Ways to 
describe several properties of group 
structure or to define the individual 
member’s social relationships with 
other members in the group. Experi- 
menters have been unusually ingenious 
in developing social-relations indices 
based on the partial-rank-order ap- 
proach (partial listing in rank order) 
to sociometry (see Proctor and Loomis, 
1951, for a summary). 
For example, in instances in which 
the interest is in cliques of preferred 
companions or in describing networks 
of communication within the group, 
Luce and Perry (1949) have developed 
a method for identifying n-chains and 
cliques by squaring and cubing the 
matrix of intersubject preferences 
(see also Festinger, 1949). Festinger, 
Schachter, and Back (1950) used this 
method of matrix multiplication to ad- 
vantage in describing the patterns of 
interconnections existing in a 
multiple-unit housing project. Although 
this approach does not make available 
any information that could not be ob- 
tained by a sustained inspection of the 
matrix, it does provide a straightfor- 
ward procedure for identifying social 
interconnections in a rigorous way. The 
availability of electronic computers 
greatly expedites the squaring and cub- 
ing of matrices obtained for conven- 
tional-sized small groups. 
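
The squaring and cubing of a choice matrix can be sketched as follows. The four-member group and all names in the code are hypothetical. The reading given to the cubed matrix (a member belongs to a clique of three or more when his diagonal entry in the cube of the mutual-choice matrix is positive) is our understanding of the Luce and Perry result and should be checked against their paper before it is relied upon.

```python
def matmul(a, b):
    """Multiply two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def mutual(x):
    """Keep only reciprocated choices: s[i][j] = 1 if i and j choose each other."""
    n = len(x)
    return [[1 if x[i][j] and x[j][i] else 0 for j in range(n)]
            for i in range(n)]


# Hypothetical choice matrix: x[i][j] = 1 if member i chooses member j.
# Members 0, 1, and 2 choose one another; member 3 chooses member 0 only.
x = [
    [0, 1, 1, 0],
    [1, 0, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 0, 0],
]

x2 = matmul(x, x)             # x2[i][j] counts two-step chains from i to j
s = mutual(x)
s3 = matmul(matmul(s, s), s)  # cube of the mutual-choice matrix
in_clique = [s3[i][i] > 0 for i in range(len(x))]
```

Here `x2[3][1]` equals 1, reflecting the single two-step chain from member 3 through member 0 to member 1, and only members 0, 1, and 2 are flagged in `in_clique`.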

Another index of social relations has 
been of enduring interest to investiga- 
tors with a primary concern for the 
individual’s status or social position 
within a group. As may be seen in Fig. 
19-2, it is relatively simple to sum the 
nominations in a given column and 
thereby find out how frequently the 
particular person associated with that 
column has been selected as a preferred 
companion by all other members of the 
group. When all of the columns are 
summed for a typical matrix, the re- 
sulting “scores” of social preference or 
“acceptability” usually comprise a 
highly skewed distribution. Three or 
four members receive over half of the 
votes and a few individuals may receive 
only one or two nominations, or none 
at all. 
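
The column sums can be sketched as follows. The five-member group, its choices, and the function name are hypothetical. The final lines use the column totals printed in Fig. 19-2, read here as 3, 8, 3, 2, 8, 0, 3, 2, 0, and 1, to show how unevenly the nominations are distributed.

```python
def nominations_received(choices, members):
    """Build the nomination matrix and sum each column.

    choices maps each chooser to the list of members he nominates.
    """
    index = {m: k for k, m in enumerate(members)}
    matrix = [[0] * len(members) for _ in members]
    for chooser, chosen in choices.items():
        for c in chosen:
            matrix[index[chooser]][index[c]] = 1
    totals = [sum(row[j] for row in matrix) for j in range(len(members))]
    return matrix, dict(zip(members, totals))


# Hypothetical group of five, two choices permitted each member.
members = ["A", "B", "C", "D", "E"]
choices = {
    "A": ["B", "C"],
    "B": ["C", "A"],
    "C": ["B", "A"],
    "D": ["B", "C"],
    "E": ["B", "D"],
}
matrix, received = nominations_received(choices, members)

# Column totals as read from Fig. 19-2 (10 members, 3 choices each).
fig_totals = [3, 8, 3, 2, 8, 0, 3, 2, 0, 1]
top_two_share = sum(sorted(fig_totals, reverse=True)[:2]) / sum(fig_totals)
```

In the Fig. 19-2 totals the two most-chosen boys receive 16 of the 30 nominations, while two boys receive none.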

It has become conventional in soci- 
ometry to speak of “stars” (individuals 
with many nominations) and “isolates” 
(individuals receiving few, or no, nomi- 
nations). How many nominations, or 
how few, define a star or an isolate in 
a given-sized group making a definite 
number of nominations? Bronfenbren- 
ner (1945) has proposed one solution 
to this problem. His method, described 
as a “deviation from chance expect- 
ancy” approach, involves fitting a Pear- 
son Type III function to the obtained 
nominations and defining stars and iso- 
lates as falling within given restricted 
areas at appropriate ends of the theo- 
retical distribution. Although this ap- 
proach does constitute an unambiguous 
method for defining certain levels of 
social status, or acceptability, within a 
group, it does not always provide an 
unambiguous or psychologically mean- 
ingful basis for identifying stars and 
isolates (Thompson, Bligh, and Wi- 
tryol, 1951). 
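
The general idea of a chance expectancy can be illustrated with a simpler stand-in than the Pearson Type III fit. The sketch below is not Bronfenbrenner's procedure. It assumes only that every member distributes his choices at random among the other members, so that the number of nominations a given member receives follows a binomial distribution. The group size and number of choices are those of Fig. 19-2; the cutoff of eight nominations is chosen purely for illustration.

```python
from math import comb


def chance_distribution(group_size, choices_each):
    """P(a member receives k nominations) when all others choose at random."""
    n = group_size - 1                     # members who could nominate him
    p = choices_each / (group_size - 1)    # chance that one chooser names him
    return [comb(n, k) * p ** k * (1 - p) ** (n - k) for k in range(n + 1)]


# Ten members, three choices each, as in Fig. 19-2.
dist = chance_distribution(10, 3)
expected = sum(k * pk for k, pk in enumerate(dist))
p_isolate = dist[0]        # chance of receiving no nominations at all
p_star = sum(dist[8:])     # chance of receiving eight or more nominations
```

Under these assumptions a member expects three nominations, and a total of eight or more would arise by chance less than once in a thousand times.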

The partial-rank-order approach 
(partial listing in rank order) to the 
measurement of group structure has the 
advantage of simplicity in both data 
procurement and analysis. However, its 
simplicity makes it vulnerable to a large 
number of deficiencies. The reliability 
and stability of some of the obtained 
social-relations indices are reasonably 
high, but the stability of the individual 
nomination is highly variable (Witryol 
and Thompson, 1953). It suffers all of 
the deficiencies of an ordinal scale. For 
example, it cannot be legitimately em- 
ployed to compare the social-relations 
position or status of individuals who 
are members of nonoverlapping groups; 
nor can it be used to compare the so- 
cial structures of different groups.* 
These shortcomings have prompted a 
number of investigators to attempt to 
find more defensible methods for de- 
scribing the social structure of small 
groups.* 

The Syracuse Scales of Social Rela- 
tions. The Syracuse Scales were de- 
signed to overcome some of the de- 
ficiencies of the now traditional socio- 
metric procedures. They were con- 
structed to reflect specific individual 
needs rather than the more generally 
defined social situations of conven- 
tional sociometry. They involve each 
respondent’s adopting a reference pop- 
ulation that extends beyond the group 
being studied, so that the social struc- 
tures of different groups can be mean- 
ingfully compared. The bisection pro- 
cedures and the forced choices built 
into the rating procedures result in 
highly reliable and stable information 
about each of the social “interconnec- 
tions” within a group. The following 
is a brief description of these new 
scales: 


*See Gardner and Thompson (1956) for a 
discussion of the metric and psychological 
deficiencies of the partial-rank-order approach 
to sociometry. 

*The paired-comparison approach (Witryol 
and Thompson, 1953) is not elaborated as a 
measure of group structure because the data 
obtained by its use are relevant only to the 
social acceptability component. It does provide 
highly reliable and stable information for this 
purpose. 

*From Gardner and Thompson (1959a) 
with permission of authors and The World 
Book Company. 


To overcome some of the deficiencies 
of the traditional sociometric procedures, 
standardized instruments called the Syra- 
cuse Scales of Social Relations have been 
constructed and are available for use with 
elementary, junior high, and senior high 
school pupils. The Syracuse Scales present 
social situations each one of which elicits 
responses calling for the satisfaction of 
one particular psychological need. For ex- 
ample, one situation requires the pupil to 
rate his classmates as kind, sympathetic 
friends to whom he would go in time of 
trouble (need-succorance). His ratings are 
made in reference to a scale of all persons 
he has ever known. 

Figure 19-4 shows the frame of refer- 
ence set up by Johnny F., a fifth grade 
pupil. From among all the persons he has 
ever known, he has selected his mother 
as the person he would most like to have 
help him when he is troubled by some 
personal problem. He has written her 
name in the “five-star” box at the top of 
the rating sheet above the word MOST. 
He next chose Alice, one of his class- 
mates, as the person who seems to be least 
interested in helping him. He has entered 
her name in the “one-star” box above the 
word LEAST. Johnny decided that his 
Uncle Joe was about medium or in the 
middle for helping him when he is in 
trouble; that Dan, one of his classmates, 
was about halfway between his mother 
and his Uncle Joe; and that Neighbor 
Jones was about halfway between Alice 
and Uncle Joe. He has written these names 
in the appropriate boxes. 

The five names which Johnny has writ- 
ten in the boxes at the top of Fig. 19-4 
constitute the frame of reference for his 
evaluations of his classmates. He has rated 
each of his classmates as being nearest to 
one of these five persons. As can be seen 
in Fig. 19-4, he decided that Edith A. was 
a little less good than Neighbor Jones, so 
he drew a circle around the words “Less 
Good.” He considered Joan E., pupil No. 
5, as medium but a little better than Uncle 
Joe, so he drew a circle around the word 
“Better” in the column under MEDIUM. 
He evaluated all of his classmates in this 
fashion. 

Fig. 19-4. The frame of reference and the ratings of Johnny F., a fifth-grade pupil, on the 
Syracuse Scales of Social Relations. From Gardner and Thompson (1959a). With permission 
of authors and the World Book Co. [Rating sheet: reference boxes headed LEAST, BETWEEN, 
MEDIUM, BETWEEN, MOST, followed by one numbered row for each classmate.] 


Since the Syracuse Scales use a frame of 
reference that extends beyond the class- 
room membership, it is possible to com- 
pare the social relations status of a pupil 
in a number of subgroups to which he 
belongs—classroom, clubs, athletic groups, 
etc. The pupils he selects as desirable for 
satisfying one need may be a very differ- 
ent group from those he chooses as de- 
sirable for a second need. In like manner, 
the frame of reference he sets up for one 
need may also differ considerably from 
that set up for the second need. 

For a particular need situation, each 
pupil makes a rating of every other pupil 
on a scale ranging from LEAST to 
MOST. Figure 19-5 shows the ratings that 
Tommy Adams, a fifth-grade pupil, has 
given to each of his classmates on the 
need-succorance scale. A scoring guide has 
been applied which can be used to convert 
Tommy’s ratings to numerical scores. 
Thirteen numerical scores, ranging in 
value from 5 to 85, are marked off on the 
Scoring Guide to correspond to the rat- 
ings in the reference scale. 

In the illustration, Tommy has rated 
Phil B. as the person whom he would 
most like to have help him when he is 
in trouble and has placed this pupil’s name 
in the 5-star box above the word MOST. 
A numerical score of 85 would be as- 
signed to this rating. Tommy has rated 
Ray M., pupil No. 9, as a “Better” than 
MEDIUM person for satisfying his need. 
A score of 50 corresponds to this rating. 
In like manner, a score of 20 would be 
assigned to Tommy’s rating of Cathy S. 
and a score of 10 to his rating of Betty O. 

Fig. 19-5. An example of the scoring procedures employed for the Syracuse Scales of Social 
Relations. From Gardner and Thompson (1959a). With permission of authors and the World 
Book Co. [Rating sheet of Tommy Adams with the Scoring Guide for Syracuse Scales (World 
Book Company) applied; the guide is marked for mid-score made and mid-score received, 
10 through 80.] 


The rating that a pupil receives from 
each of his classmates can be assigned a 
numerical value in a similar fashion. When 
these scores are transferred to a pupil’s 
test booklet, it becomes a complete record 
of the ratings he makes of his classmates 
and the ratings he receives from them. 
The numerical scores can be used to 
determine a pupil’s average score for rat- 
ings made and for ratings received. These 
scores are referred to as mid-scores 
and represent the median or middle score. 
The mid-score of ratings made by Tommy 
is 45, the seventh score, since he 
rated 13 classmates. 
The mid-score of the ratings that a 
pupil makes and the mid-score of the rat- 
ings he receives can be interpreted not 
only in terms of the average scores of 
his classmates but also in terms of per- 
centile norms obtained from a large group 
of pupils at his own grade level. These 
social-relations norms, based on over a 
thousand pupils at each grade level, are 
similar in type to those provided with 
standardized achievement tests. 
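
The mid-score computation described above can be sketched in a few lines. This is a minimal illustration only: the thirteen ratings below are hypothetical values chosen so that the seventh score is 45, and the function name `mid_score` is an assumption, not part of the published scoring procedure.

```python
from statistics import median

# Hypothetical ratings made by one pupil of his 13 classmates.
# These values are illustrative; they are not taken from the scoring guide.
ratings_made = [10, 20, 20, 30, 35, 40, 45, 50, 50, 60, 70, 80, 85]


def mid_score(scores):
    """Return the median (middle) score of a list of numerical ratings."""
    return median(scores)


ordered = sorted(ratings_made)
# With 13 ratings, the middle score is the seventh (zero-based index 6).
middle_index = len(ordered) // 2

print(mid_score(ratings_made), ordered[middle_index])
```

The same function applies unchanged to the ratings a pupil receives. With an even number of ratings, `statistics.median` averages the two middle scores, which may or may not match the convention used in the published norms.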


Norms are also provided in which the
class average is used as a unit, so that the
teacher can compare the average social-
relations status of her class with the large
number of classes included in the norma-
tive group. Research has shown that this
information is related to group morale
(Gardner and Thompson, 1956).

“Chums,” “triangles,” and “cliques” are
graphically represented, using methods
similar to those utilized in sociograms. In
addition to a sheer representation of mu-
tual attractions, the magnitudes of the re-
ciprocal ratings also indicate the strength
of the social bonds that hold the members
of such subgroups together.


SCALES FOR MEASURING
SOCIAL ROLES


The partial-rank-order procedures
employed in Moreno’s sociometric ap-
proach have also been adapted for the
measurement of social roles, or reputa-
tions, in children’s groups. In this latter
usage the children’s nominations are
used as a metric for describing an in-
dividual’s behavior tendencies and so-
cial roles as they are viewed by all
other members of the group.

The “Guess Who?” Approach. This
technique, which has been used at least
since the time of Hartshorne and May’s
early investigations of character devel-
opment, presents each member of the
group with a series of social role de-
scriptions similar to those presented in
Table 19-3. The subject is asked to list
as many of his fellow group members
as he thinks fit each of these descrip-
tions.


TABLE 19-3. THE GENERAL FORMAT OF THE “GUESS WHO?” APPROACH TO
THE MEASUREMENT OF SOCIAL ROLES WITHIN A GROUP

A. Here is someone who always knows what to do so everyone will have a
   good time at a party. Guess who?

B. Here is someone who is always bossing others around and telling them
   what to do. Guess who?

The metric for each social role, or 
description, is simply the number of 
nominations received by each member 
of the group. Data obtained in this
fashion have been used to plot develop-
mental trends in a number of investi-
gations (Jones, 1943; Kuhlen and Lee,
1943; Tryon, 1939). Descriptions of
this type have also been collected into 
relatively homogeneous groupings to 
study the more general reputations of 
group members (deGroat and Thomp- 
son, 1949; Meyer and Thompson, 
1956). The reliability and stability of 
indices based on “Guess Who?” proce- 
dures are generally satisfactory for cor- 
relational purposes and for group com- 
parisons. 
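
The nomination metric described above amounts to a simple tally. The following sketch assumes a hypothetical four-member group and two descriptions labelled "A" and "B"; the names, the data, and the decision to ignore self-nominations and duplicate listings are all illustrative assumptions rather than features of any published procedure.

```python
from collections import Counter

# Hypothetical data: for each description, the classmates each pupil listed.
nominations = {
    "A": {"Ann": ["Bob", "Cal"], "Bob": ["Cal"], "Cal": ["Bob"], "Dee": ["Bob", "Ann"]},
    "B": {"Ann": ["Dee"], "Bob": ["Dee"], "Cal": [], "Dee": ["Cal"]},
}


def nomination_counts(lists_by_rater):
    """Count the nominations each group member receives for one description."""
    counts = Counter()
    for rater, named in lists_by_rater.items():
        # Count each nominee once per rater and skip self-nominations
        # (an assumption made for this sketch).
        for name in set(named):
            if name != rater:
                counts[name] += 1
    return counts


# One score per member per social role: the number of nominations received.
scores = {role: nomination_counts(lists) for role, lists in nominations.items()}

for role, counts in scores.items():
    print(role, dict(counts))
```

Scores of this kind can then be summed over related descriptions to form the more general reputation groupings mentioned above.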

Ratings of Social Roles and Group 
Climates. Ratings tend to yield reliable 
and useful information in direct pro- 
portion to their clarity and the insight- 
fulness of the raters. The rating scale 
accepts the validity of the statement 
“Man is the measure of all things” at 
face value and in its most primitive 
form. Ratings are generally employed 
for descriptive and evaluation purposes 
when the attribute being studied is not 
yet amenable to more indirect forms of 
measurement. (See Chapter 11 of Guil- 
ford, 1954, for a sophisticated discus- 
sion of rating scales and their proper 
use.)

A variation of the ratings approach, 
and one that has gained in favor during 
the last two decades, is the method of 
categorization of observations accord- 
ing to an a priori system. The fre- 
quency of behaviors assigned to differ- 
ent categories provides the metric for 
individual and group comparisons. Be- 
cause Withall’s category system is typi- 
cal of the more carefully conceived 
approaches, it has been selected for 
illustrative purposes. Withall’s approach
involves a rater (or raters) distributing
a teacher’s (or teachers’) statements
made within a classroom into the fol-
lowing categories:


Category 1. Learner-supportive state-
ments or questions. These are teacher-
statements or questions that express agree-
ment with ideas, actions or opinions of
the learner, or that commend or reassure
the learner. Agreement is frequently ex-
pressed by a monosyllabic response such
as “Yes,” “Right,” “Uhuh,” and the like.
Commendation or reassurance may be
stated in terms of

(a) class-accepted criteria or goals, or

(b) the private goals and subjective
criteria of the teacher.

The dominant intent of these statements
or questions is to praise, encourage or
bolster the learner.

Category 2. Acceptant or clarifying
statements or questions. These are teacher-
statements or questions which

(a) accept, i.e., evidence considerable
understanding by the teacher of, or

(b) clarify, i.e., restate clearly and suc-
cinctly in the teacher's words, the
ideational or the feeling content of
the learner’s statement.

The dominant intent of these teacher-re-
sponses is to help the learner gain
insight into his problem, i.e., define
his “real” problem and its solution
in more operational terms.

Category 3. Problem-structuring state-
ments or questions. Problem-structuring
responses by the teacher offer facts or
ideas or opinions to the learner about
either (i) phenomena, or (ii) procedures,
in a non-threatening and objective man-
ner. These responses contain no element
of advising or recommending the adop-
tion of certain ideas or procedures. Prob-
lem-structuring responses are frequently
posed as questions which seek further in-
formation from the learner about the
problem confronting him; or they may
be statements which offer information to
the learner about his problem. The learner
is free to accept or to reject in part or
in entirety the facts or opinions that are
presented to him. Problem-structuring re-
sponses may be questions which the
teacher asks either (a) to further increase
his own understanding of what the learner
has said or (b) to increase the precision of
the learner’s statement of the problem.
Problem-structuring responses are prob-
lem-centered rather than either teacher-
or learner-centered. Nevertheless the
dominant intent is to sustain the learner
by facilitating his problem-solving ac-
tivities.

*From Withall (1952) with permission of
author and Educational and Psychological
Measurement. It should be noted that this
category system is not restricted to use in a
classroom setting. It could also be employed
in any group in which the emphasis is upon
an instructional program.

Category 4. Statements evidencing no 
supportive intent. These statements are 
neither teacher-centered, nor learner-cen- 
tered, nor problem-centered. They con- 
stitute a small percentage of the total 
teacher-responses. These responses include 
statements in which the teacher (a) ques- 
tions himself aloud, (b) repeats verbatim 
a statement that the learner has just made, 
(c) uses a polite formality, (d) states an 
administrative or procedural detail such 
as “We'll meet in Room 25 tomorrow.” 

Category 5. Directive statements or 
questions. These are teacher-statements or 
questions which advise the learner re- 
garding a course of action or his future 
behavior; they either narrowly limit his 
choice or offer no choice. These state- 
ments recommend to the learner the facts 
or procedures that the teacher proffers 
him. These statements or questions convey 
the impression to the learner that the 
teacher expects and hopes that he will 
follow her prompting and that she will 
approve if he does. The dominant intent 
of these responses is to have the learner 
take up the teacher's point of view and 
pursue a course of action that she advo-
cates.

Category 6. Reproving, disapproving or
disparaging statements or questions. By 
means of these statements a teacher may 
express complete or partial disapproval of 
the ideas, behavior and personality foibles 
of the learner. The teacher’s internalized 
societal values largely enter into these re- 
sponses. By means of these statements some 
teachers believe they are fulfilling their 
responsibility of inculcating in young 
people society’s standards of acceptable be- 
havior and achievement. The dominant 
intent of these statements is (a) to repre-
sent to the learner societal values as the
teacher perceives them, (b) to admonish 
the learner for unacceptable behavior and 
to deter him from repeating it in the 
future, (c) to impress on the learner the 
fact that he has not met the criteria for 
successful achievement which the teacher 
sets up. 

Category 7. Teacher-supportive state- 
ments or questions. These are statements 
or questions in which the teacher refers to 
himself and expresses a defensive attitude, 
or refers to his present or past interests, 
activities, or possessions with the purpose 
of reassuring himself and of confirming 
his position or his ideas in the eyes of 
those around him. The dominant intent 
of these teacher-responses is to assert, to 
defend or to justify the teacher. Statements
in which the teacher perseverates on an 
idea, a belief or a suggestion fall into 
this category. By “perseverate” we mean a 
persisting in and a reiteration of an idea 
or opinion despite additional data which 
calls for a re-examination of the original 
idea or opinion. 

An application of this rating, or cate- 
gorization, approach to a sample of 271 
teacher-statements yielded a split-half 
reliability of .90. The mean percentage
of agreement between four raters and 
the investigator was found to be 65, 
with a range of 53 to 77 per cent. 
These findings demonstrate something 
of the power of the categorization ap-
proach when it is skillfully conceived
and administered by well-trained
judges.
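
The two quantities used in this approach, category frequencies and rater agreement, can be illustrated with a short sketch. The codes below are hypothetical assignments of ten teacher-statements to Withall's seven categories, and agreement is computed as a simple statement-by-statement percentage; the source does not specify the exact agreement formula, so this choice is an assumption.

```python
from collections import Counter

# Hypothetical category codes (1-7) assigned to the same ten teacher-statements.
investigator = [1, 3, 3, 5, 6, 2, 3, 4, 5, 7]
rater = [1, 3, 2, 5, 6, 2, 3, 4, 6, 7]


def category_percentages(codes, n_categories=7):
    """Percentage of statements falling into each category (the metric)."""
    counts = Counter(codes)
    total = len(codes)
    return {cat: 100.0 * counts.get(cat, 0) / total
            for cat in range(1, n_categories + 1)}


def percent_agreement(codes_a, codes_b):
    """Percentage of statements placed in the same category by two judges."""
    if len(codes_a) != len(codes_b):
        raise ValueError("both judges must code the same statements")
    matches = sum(a == b for a, b in zip(codes_a, codes_b))
    return 100.0 * matches / len(codes_a)


print(category_percentages(investigator))
print(percent_agreement(investigator, rater))
```

A simple percentage of this kind does not correct for agreement expected by chance, so it should be read as a rough index only.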

Thelen and Withall (1949) have 
shown that the foregoing rating scheme 
yields data that are in rough agreement 
with pupils’ responses as measured by 
an “audience-reception” technique (pu-
pils push buttons to indicate their dif-
ferential reactions to the social-emo-
tional climate of the classroom).
Wrightstone (1951) has proposed a 
somewhat different technique for meas- 
uring the social climate of a classroom, 
and Horowitz and Cartwright (1953) 
have suggested that projective methods 
may be used to advantage in describing 
the more general properties of a group. 
Gardner and Thompson (1956) have 
successfully employed a number of 
novel approaches to the measurement 
of esprit de corps, or morale, within a
group (e.g., estimates of asymptotes of 
pleasant and unpleasant recall functions 
and patterning of values produced in 
defense of group’s continuing exist-
ence).


“BACKGROUND” LITERATURE 
FOR RESEARCH 


Research without a theoretical frame 
of reference is becoming relatively rare 
in social psychology. The advantages 
of a guiding theory have become so 
obvious to investigators that the ma- 
jority hesitates to conduct empirical 
studies that have no explicit theoretical 
home. It is fortunate for the beginning
student with a primary interest in chil-
dren’s groups that the background lit-
erature on group structure and dy-
namics is now readily accessible.
The following “books of readings”
edited by highly competent investiga-
tors of group processes provide a
modest-sized but very usable library:
Cartwright and Zander (1953), Fest-
inger and Katz (1953), Guetzkow
(1951), Hare, Borgatta, and Bales
(1955), Lazarsfeld and Rosenberg
(1955), Miller (1950), Sherif and Wil-
son (1953), Sullivan (1952), and Ta-
giuri and Petrullo (1958).


The Problem


In this chapter we discuss how we can 
study the interpersonal behavior in 
which children are involved. Although 
this may appear to be an easy thing to 
do, most active researchers would agree 
that, next to children’s motives, it is 
the most tangled and difficult area of 
study in all of child development. Fur- 
thermore, there is a sense in which most 
of the techniques we have developed 
in this area are short-cut methods that 
permit us to work in the general realm 
of the “interpersonal” but that also 
usually evade some of the important 
problems.

First we wish to create a “sense of 
problem” about interpersonal behavior 
because there is no doubt that there is 
a number of exciting and subtle issues 
here. Then we must look at what we 
now have available to work with.
Finally, we speculate concerning the
costly and complicated techniques that
the future may bring, which will permit
a more direct attack on the problems
of this area.

The writer wishes to record his apprecia-
tion ... Elisabeth C. Lambert and A. L. Baldwin. He
takes all responsibility for what is said here,
however.

It is necessary, however, to outline 
the focus of the present discussion be-
cause so much space in this book is
devoted to what is broadly referred to 
as the “interpersonal” or “social” as- 
pects of a child’s development. Socio- 
metric, projective, and experimental 
methods are all used to study a child’s 
relationships with others, as are the ob-
servational techniques discussed by
Wright (Chapter 3). We limit the dis- 
cussion in this chapter to what some 
people have done in order to observe 
as directly as possible the social inter- 
action of children, which involves such 
behavior as aggression, dominance, so- 
ciability, helpfulness, and competition. 

This discussion may be somewhat 
more substantive than some of the other 
topics in this book, since it focuses as 
much on a rather homogeneous set of 
phenomena as it does on methods and 
techniques. Since the emphasis here is 
on direct observation, such devices as 
sociometry, doll play, and the TAT 
are less relevant, for those devices per- 
mit a study of interaction more indi- 
rectly or at a farther remove from its 
actual occurrence. Doll play may be an 
excellent device for studying aggres- 
sive relations in the fantasy of children, 
and sociometry may permit a map to 
be drawn of the relationships children 
report to others that they are engaged 
in. The focus in this chapter, however, 
is on how an investigator can record in 
a reliable and valid way the relation- 
ships that a child engages in as seen by 
a “trained observer” rather than by the
child himself. No doubt there are re-
lationships, for example, between the
actual aggressive behavior of a child or 
a child’s peer group and his fantasy, 
and there is undoubtedly great overlap 
between the relationships a child sees 
himself in and those that a scientific 
observer would see. But these are re- 
search questions that cannot be decided 
a priori. The one kind of data can vali- 
date the other, so to speak, although 
investigators often place a premium on 
the “actual” behavior and the “scien- 
tifically observed” relationship as being 
the criteria against which other tech-
niques are judged. This is our major
focus.

The present aim is broader than the
issues of “how to see behavior in the 
raw” or how to validate other meas- 
ures. We hope to discuss the methods 
of studying interpersonal behavior in 
such a way that we may be able to 
improve our chances of relating our 
records of these occurrences to the sit- 
uational and antecedent conditions that 
give rise to them. The aim of the study 
of child development is not merely to 
record the daily life of a child, nor is 
it merely to assist in the task of devel-
oping tests. It is to use records and tests
in the context of the analysis of de-
terminants of the phenomena that re-
cords and tests are theoretically stated
to reflect. Our long-range intention
must be “causal” or “antecedent-conse- 
quent” analysis, so that observational 
methods must be tied closely with good 
design of studies with as much use of 
experimental method as the question 
under consideration permits. We reit- 
erate this emphasis throughout this 
chapter.


SOME KINDS OF INTERPERSON
ACTION UNITS


Figure 20-1 presents one way of rep- 
resenting an interpersonal-behavior sit- 
uation to which we frequently refer in 
this chapter. The figure represents three 
persons in interaction: person (P) who 
is usually the “subject” in child devel- 
opment studies; other (O) who is any 
other person with whom the child is 
seen as interacting, mother, father, sib-
ling, etc.; and a “third person” (TP)
who is often involved when we con- 
sider a total interaction because he may 
be the recipient of a “displaced aggres- 
sion” that arises from the interaction of 
O and P or he may be the person who 
rewards or punishes both O and P be- 
cause of what they have done in the 
interaction. 

Each of these three individuals is in 
a “social or physical situation,” which 
in part determines what will go on.
Arrows with dashed lines radiate from 
the situation “box” of the diagram to 
impinge upon the “external instigations” 
that are working on each actor. These 
arrows represent the situational influ- 
ences that comprise part of the external 
instigation. If, for example, all three are 
in a church, the situation will arouse 
inhibitions (or be inhibitions, depend- 
ing on one’s theory of stimulation) with 
regard to the loudness with which they 
talk and with regard to certain topics 
of conversation or forbidden actions. 
These situational components of the 
“external instigations” are called “ex- 
ternal” here only because they can be 
varied by taking the actors out of 
church or by varying such matters as 
whether a service is in progress.

The bottom box of the figure refers
to “cultural factors,” which impinge
upon the situation box to help define
the social situation to the actors or to
help the scientific observer predict
which aspects of the social or physical
situation will impinge upon them. We
have also represented the box of the
cultural factors as somehow encompass-
ing the total diagram. We do this be-
cause we still know so little about how
these factors operate. There are many
definitions for the notion of “culture”
and many ways in which it is viewed
as affecting action. Sometimes it is seen
as determinative of the entire pattern
of Fig. 20-1. Sometimes it is seen as
being involved in the past learnings of
each actor so that it is reflected in the
cognitions and needs or other states of
each actor. Sometimes it is seen as “de-
fining” what should be done by each
actor at the moment. There is also
much vagueness about just what (short
of everything) “culture” determines.
We will not try here to resolve such
large issues nor to prejudge any direct
empirical issues on this score. We may
say, however, that “being in a church
situation” has different effects on the
external instigations for actors if the
church is Buddhist rather than Greek
Orthodox—assuming that the actors
have learned the difference. We include
“cultural factors,” therefore, in at least
this sense. Although we intend nothing
mystical, we must accept some vague-
ness and some ambiguity in regard to
these factors because of our present
lack of knowledge and lack of agree-
ment on the use of terms.

The diagram in Fig. 20-1 implies or 
assumes an objective approach to the
study of interaction. This is not because
of dogma or because of any belief that 
this is the “best” way to approach these 
matters. The objective view is adopted 
for two reasons. The first is that it is 
very difficult to diagram interaction in 
a simple way by using a subjective ap- 
proach because one would have to have 
a number of diagrams for each of the 
three actors in Fig. 20-1—the view of 
each with regard to the other two, plus 
diagrams representing what each 
thought that each of the others saw in 
him, and so on. This is too complicated 
for the requirements of the present 
problem. 

The second reason is simply that our 
state of knowledge in child develop- 
ment with regard to “internal instiga- 
tions” (cognitions, needs, drives, and 
other states and mechanisms) that oc- 
cur inside the actors is not yet adequate 
to treat them in principle as other than 
inferences made by the “independent” 
observer. That is, we can decide that 
a child is aware of the fact that he is 
in church because he tells us he is or 
on the basis of this statement and other 
behavior of his. In either case it is safest 
to view the decision as a deduction or 
an inference and state as clearly as pos- 
sible the basis for the decision. The ob- 
jective approach functions as a con- 
stant reminder to tell others the bases 
of our inferences. A subjective approach 
can do this too, of course, but we 
would then have to add a whole series 
of diagrams to reflect what the scien- 
tific observer saw in the situation that 
led him to be able to determine how 
each actor saw the situation. Figure 20-1 
is this last kind of diagram, especially 
for the reader who likes to draw “field
diagrams” or use other kinds of sub-
jective definitions for the things in the
“internal instigation” boxes. Figure 20-1
will merely remind such a reader as
well as ourselves that he must also use
operational definitions for such states
in order that we can understand him
more fully.

In short, we recognize that different 
aspects of a social situation will be 
“taken up” by different actors in Fig. 
20-1. We do this, for example, by char- 
acterizing the arrows emanating from 
the situation box as (1a) for P, (1b) 
for O and (1c) for TP. Such a nota- 
tion solves no problems, but it may 
state some. There is a similarity in the 
influence of the social or physical sit- 
uation (and via these, of culture) on 
the different actors. There is also, some- 
times at least, a difference. It is a prob- 
lem of how well we can measure these 
influences and how much of such meas- 
urement we have really done that is 
involved here, not whether the influ- 
ences really reside in the situation or 
in the behaving actors. The “influence”
might be measured in the situation it- 
self, in the physical stimuli impinging 
upon the actor(s), within the actor 
himself (inferred from antecedent- 
consequent relationships), or by the 
arbitrary (but creative and useful) tools 
devised by the scientific observer, in- 
cluding his own judgment (checked for 
reliability and for validity). The cos- 
mology of the place of measurement is 
not so important as the brute question, 
“Have we any way of measuring the 
influence of these things we think may 
be important so that we can get a no-
tion how important they are?”




The reader should note that we have
made a place for three general sources
of influence that make up the “external
instigations” for an actor. We have
already discussed the cultural and so-
cial or physical sources of variance. The
third source of influence is what is done
immediately by other people that is
apperceived by the actor. We have
placed two arrows numbered (2) and
(3) to illustrate this source of variance.
The first of these (2) could take the
concrete form of an act by P whereby
he asks help from O; the second of
these arrows emanates from O's behav-
ior—helping, for example—which pro-
vides the conditions for P to obtain
what he wanted help to get; that is,
this act of O helps to provide the oc-
casion of a goal response for P. Such
arrows, of course, select somewhat ar- 
bitrarily certain things that occur in 
order to give them special attention. 
The “asking” behavior of P may also 
impinge on him as well as on O as a 
self-initiated external instigation, just 
as the act (3) of helping does that O 
then emits. In the case of the first (2), 
once P has asked for help, the prob- 
ability of his asking again may be in- 
creased or decreased, depending, for 
example, on the strength of his “need 
for nurturance” or of his “need for 
independence.” This probability of 
“asking again” may also vary with the 
social situation (there may be rules 
against seeking help, for example, in a 
sport situation) or as a function of cul- 
ture (there may be a general cultural 
value that rules out asking for help 
except in dire necessity). It may also 
vary, of course, with whether or not
the request and the immediate problem
that gave rise to the request is satisfied 
by 


These considerations begin to lead us
toward the complexities involved in at-
tempts to study “interaction” directly,
to which we shall soon turn. We shall
not dwell, however, on the arbitrariness
implicit in our arrows, since this is an
aspect of all attempts to deal analyti-
cally and scientifically with any phe-
nomenon. It is always necessary to leave
out of consideration not only many as-
pects of the “concrete case,” whose
function we do not yet know, but also
a good many things that we already
know may be partly determinative. It
does no good to argue that “the more
we can observe and cogitate on, the
better” because there is no way of 
knowing, a priori, what is relevant, and 
the addition of “some more” that is 
irrelevant may merely frustrate the re- 
searcher and rule out his discovery of 
the clarity or order that otherwise 
might be there. We should also remind 
ourselves that our techniques of meas- 
urement, our theories, and our avail- 
able statistical models always have a 
limit on what they can handle. 
Returning to Fig. 20-1, the external
instigation is seen as influencing the in- 
ternal instigation(s) to lead each actor 
to behave overtly in such a way that 
this behavior could, in principle, be 
seen by an independent observer. Two 
clarifying statements are called for with 
regard to this complicated process. 
First, there are many problems involved 
in this exchange. Theoretical differ-
ences exist as to assumption on such
issues as whether external or internal
instigations are more important (either
in general or with regard to specific 
classes of interactions), how to concep- 
tualize the components of internal in- 
stigation, how to conceptualize the in- 
teractions among such components, etc. 
Second, great restrictions exist in solv- 
ing such problems because of the limits 
of our measurement skills. 

Our second statement is a reminder 
that most of the components of internal 
instigation are inferred by the observer. 
This means that they should ideally be 
indexed or varied independently of the 
particular interaction observed if they 
are to be used rigorously in an explana- 
tory way. If the hypothesis, for exam- 
ple, is that a child’s interaction with 
another person is dependent upon the 
child’s knowledge (cognition) about 
that person, then we should ideally be 
able to report both the interaction and 
the child’s knowledge by the use of 
separate and independent operations. 
There is more than one way of doing 
this. The ideal approach would be to 
control the children’s acquisition of 
relevant knowledge prior to the inter- 
action, thus providing both a measure 
of the degree of knowledge as well as 
an experimental control over its acqui- 
sition. Sometimes we can only study 
the past history of the children with 
regard to such knowledge and make an 
inference (using what we already know 
or hypothesize about child learning and 
development) as to what the state of 
knowledge is at the time of the ob- 
servation of the interaction. It is not 
our present problem to decide which 
of these independent indexing methods 
is best. It is necessary to point out that
some such operations are called for,
and the more there are, the firmer the 
interpretation of our finding will be. 
The same holds true with regard to the 
components of external instigation, ex- 
cept that sometimes these can be varied 
more directly (though not necessarily 
more easily), for example, when we 
first study a child interacting in church 
and then remove him to a playground. 
Returning once more to Fig. 20-1, it 
should be noted that the scheme in- 
cludes two further elements for each 
of the actors. The first of these is the 
reference to “changes in environment” 
that occur as the result of behavior on 
the part of an actor. This refers to a 
wide array of possible things and can 
be used theoretically in several ways. 
One fruitful way is that of Sears (1951) 
who finds it useful to point out that 
one of the usual changes in the environ- 
ment that occurs following an aggres- 
sive act is a sign of pain in the person 
toward whom the aggression is directed. 
This sign of pain is sometimes a neces- 
sary condition for a goal response to 
occur, and it may itself take on value 
and become a source of satisfaction to 
a child under certain conditions. 
Following the change in environ- 
ment comes the goal response, if the 
interaction is a “successful” one for the 
actor. If the child wanted or at least 
asked for help and received it, this oc- 
currence is a goal response (so that the 
instrumental act of O provides the goal 
response for P). If the help is success- 
ful in solving the original problem 
(which may have been either externally 
or internally instigated), this may also 
be considered as an additional goal re-
sponse—“getting the task
Such usages depend on what theory one 
is entertaining, upon what categories 
one is using, and, of course, what one 
assumes can be abstracted from the con-
crete flux and flow of action with the
method being used.

Some “goal responses” are, of course,
not available or may not be noticed by
the independent observer. Perhaps the
child who asked for help did so mainly
to keep O from doing something else
or to dominate him. Part of the prob-
lem of methods in this kind of problem
involves the question of how to arrange
a study to provide either direct “no-
ticing” or a basis for inference of such
goal responses. Theoretically, it may be
the case that a child who asks for help
in order to dominate may have tend-
encies to seek help, as well as a par-
ticular form of cupidity strengthened.
If the observer is set for one and not
the other, he is not necessarily wrong
in what he does record but may have
missed something else.

In the present state of our efficiency 
in research this may not be important 
at all, unless, of course, it so happens 
that the problem at hand is to predict 
P’s future succorant tendencies from 
effects of present interaction and such 
predictions would be improved by 
knowing when succorance has become 
strongly associated with dominance. 
This may be a fault of theory, rather 
than of the method of observation. Of 
course, it may be the case that P asks 
for help in a sports situation for achieve- 
ment purposes as well as for dominance 
(and succorance), and there may be 
three goal responses occurring almost
at the same instant. Even if the observer
is set for all three and the theory for
prediction is quite adequate, the reli-
ability may drop drastically because
one observer cannot attend to such
complexity. Here observation method
is at fault, and another observer or two
or a different research design may be
called for, as we suggest later in this
chapter. There are always limits, how-
ever. The “totality of interaction” is
never available for direct study, and
the notion of the “total field” will al-
ways retain an immense component of
“extra meaning” that is not realized
(and probably not realizable) in meas-
urement terms.

If the import of Fig. 20-1 is now more 
or less clear, let us use it to help out- 
line some of the kinds of interpersonal
units that have been studied or that 
might be studied. Discussion follows 
the order of complexity of such units. 


Single Acts. It is often the case that 
the observer is set to observe the occur- 
rences of a certain class of act, en- 
gaged in by P or O or both, or to ob- 
serve the occurrence of that class of 
act in a whole group. In fact, the ob- 
server may be set to record occurrences 
of a number of such categories at the 
same time. The main characteristic of
this kind of unit is, however, its extreme 
simplicity when viewed in the context 
of Fig. 20-1. When he is recording, for 
example, the frequencies of aggression 
or dominance in terms of single acts, 
the observer is watching the totality of 
the interaction as shown in Fig. 20-1, 
but he is recording only some of the acts 
of P, using the totality for purposes of 
assuring himself that he is categorizing
correctly. He may be recording from
an a priori or an empirical check list, 
which may be either detailed and spe- 
cific or quite general. 

Suppose O and P are playing a game 
and P asks O for help. An observer who 
is set to record dependent acts may be 
poised to record this as one, but he 
may desist when he notices smiles on 
the faces of both and immediately rec- 
ognizes that this is a joke where both 
actors apparently know the rules that 
obviate such help. Such is the context, 
in part, and such is the way the context 
may be utilized by the observer. Since 
the act has something to do with dependency
(it is a “succorance joke,”
so to speak), it may be recorded
if the categories the observer is utilizing
have a place for it. Otherwise it may
be left out altogether or wrongly in- 
cluded. 

Single-act recording is probably the 
simplest and most reliable way to go 
about studying something about inter- 
personal relations. But there is a sense 
in which it has very little to do with 
anything “interpersonal” at all. The 
researcher is also very limited in the 
uses to which he can put the data that 
emerge from such records. Both points 
call for a bit more expansion. 

First, single-act records are interper- 
sonal only insofar as the actors are di- 
recting their “single acts” toward one 
another and insofar as the observer is 
making use of cues in the total context 
that do not appear on his record. As 
we shall see, there are many other senses 
in which they are not “interpersonal” 
at all but are usually placed in this gen- 
eral category because the category of
acts studied often has a social-sounding
name rather than because the unit of 
recording used is actually social in its 
content. 

Second, single-act records can lead 
an unwary analyst into difficulties in 
interpretation. Suppose the observer is
following children around, one at a 
time, in order to study the individual 
differences in their succorant behavior. 

Even though his records include a 
good sampling and are reliable, he will 
have difficulty in inferring that the dif- 
ferences are due to differences in the 
children because he cannot partial out 
the differences that might be due to the 
instigations to succorance that the chil- 
dren met. The least succorant of chil- 
dren might display a good deal of this 
behavior if they met a number of ex- 
treme problems during the recording 
session. 

Such worries can be greatly reduced, 
of course, if the observation takes place 
under experimental conditions in which 
the instigation is kept carefully con- 
trolled or is equated in some way for 
each subject. When we consider this 
case, we see how this form of “inter- 
personal” study is much the same as 
that used in most individual laboratory 
study of animals or humans. The bar- 
press of a rat is formally similar and 
differs mainly because it is directed to- 
ward a physical rather than a social 
object and is usually recorded by a 
mechanical counter rather than by the 
human observer.

Single-act records can also be used 
to characterize some groups in which 
children work or play. One nursery- 
school group may show more frequent 
aggressive acts than another, and single-
act recording may reliably and validly
reflect this fact. Such records, however, 
will usually permit very little analysis 
of the reasons for such group differ- 
ences: for instance, different teachers, 
more frustrating play conditions, the
presence of one or two bullies, or an 
average difference in one or more per- 
sonality traits in the children studied. 
Analyses of these kinds call for a more 
complex form of recording or a care- 
fully manipulated experimental design. 

Interactions. When we talk about 
interactions, we are referring to situa- 
tions in which two (or more) children 
are being observed and in which the 
recorded unit includes facts about the 
behavior of more than one child. This 
kind of unit can take many specific 
forms but often involves, in terms of
Fig. 20-1, at least as much as the arrows
in that figure imply. Examples of inter- 
actions might be the following: O asks 
for help and P helps; O strikes P and 
P retaliates; O orders P to do some- 
thing other than what he is doing and 
P complies. Such are the minimum requirements
of interact recording as we
mean it here—it is action that involves 
at least two persons. It is a double or 
compound act, so to speak, rather than 
a single one. 

Let us quickly make one further and 
important distinction regarding the re- 
cording of sociological interactions. 
The difference has to do with whether 
or not one or more of the subjects who 
are being observed are “replaceable.” 
Sometimes both O and P are in prin- 
ciple completely replaceable by a dif- 
ferent O and P. Suppose one is inter- 
ested in studying the effect of a physical
setting on interaction. The observer
might place himself first in a sample of 
large playgrounds and then in a sam- 
ple of small ones, recording all cases of 
aggressive interactions that occur. Since 
the analysis is going to be made in 
terms of settings, there is no need to 
record the proper names of any of the 
children involved. The observer is set 
to record any of a number of actions, 
retaining the relationships between in- 
stigation and action in the record. In
our sense, all pairs of actors are re- 
placeable with any other pair. 

The analysis can then focus on the 
effects of size of playground on insti- 
gations (e.g., the number of verbal and
physical “assaultings”), on retaliation
(the proportion of retaliations to as-
saultings), or on simple frequency of
aggressive actions. We assume that
other control issues have been met, such 
as size of the group of children and 
socioeconomic area from which the 
children are drawn. In terms of replace- 
ability, the study is a sociological or a 
social-psychological one. This observa- 
tional unit is often what is referred to 
as the basis of an analysis of role be- 
havior, in which P is a policeman, for 
example, and O is a motorist, but O is 
no particular motorist and P is no par- 
ticular policeman. 
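
The three indices named above (number of assaultings, proportion of retaliations to assaultings, and simple frequency of aggressive actions) can be tallied from a record that keeps no proper names at all. The following is a minimal sketch under stated assumptions: the field names, the helper `summarize_by_setting`, and the sample records are all hypothetical and are not taken from any study.

```python
from collections import defaultdict

# Each record is one anonymous aggressive interaction. No proper names are
# kept, only the setting and whether the assault drew a retaliation.
# Field names and values are hypothetical illustrations.
records = [
    {"setting": "large", "assault": "verbal", "retaliated": False},
    {"setting": "large", "assault": "physical", "retaliated": True},
    {"setting": "small", "assault": "physical", "retaliated": True},
    {"setting": "small", "assault": "verbal", "retaliated": True},
    {"setting": "small", "assault": "physical", "retaliated": False},
]


def summarize_by_setting(records):
    """Per setting: assault count, retaliation proportion, total aggressive acts."""
    tally = defaultdict(lambda: {"assaults": 0, "retaliations": 0})
    for rec in records:
        tally[rec["setting"]]["assaults"] += 1
        if rec["retaliated"]:
            tally[rec["setting"]]["retaliations"] += 1
    summary = {}
    for setting, t in tally.items():
        summary[setting] = {
            "assaults": t["assaults"],
            "retaliation_proportion": t["retaliations"] / t["assaults"],
            # An assault and its retaliation each count as one aggressive act.
            "aggressive_acts": t["assaults"] + t["retaliations"],
        }
    return summary


summary = summarize_by_setting(records)
```

Because every pair of actors is replaceable, the setting is the only identifier the record needs; adding names would cost recording effort without serving this analysis.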

A middle case occurs when P is not 
replaceable, whereas O is replaceable. 
Such is the case when P’s proper name 
is recorded, and the observer is watch- 
ing both what he does and what others 
do to him with an interest in making 
a diagnosis of P’s personality on the 
basis of such “input-output” informa- 
tion. In such a case it is not necessary 
to record the proper names of the vari- 
ous Others with whom P has commerce.
It is sufficient to record the age
of O, the relative status of O, or, per- 
haps, whether or not O is a friend of 
P. The Others are replaceable, whereas 
P is not, since he (P) is either an end 
of study in himself or is one of those 
selected as representative of an age or 
population group and therefore a num- 
ber of different observations must be 
referred to him. He is replaceable only
in the statistical sense. 

A completely interpersonal study of 
interaction is a very different and a 
rather rare thing. Here neither O nor 
P is replaceable. This might occur 
when the observer is interested in 
watching friendships or in long-term re- 
lationships in a family group. Let us 
consider the second case. The observer 
will feel called upon to record the 
proper names of all family members 
who appear in interaction with P or 
with any one of them who is being 
viewed as P. The fact that P is being 
asked to help an “older boy” is not 
enough—the observer is interested in 
the fact that the older boy is P’s brother 
Peter, whose relationships with P have 
become strained, perhaps because of the 
arrival of a new family member or be- 
cause the relationship between the par- 
ents of these two children has been de- 
teriorating in some manner. In this case
the observer will want to record, at a 
minimum, the names of P and of O, 
the instigations involved, the behaviors 
such instigations invoked, and the name 
of the person who created the instiga- 
tions in the first place. 

A complete interpersonal interaction 
is a complex unit even in its simplest 
form and may require a number of ob- 
servers for its adequate coverage. Each
of these observers must supply coordinated
bits of information, which will go
to make up the interpersonal interact. 
It may also be achieved by one observer 
who samples the interactional life of
first one then another member of the 
family, viewing each in turn as P in 
Fig. 20-1. It is then extremely vital that 
the proper names of the O’s be recorded,
and the interpersonal interaction of P
with each O₁, O₂, etc., may need to be
cross-filed with the data collected for
O₁ and O₂ when they were serving as P.
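
One way to picture the minimum record and the cross-filing it requires is sketched below. This is an illustration only: the `Interact` fields follow the minimum listed earlier (names of P and O, the instigation, the behavior it evoked, and the person who created the instigation), while the class name, the helper `cross_file`, the name "Paul", and the sample entries are hypothetical.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Interact:
    p: str            # proper name of the child being viewed as P
    o: str            # proper name of the Other
    instigator: str   # who created the instigation in the first place
    instigation: str  # category of the instigation
    response: str     # behavior the instigation evoked


def cross_file(interacts):
    """File every interact under both names, tagged with the role played."""
    files = defaultdict(list)
    for it in interacts:
        files[it.p].append(("as P", it))
        files[it.o].append(("as O", it))
    return files


# Hypothetical records from two sessions in which each brother served as P.
interacts = [
    Interact(p="Paul", o="Peter", instigator="Peter",
             instigation="asks for help", response="refuses"),
    Interact(p="Peter", o="Paul", instigator="Paul",
             instigation="blocks activity", response="retaliates"),
]
files = cross_file(interacts)
```

Looking up `files["Peter"]` then returns what Peter did as O in Paul's record alongside what he did as P in his own, which is the comparison that a record without proper names cannot supply.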

A sample of interpersonal interacts is 
therefore a far cry from a sample of 
single acts or even from a sample of 
sociological interacts. None of these is 
generally more “correct” than another, 
but for any given research question one 
may be adequate, another may be quite 
inadequate, and the third may be over- 
adequate and wasteful of observing 
time and of analysis time. As we shall 
see, child development has hardly be- 
gun to collect the information necessary 
to make researches efficient in this re- 
gard, and such efficiency will not em- 
erge without help from statisticians, 
theorists, and experts in study and de- 
sign. 

Interpersonal Activities. There are 
many units of “interpersonal behavior” 
that call for much less fine texture in 
the record than is called for in the units 
we have so far described. Take, for ex- 
ample, the observer who is interested 
in recording the interpersonal relations 
between two boys or in a group of 
boys. It may be silly to attempt to re- 
cord all the instigations or actions that 
occur between or among these actors 
when the fact that they are “playing 
together” or are “meeting together” or
are “getting together to fight”
some other group is sufficient. In these 
cases the fact of contact of some dura- 
tion is sufficient, and the rate of such 
contact, the initiator of such contact, 
etc., can provide indices of the strength 
or frequency of the relationship’s oc- 
currence. 

It is to these broader aspects of in- 
terpersonal relations that the various
forms of sociometric questions are usually
directed. Here we ask P to tell us
whom he would choose, or would like 
to choose under certain conditions, for 
companionship in various activities, 
such as going to the movies or having 
a roommate. There is reason to hope
that systematic study of the relation 
between what O replies to such a ques- 
tion and what he does in directly recorded
interaction (either in naturo or
where social setting is controlled) might 
provide more knowledge of the values 
of sociometry and also provide sub- 
stantive findings regarding “shyness,” 
“status striving,” and the general rela- 
tion between wishing and acting. Such 
study depends in part on recordings of 
what we shall call interpersonal activi- 
ties. We do not mean to rule out the 
possible value of our finer-textured 
units from this kind of endeavor. If P 
chooses O as a roommate and initiates 
such an actual relationship, observation 
of the interpersonal interactions may 
provide a basis for diagnosis of the rea- 
sons for the choice and indices that can 
be related to other aspects of the per- 
sonalities of the two actors or to the 
longevity or success of the relationship. 

Sociological Activities. Closely re-
lated to interpersonal activities are the
units of “interpersonal behavior” that
are characteristics of collectivities and
which, like interpersonal activities,
avoid the fine texture of the inter-
action that occurs between members of
the collectivity. If, for example, an ob- 
server records the fact of the meeting 
of the Winthrop Street Baptist Church, 
he has provided a fact helpful in con- 
structing certain kinds of indices. He 
may ignore (or not know) the addi-
tional fact that no one at this particular
meeting was present at the last one
(replaceability) or that this was a par-
ticularly friendly meeting, marked by 
a high frequency of single acts of so- 
ciability, the initiation of a number of 
reciprocal interpersonal activities and a 
high number of interpersonal interacts 
of the form in which P reminds O of 
past kindnesses on the part of O. The 
fact of the occurrence of the social pat- 
tern (i.e., that the church meeting was 
held) may be sufficient if one is com- 
paring the Baptist pattern with, say, 
that of the Methodists in terms of fre- 
quency. 

One final remark is called for about 
these kinds of units, which we have 
briefly outlined. This listing in no sense 
exhausts the possibilities but merely out- 
lines some important kinds of units in 
a manner that is useful to us in keeping 
our focus on the direct observation of 
“interpersonal” phenomena. Past re- 
search designs have combined various 
of these units, and there is no reason 
why future ones may not do so even 
more if adequate statistical and study 
designs can be devised. Substantively, 
the phenomena of this realm have 
hardly been explored, much less 
mapped, and almost all probably have
some usefulness with regard to some
hypothesis. 

In a sense we have started with some- 
thing less than an “interpersonal be- 
havior” (single-act records) and ranged 
beyond it to sociological activities in 
outlining units. Such “bracketing” of 
the problems has been done to provide 
perspective; it has also been done be- 
cause the task of our topic embraces 
this wide range. We shall focus as 
much as possible, however, on the mid- 
dle of this range—on the methods 
available for the direct study of inter- 
actions. 

It is perhaps worth pointing out here 
that most of the work of experimental 
behavior psychologists has been focused 
on units like the “operant” response. 
The usual procedure is to select a re- 
sponse class (single-act unit) and to 
study the conditions for its occurrence, 
its suppression, its weakening, its 
strengthening. Such systematic work 
has led to very fruitful description of 
the properties of such a class. Similar 
fruitfulness has come from systematic 
analysis of behavior at a choice point. 
Such work should certainly not be lost 
when we broaden our sights to these 
larger and more complicated units. 
Probably, interactions have all the 
properties of operants and many others 
as well. Their description and even- 
tually their experimental manipulation 
may make clear properties that even 
operants have but that cannot be dis- 
covered in the noble isolation from 
everyday action that is the usual fate 
of operants. 

At the other end of this continuum
lie, of course, the huge problems of
the recording and analysis of inter-
group relations, international relations,
intercultural relations, and, perhaps be-
fore too long, the new field of inter-
planetary relations. To date, methods
of direct systematic observation have 
been extremely rarely used in these 
fields, in which the sample survey ques- 
tionnaire, scholarly perusal of docu- 
ments and first-hand accounts, and oc- 
casional content analysis of recorded 
conferences are the presently preferred 
techniques. As progress increases to- 
ward the development of experimental 
paradigms of the situations referred to 
by these terms, any findings of lasting 
value that arise from the study of chil- 
dren’s interpersonal behavior may be- 
come directly relevant, and categorial 
systems developed in child study may 
suggest solutions to problems in the 
study of the larger aspects of the inter- 
play of parents. But international re- 
lations and the study of child develop- 
ment are areas of study that we pres- 
ently see as only too far apart for com- 
mon discussion although Barker (1957) 
has begun work on putting them to- 
gether.


Strategy Issues in the Direct 
Study of Interpersonal 
Behavior 


We have so far discussed some of 
the kinds of units that one can study 
directly in children’s interpersonal be- 
havior. In some cases, at least, the next 
step in planning a research would be to 
plunge into the issue of what aspects 
or attributes of such units one wished 
to study and what specific categories 
of observation one might wish to utilize.
In this chapter, however, we shall
delay such discussion in order to con- 
sider first some of the fascinating and 
complicated issues of the strategy of 
measurement and recording in this field. 
These strategy matters are important 
to the design of studies, second in im- 
portance only to the tools of adequate 
theory or a good clear preliminary 
empirical question. 


SAMPLING OF INTERPERSONAL 
BEHAVIOR 


Sampling Single Acts. The research 
question that motivates a study deter- 
mines to some extent the kind of sam- 
pling one will want to do. Barker and 
Wright (1955), for example, are inter- 
ested in describing the psychological 
world of the children of a Midwestern 
United States town. The population
they wish to sample consists
of the several million “episodes” of be-
havior that involve children in a given 
year. Their main emphasis is ecological 
—a desire to reflect accurately the life 
of children in a town, according to a 
set of categories they argue are “natu- 
ral” ones. Whether or not they achieve 
a sampling of their population or 
whether their categories are “natural” 
relative to nature or only to their im- 
plicit and explicit theoretical interests, 
their strategy is based on this aim and 
guided by it in good part. It takes the 
direction of obtaining total day records 
for a sample—or at least a range—of 
children in the town of different ages, 
sex, etc. 

Although it is very easy to become 
fascinated by the wide-ranging aim of 
such a study, and thereby approve of
its strategy, these are not the only aims,
the only categories, or the final strategy 
even for those aims and categories. By 
way of contrast, let us consider a much 
more specific, yet equally legitimate, 
research problem. Suppose one’s research
aim is limited to the single acts
initiated by a sample of P’s in the rela- 
tive absence of direct instigation from 
others. The theory may further specify 
the situations of such action by two 
classes: situations in which P is in in- 
teraction with peers (i.e., children of 
roughly the same age) and situations 
in which P is interacting with parents. 
The children themselves may be di- 
vided into two groups on the basis of 
their being samples of two populations: 
those with parents who have punished 
them physically for aggressive behavior 
toward these parents and those who 
have not been so punished (based on a 
previous sample of a sample of all the 
parents of a town or of a subgroup of 
a town). Furthermore, the hypothesis 
being entertained may specify an inter- 
est only in aggressive acts instigated by 
P, as compared to all other acts insti- 
gated by him. The theoretical consid- 
erations may further rule out any ag- 
gressive or other acts instigated by P 
in the midst of long interaction se- 
quences but include those that arise 
after pauses in the interaction of a cer- 
tain duration (based on some as-yet- 
undeveloped theory of memory span 
for immediate interaction). These last 
considerations help to keep the ob- 
served units independent of one an- 
other. The problem is reduced by the 
(rare) clarity of the theory to one of 
obtaining a sample of the things P ini- 
tiates following such pauses, and this
may be done by spot checking (walking
past P’s playground and visiting
his home at preset intervals), by stay-
ing with the child for relevant periods
of his day, etc.
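
The selection rule in this example (keep only what P initiates after a pause of a certain duration) can be stated precisely enough to apply mechanically to a timed record. The sketch below is illustrative only. The 60-second pause is an arbitrary placeholder, since the theory that would fix it is described above as not yet developed; treating the first recorded act as following a pause is a further assumption; and the event list and function name are hypothetical.

```python
def initiations_after_pause(events, min_pause):
    """Keep acts initiated by P that follow a pause of at least min_pause seconds.

    events: list of (time_in_seconds, actor, act) tuples sorted by time.
    The first event of the record is treated as following a pause (assumption).
    """
    selected = []
    previous_time = None
    for time, actor, act in events:
        gap_ok = previous_time is None or time - previous_time >= min_pause
        if actor == "P" and gap_ok:
            selected.append((time, act))
        previous_time = time
    return selected


# Hypothetical timed record of one observation session.
events = [
    (0, "P", "aggressive"),
    (5, "O", "other"),
    (8, "P", "other"),
    (70, "P", "aggressive"),
    (75, "P", "other"),
    (140, "P", "other"),
    (150, "O", "other"),
]
sample = initiations_after_pause(events, min_pause=60)
aggressive_share = sum(act == "aggressive" for _, act in sample) / len(sample)
```

Acts that fall inside a running sequence (here those at 8 and 75 seconds) are dropped, which is what keeps the retained units roughly independent of one another.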

Such assistance from theoretical con- 
siderations as this example assumes is 
still extremely rare in the field of child 
development. A survey of the literature 
shows that most studies are
less clear regarding aim, the categories
are usually constructed for purposes of 
exploration rather than to test hypothe- 
ses, and sampling considerations are 
usually much less in the foreground. 
Finally, it is the rare researcher who 
would engage in such a costly proce- 
dure as direct observation with the 
faith in the value of the hypothesis 
that such an investment calls for. Most 
researchers would usually content 
themselves with ratings by, or inter- 
views with, teachers and parents, or 
they may content themselves instead with
experimental tests in which
a parent is present and is then replaced
by a peer or two. They then hope that
the findings are representative of the 
unstudied question of the spontaneous 
instigations by P in his daily life in 
these two kinds of situations. Such a 
situation is not so much to be deplored 
as to be understood. What is usually 
done is often right and proper, but for 
some purposes it is often incomplete or 
inefficient. 

The Sampling of Instigations. The 
strategy of a researcher is only slightly 
different when the question is one of 
trying to sample the pressures or insti- 
gations on a child. Theory may again 
help limit the task by limiting the sit- 
uations under which such sampling
needs to be done, or it may specify the
classes of instigative act more or less 
closely. Such specification may take a 
general form, such as “impositions,” 
“opportunities” (see Baldwin, 1955, pp. 
131 ff.), “lures,” and inferred internal 
instigations, or it may make further 
distinctions within these or other gen- 
eral categories, such as “blocking of 
activity of P,” “attacking P,” or “ini- 
tiating help for P.” If the observer is 
comparing two groups in terms of the 
pressures of certain kinds placed on chil- 
dren, then, in terms of Fig. 20-1, these 
pressures or instigations come not only 
from initiations or retaliations from O 
but also from the third party of that 
figure. Consider the case in which O
attacks P but in which P is punished 
by the mother (third party) for re- 
taliating to O’s onslaught. Such actions
are often vital if groups or individuals 
are being studied over time, since the- 
oretically these effects of O’s own ac- 
tions are important psychological in- 
stigators to learning and conflict. 

The aim of sampling is to render 
equally probable the presence in the 
sample of each of the kinds of things 
considered important. When one is 
studying instigations, there is a wide 
array of possible important instigation 
sources. This array is so wide that to 
some researchers the task seems hope- 
less. Perhaps one of the rarely men- 
tioned effects of Freudian theory was 
to convince some researchers that 
mothers and fathers were such impor- 
tant sources of instigation that most 
others could be ignored. This at least 
helped lead to empirical observations, 
whether the theory was correct or not. 
The problem here is that every
category of single acts that one con-
siders multiplies when
situation factors, factors describing who
O is, etc., are considered. If a researcher
is interested in comparing, for example,
the occurrences in two nursery schools 
of instigations of the type “assault upon 
P by an older child, the older child 
being a sociometrically chosen friend,” 
the classification may have become so 
refined that an observer would have to 
be present for many hours before such 
a case would occur. Of course, such a 
type of instigation may be exactly the 
vital one for theoretical analysis. Three 
things can be done in such cases (short 
of giving up the endeavor altogether): 
the theory can be developed to handle 
more general classes that provide more 
frequent occurrences; a pretest may be 
used in which a child is followed around 
so that the observer can estimate where 
such occurrences are most probable; or 
the whole study might be shifted as 
soon as possible to a more experimental 
arrangement, so that the probability of 
such occurrences could be increased at 
the will of the experimenter.
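
The cost of a highly refined category can be estimated roughly before observation begins. The sketch below assumes, purely for illustration, that occurrences arise independently at a constant hourly rate (a Poisson assumption that is not part of the text), and both rates shown are invented figures of the kind a pretest might supply.

```python
import math


def expected_hours_to_first(rate_per_hour):
    """Mean waiting time to the first occurrence under a constant-rate assumption."""
    return 1.0 / rate_per_hour


def prob_at_least_one(rate_per_hour, hours):
    """Chance of recording at least one occurrence in the given observing time."""
    return 1.0 - math.exp(-rate_per_hour * hours)


# Hypothetical pretest estimates, in occurrences per observing hour.
broad_rate = 2.0      # any assault upon P
refined_rate = 0.05   # assault by an older child who is a chosen friend

hours_broad = expected_hours_to_first(broad_rate)
hours_refined = expected_hours_to_first(refined_rate)
chance_in_ten_hours = prob_at_least_one(refined_rate, 10)
```

Under these invented figures the refined category would demand many times the observing time of the broad one, which is the practical argument for broadening the class, locating likely occasions by pretest, or moving to an experimental arrangement.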

One simplifying possibility should be 
mentioned in connection with the sam- 
pling of instigations, which also holds 
for the sampling of “single acts by P.” 
This is the situation that arises when a 
sample of P’s has been selected from 
some population of children and in 
which a number of these P’s interact
frequently with one another. Then, if 
each child is being followed one at a 
time, what P does to O by way of in- 
stigating him can also be recorded as 
an instigation at the same time that O's 
instigations of P are being recorded. 
There are occasions when this may
save recording time; there are also occasions,
however, when this may be
more trouble in analysis than it is 
worth. If, for example, the O in the 
foregoing case is being time-sampled 
on a different schedule, the additional 
information obtained while sampling P 
may merely confuse this time sampling. 
Further, the independence of such ob- 
servations may be jeopardized unless 
great care is taken to make sure that 
what P does to O is not directly related 
to what O has done (or will do) to P 
that is also recorded as an instance. This 
double recording becomes clearly valu- 
able when interactions are being studied 
and when the hypothesis calls for the 
interactions to be used to infer traits 
of personality in O and P. This is dis- 
cussed below. 

One additional warning about studies 
that involve the sampling of instiga- 
tions: any analysis of the external in- 
stigations that impinge upon P is limited 
indeed, unless the study design permits 
the analysis of some of the conditions 
that lead to such instigations. Consider 
a study in which mothers’ behavior to- 
ward P’s (i.e., their children) is being
sampled and the differences in the in- 
stigations are being related to P’s be- 
havior on a TAT (or to P’s own be- 
havioral initiations). Once the analysis 
is done and the relationship reported, 
the researcher may well want to ex- 
plore the issue of why some mothers 
“wheedle” or “coerce” more than 
others, and the researcher may then 
wish to relate such “instigations of P” 
to personality traits, situational factors, 
or broader family relationships. Such 
questions may focus upon the imme- 
diate instigations that impinge upon the
mother and that in turn lead to the instigations
of P already recorded. Such
explorations may be ruled out, unless 
the broader scene was recorded in 
which mothers are viewed as O’s upon 
whom other O’s impinge and who in 
turn impinge in certain ways upon P’s 
(their children). In short, a study of a 
mother’s interaction may be the best 
strategy to pursue when in search of 
a portrait of the external instigations 
that are visited upon the child. 

We must quickly add, however, that 
if the hypothesis deals only with the 
relationship between the distribution of 
certain classes of instigations that im- 
pinge upon a child and his subsequent 
TAT or doll-play performance, the test
of the hypothesis should not be jeop-
ardized merely to obtain exploratory
data. It might be best to limit observa-
tion to the “instigations by mother of
P” and interview the mother on the
other matters. Observation is a costly
procedure, and our methods are still
distinctly limited in their ability to cap-
ture the broader reaches of the “struc-
ture of an event.”

At a practical level, the recording of 
instigations is beset with the recurring 
fact that P often acts without any apparent
external instigation. This is the
case that may arise when P joins a 
group by verbally attacking one of the 
members of the group or when P has 
been playing quietly alone and sud- 
denly rises up and, for no apparent 
reason, decides to destroy the next 
child’s sand castle. This problem arises 
more clearly when the researcher is in- 
terested in interactions, but it is some- 
times germane even when only exter- 
nal instigations are being studied. First
of all, it can lead to errors in recording
external instigations. If a researcher is 
watching P mainly in order to “catch” 
the instigations that impinge upon him 
and if P suddenly initiates an act with 
no apparent external instigation, then 
the fact that some other child had sud- 
denly changed his place of play may 
appear in retrospect to be the instiga- 
tion to P to go and strike his neighbor. 
This is a problem that only careful ob- 
server training and clear definition of 
instigation classes can overcome. 

A more insidious problem sometimes 
arises. Suppose the recorder is working
with a conceptual scheme that defines 
instigations in terms of their inferred 
or observed effect on P. It is quite sen- 
sible in some theories (including com- 
mon sense) to suggest that a “frustrat- 
ing instigation” is not frustrating to P 
unless P shows by his behavior (anger, 
aggression, leaving the field) that it is 
so. In short, instigations are viewed by 
the researcher as “perceived or apperceived
by the subject.” These, of
course, are a subclass of instigations, 
but they are sometimes defined in such 
a way that they are a subclass so close 
to P’s action as to be almost equatable 
with it (an easy way indeed to get high 
correlations between “instigation” and 
a particular class of action). Except 
where this issue is not important to the 
hypothesis under test, it is the “duty” 
of a theory or of an operational pro- 
cedure to state what a stimulus is, and 
this can rarely be left to the subject 
to do for us. We must in the long run 
judge the state of our field in part by 
the degree to which it can unequivocally state
that “it is this physical pattern of ener- 
gies or this situational factor—measured 
independently of P—or this cultural
factor or this personality factor meas- 
ured such-and-so that disposes P to- 
ward behavior of class X or that changes 
in such-and-such a way the probability 
of his behaving according to class X.” 

Regardless of how we go about defining
or pointing out instigations, we
should expect P to act sometimes in
the apparent absence of such things,
and if, shortly after he has initiated an
aggressive act, we note that someone
is hitting him (in return), it is wise to
have distinctions in our categories for 
such self-initiated retaliations, which 
otherwise might merely appear to be 
cases of poor P being picked on by his 
neighbors. These retaliations, which 
arise from P’s own initiations, may have 
different effects on P. He may, for ex- 
ample, be feeling guilty by then. 

Sampling Interactions. There are 
two importantly different strategies in- 
volved when the sights are raised for 
capturing interactions. The one is in- 
volved when interactions are being re- 
corded for purposes of studying per- 
sonality differences among children; the 
other is used when the study aims at 
characterizing diads, groups, settings, 
or cultures. 

In the first instance, in which the 
aim is to characterize the action tend- 
encies of individuals (or more compli- 
catedly, their interaction tendencies), 
theory may specify cultural factors, 
and the only problem is that of arrang- 
ing the external instigation situations 
for each child so that the full range of 
action (or interaction) tendencies is 
possible. This permits the internal instigating
(or personality) factors to express
themselves in determining differences
in the distribution of actions or
interactions obtained from one child to 
the next. The best way to handle this 
problem, short of having more knowl- 
edge than we usually have, is to pro- 
vide a representative array of instiga- 
tions but to have them the same for 
each child. This is, unfortunately, a 
contradiction in itself, since a “repre- 
sentative array” of instigations that 
would fit the everyday life of one child 
would not necessarily be representative 
of the instigations in the everyday life 
of the next; yet, unless we use the 
“same” array of instigations for meas- 
uring, we lose one of our bases of com- 
parison. There is an indeterminacy here, 
since a child who meets a limited array 
of instigation situations in real life 
(compared to the next child) may have 
personality properties of a certain kind 
or intensity just because of the limited- 
ness of the array he usually meets! Per- 
haps the best compromise available for 
nomothetic study is to provide a representative
array of external instigations
that would fit the everyday life of a 
modal child of the group or an array 
of instigations that would fit the every- 
day life of all the children in the group, 
dropping out those instigations that are 
rare. 

Such considerations lead rapidly to- 
ward the simplifications of our usual 
test procedures, in which we attempt 
to provide the same “standardized” ex- 
ternal instigations to all subjects, letting 
what is “in them” determine the dis- 
tribution of answers. It is even some- 
times argued that it is wise to control 
on the “reportable” reception of the 
external instigations by the subject or, 
more broadly, the perception of the 
external instigation. This is one of the
virtues of the laboratory or clinic sit- 
uation for testing, as long as the opera- 
tion used to make the stimulus “grasp- 
able” does not itself deform the response 
distribution too much. If, in an intelli- 
gence test, we rule out all questions 
(external instigations) that the subject 
cannot cognize, then we may have lim- 
ited the range of responses that define 
the kinds or degrees of “intelligence” 
so that it no longer fits with the theory 
with which we started. The report “I 
am confused by the question” may be 
a valuable clue to intelligence. 

These test criteria raise problems for 
direct observation. In fact, they usually 
raise such complicated questions that 
this form of validation is often com- 
pletely avoided. Such avoiding is often 
the better part of valor but not when 
the theory or measurement under test 
is supposed to apply to everyday be- 
havior. The only way to test the as- 
sumption that “this hypothesis applies 
to everyday behavior” is to order or 
categorize everyday behavior and find 
out. Direct observation is one of our 
ways of doing exactly that. The prob- 
lems must sometimes be faced. 

In practical terms, the problems too 
often arise after the data have been 
collected—in the analysis phase—as any 
experienced statistician would sadly tell 
us. Suppose a researcher has time-sampled
the interactive behavior of a carefully
selected sample of subjects, recording
all the data necessary to an
analysis of both instigations and result- 
ing actions. The data are all in, the 
observers have gone home, and suddenly 
it is discovered that there are only four 
cases of “O attacks P” for child A, none 
for child B or C, 15 for child D, etc.
The analysis sometimes has to stop at 
this point, and it is necessary to fall 
back on the still more mysterious 
process of “ratings” in order to save 
the research. Such problems can be 
avoided by careful research strategy. If 
one is aware ahead of time of at least 
some of the crucial instigation classes 
in which the researcher is interested and 
if the observations are analyzed, at least 
roughly, for the occurrences of these 
in the time-sampled material, then extra 
recording can go on for those children 
who, for example, are infrequently 
attacked by peers. Further recordings
may be obtained for all children in situations
in which rare instigations are
more frequent. If a careful record is
kept of which protocols were taken according
to the sampling schedule and which
went beyond the schedule, then the
latter can be included for studying the
personality dispositions of the children 
(the ratio, for example, of retaliations to 
attacks), whereas they could be left out 
of the analysis of the sample of every- 
day pressures or of the interactions that 
characterize the sample of the child’s 
life. Such planned checks on the in- 
coming data may even be formalized by 
the use of sequential analysis, which 
could save time and money by giving 
estimates on how soon a given instiga- 
tion category is going to be filled (or 
not filled). 
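The bookkeeping described above can be sketched in a few lines of Python. This is only an illustration under our own assumptions: the record layout, the field names, and the category labels ("O attacks P", "retaliates") are hypothetical, and the running check shown is a simple count against a minimum, not a formal sequential test.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class Protocol:
    child: str        # the P being observed
    instigation: str  # coded instigation class, e.g. "O attacks P"
    action: str       # P's coded response, e.g. "retaliates"
    scheduled: bool   # True if taken under the time-sampling schedule


def sparse_children(protocols, children, instigation, minimum):
    """Children whose count for one instigation class is still below `minimum`."""
    counts = Counter(p.child for p in protocols if p.instigation == instigation)
    return sorted(c for c in children if counts[c] < minimum)


def retaliation_ratio(protocols, child,
                      instigation="O attacks P", action="retaliates"):
    """Ratio of retaliations to attacks, using scheduled AND extra records."""
    attacks = [p for p in protocols
               if p.child == child and p.instigation == instigation]
    if not attacks:
        return None  # no basis for a ratio
    return sum(p.action == action for p in attacks) / len(attacks)


def everyday_rate(protocols, child, instigation, n_scheduled_samples):
    """Instigations per scheduled time sample; extra records are excluded."""
    hits = sum(1 for p in protocols
               if p.scheduled and p.child == child
               and p.instigation == instigation)
    return hits / n_scheduled_samples


records = [
    Protocol("A", "O attacks P", "retaliates", True),
    Protocol("A", "O attacks P", "withdraws", True),
    Protocol("B", "O attacks P", "retaliates", False),
    Protocol("B", "O attacks P", "retaliates", False),
    Protocol("D", "O attacks P", "withdraws", True),
]
```

Run periodically during data collection, `sparse_children` indicates which children need extra recording. The `scheduled` flag keeps the extra protocols out of any estimate of everyday pressures while leaving them available for the personality indices.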

In the example immediately above we 
have dealt with a research case in which 
both personality study and representa- 
tiveness of instigations are involved. 
Relatively few studies of personality 
attempt to achieve the latter. Situational 
representativeness rarely goes beyond
the time sampling of a few situations,
and that usually involves observing the
child at play with his peers or occasional
observation at school or at home.
In most Western European countries
the home is viewed by parents and
researchers as a bit too sacred to permit
one-way screens or even the randomly
timed presence of a silent observer. It
is probable that some personality development
problems can best be studied
in societies in which the privacy of
home life is less highly valued.

Representativeness of
situations is generally ignored for a
more insurmountable reason: there is
a dearth of adequate analysis of the 
problem at both the empirical and 
theoretical levels. Barker and Wright 
(1955) have made a valiant effort in this 
direction, but despite their interesting 
index of the similarity between two 
“settings” for action the categorizing of 
the settings themselves is still at an 
extremely empirical descriptive level. 
We are still awaiting, from whatever 
source, some generally useful methods 
for “placing” settings (and their at- 
tendant instigations) in various concep- 
tions of “social space.” It is largely on 
these issues that the operational value 
of Parsons’ pattern variables (Parsons
and Shils, 1951) may be evaluated as far
as child development is concerned. In
the meantime, however, we need a number
of alternative theoretical approaches
to these problems. Such theoretical con- 
tributions could have a very salutary 
effect on our strategies in the observa- 
tion of interpersonal behavior. 

In research cases in which only per- 
sonality differences are involved the 
main focus is on the distribution of
things done by P to a given class of
instigation, which is compared to the
distribution provided by another P.
When we are sampling the interactions
that occur in a group, then, as in the
case of the Murphy study on sympathetic
behavior (Murphy, 1937), the
observer is set to note all occurrences of
a certain class of instigations (in that
case, forms of “distress” in any O) during
the time sampled. When two groups
are being compared, in short, the representativeness
of both the instigations and
the responses to the instigations is
equally vital, since the minimal unit
under study is an instigation-action unit. 

Such equal emphasis on instigation 
and action defines what we referred to 
as the second basic strategy in the 
sampling of interactions, and we shall 
discuss other strategy factors in considering
this case. First, however, let us
pause again on this issue of the unit to 
be observed. The minimal unit is, in the 
Murphy study, “distress in O, sympa- 
thetic or nonsympathetic behavior in 
P.” This permits analysis of the per- 
sonality dispositions of the various P’s, 
assuming that the frequency of instiga- 
tions is equal or at least large enough for 
each. It permits also the comparison of 
two groups in terms of the frequency 
of occurrence of such units. It permits, 
in addition, the analysis of the “initiation 
of signs of distress” by the various O's, 
provided proper names are always part 
of the protocol. However, full analysis 
of the conditions under which the 
various O’s initiate signs of distress is 
not possible unless the observer is also 
set to record all instances of the occur- 
rence of “instigations to O to initiate 
signs of distress such as hurts, frustrations,
etc.” A matrix of group structure
is also possible with these data, in terms 
of who gives sympathy to whom, 
viewed simply as a description of the 
present group in action. 
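A matrix of this kind can be tallied directly from the minimal units. The sketch below assumes each unit has been reduced to a triple (O in distress, P responding, coded response); the triple layout, the code labels, and the children's names are our own illustrative assumptions, not part of the Murphy procedure.

```python
from collections import defaultdict


def sympathy_matrix(units):
    """Tally sympathetic responses as matrix[giver P][receiver O] -> count."""
    matrix = defaultdict(lambda: defaultdict(int))
    for o, p, response in units:
        if response == "sympathetic":
            matrix[p][o] += 1
    return {giver: dict(row) for giver, row in matrix.items()}


def sympathy_rate(units, p):
    """Share of the distress instigations met by P that drew sympathy from P."""
    met = [response for _, who, response in units if who == p]
    if not met:
        return None  # P met no distress instigations in the sample
    return sum(r == "sympathetic" for r in met) / len(met)


# Hypothetical units: (O showing distress, P responding, coded response)
units = [
    ("Ann", "Bob", "sympathetic"),
    ("Ann", "Cal", "nonsympathetic"),
    ("Bob", "Cal", "sympathetic"),
    ("Ann", "Bob", "sympathetic"),
]
```

The rate is computed per instigation met rather than per time sample, which is why the frequency of instigations must be large enough for each P before the personality comparison means much.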

Our point here, however, is to ponder 
some possibilities that are not usually 
considered. If the unit is expanded to 
include not only distress in O and P’s 
response but also O’s response to P’s 
response (or, whenever necessary, the 
behavior of a third party in bringing 
the sequence to an end, for example, 
when the teacher intervenes to com- 
fort O), the additional analyses are pos- 
sible. It would be possible to study the 
success O has with showing distress 
in terms of whether he gets comfort or 
not and whether the comfort tends to 
come from peers or adults. Such analysis 
might provide a basis to follow the 
changes in O’s distress-showing be- 
havior tendencies as a function of 
whether such tendencies are rewarded 
or not rewarded. Further, knowing 
how O responds to sympathy or lack 
of sympathy may not only provide a 
check on the classification of his signs 
of distress but also a basis for an in- 
ference of his intention in making the 
original outcry. If he is quickly or easily 
“pacified” by help or sympathy, then 
the strength of his succorance need may 
be inferred and compared with that of 
the child who maintains the succorance 
demand as long as it brings help or 
attention. Such inferences would re- 
quire independent measurement in the 
long run. 

In terms of Fig. 20-1, there is no end 
but that of practicality to the exact size 
of units of action that might be useful 
for some purposes. Some children are 
characterizable as leaders who like to
initiate interaction sequences that are 
long and complicated, yet may have im- 
portance as dimensions of personality. 
A given P may show persistence in 
initiating a form of imaginative play, 
which may aim to include all the other 
children present, may or may not in- 
volve content of a reciprocal, aggres- 
sive, or dominating nature, and may 
extend over days instead of hours. It 
is not only “leaders” who initiate long 
interaction sequences, of course. Some 
children initiate play patterns of a form 
that permits them to follow some more 
imaginative child or merely to watch 
while some conflict or constructive 
play emerges from the watcher’s initia- 
tion. 

The usual considerations involved 
when children are being sampled to 
represent a town, city, or tribe are age, 
sex, socioeconomic status of the family, 
and the placement of the child in the 
family. We shall not discuss these 
points, since they are (in principle) part 
of the strategy of any research in child 
development. What we would like to 
point out is that there are practical as 
well as theoretical issues involved in 
the selection of the population out of 
which the sample is to be drawn. The 
issue of the “population” is usually a 
very bewildering one in a science in 
which the theory being tested is often 
couched in universal terms (“all chil- 
dren tend to regress in their play when 
frustrated”) but in which the subjects 
available to test the theory are very 
limited in number and representative- 
ness. Some researchers dream of a 
national sample as the ideal, others limit 
their aims to a city or town or even a
school, and still others dream of a 
world-wide sample of children for a 
study. It should be said that none of 
these populations is sacred. If the hy- 
pothesis contains within it certain rec- 
ognized population limits (“all 
children who grow up in the United 
States tend to regress in their play when 
frustrated”), then the population problem
is clear. There has been an increasing
tendency to place “cultural” as well
as national limits on hypotheses in child 
development, but our general ignorance 
on the specific import of such factors 
leads more often to vagueness on these 
issues. 

Some of these matters are discussed 
in a useful way by Whiting et al. 
(1953), who recommend a “primary 
sampling unit” as a reasonable popula- 
tion focus for some kinds of child-de- 
velopment studies. Such a unit has a 
number of virtues, which are not dis- 
cussed here. It has one lack of virtue 
as presently formulated in publications. 
This is that for complex research de- 
signs, in which several variables are to 
be analyzed to bring out the interactions 
among the variables, the number of 
children available in a PSU may be too 
small. There is one consideration, how- 
ever, that may have important conse- 
quences, even for one-variable studies 
of interpersonal behavior. This is that 
the PSU may facilitate the study of 
interpersonal interactions and interper- 
sonal activities because any given P is 
usually found interacting with the same 
O’s. This means that a given sample 
of P’s instigations and/or reactions may 
include a sufficient number of interactions
between that P and his particular
friends to provide a basis for
such additional analyses. All of this 
possibility arises from the smallness of 
the PSU and from the criterion of 
sociometric homogeneity called for by 
this “population” unit. The possibility 
of such additional analyses cannot, of 
course, be assumed, but such considera- 
tions may prove important in develop- 
ing multipurpose research designs in 
the future. 

It is interesting to consider the future 
problems in the strategies that may be 
evolved in attempts to define the popu- 
lation from which PSU’s themselves, 
or some similar units, should eventually 
be sampled. Perhaps Murdock (1957)
has come closest to this with his notion 
of a world ethnographic sample, but 
much further work will have to be 
done to carry his large idea into use in 
complex modern societies. It is in- 
teresting that the technology of the 
sample survey has so often focused on 
“national” samples, possibly because of 
the relatively atheoretical quality of 
most of the work done and because of 
the importance of national power units 
in current applied work. Such a popula- 
tion definition should by no means be- 
come assumed as the most useful or 
“natural.” Research endeavors that
are guided by more theory may eventually
require or find “natural” a number
of different notions on this score.
Since work done by the use of direct 
observation has not yet found any ready 
definition of research efficiency, as 
has perhaps occurred in the use of the 
sample survey, new notions of popula- 
tions and of subunits may be developed 
that may prove of general value in
behavioral science. 


STRATEGY ISSUES IN RECORDING
INTERPERSONAL BEHAVIOR


There are several issues of a strategic 
sort involved in deciding what kinds 
of records to take while observing 
children’s interpersonal behavior. These 
issues include whether or not to try 
to record everything that goes on, 
to permit inference in the protocols, 
to use preset categories or leave the 
categories to be developed a posteriori, 
and to try to save the time and cost of 
an analysis phase in the research by 
providing the recorder with preset 
forms or IBM cards that are punched 
while observing. 

There is no absolute answer to any 
of these questions, and the dearth of 
methodological research makes even 
partial answers difficult. Let us consider 
some of them, however, in the hope of 
creating a “sense of problem” for at 
least those readers to whom these prob- 
lems are new. 

The major strategy issue, with regard 
to field recording of interpersonal be- 
havior, is that between an approxima- 
tion to “total records” and the use of 
preset categories. Let us state again our 
opinion that the mystic notion of some 
absolute totality of record should be dis- 
carded. If so, we can define the “total 
recording” approach more usefully as 
one in which the observer is set to record
a rather wide array of interpersonal
phenomena in a form that necessarily
calls for a further analysis stage. This 
can then be compared to the strategy
of using precoded categories to avoid
such an additional analysis stage.

A number of rather imponderable 
issues then arise, imponderable at least 
in terms of our present methodological 
knowledge. One thing seems to be clear 
at first blush, however, and that is that 
when a very few categories of action 
are being observed preset codes can be 
very efficient compared to total rec- 
ords. Observer reliability can be ex- 
tremely high, a costly phase of research 
involving protocol analysis is avoided, 
and statistical analysis can be effectively 
speeded. It might also be said that the 
researcher is left with no more data than 
his theory calls for—assuming that he 
has such a theory. 

On the other hand, if a large number 
of categories is being observed, then it 
may be better to try to record a good 
deal of the flow of action in a manner 
that is determined more by the require- 
ments of reliable and valid recording 
(whatever they may be) than by the 
requirements of efficient analysis. De- 
spite the much higher cost in this 
strategy, the virtues are many, par- 
ticularly in exploratory stages of 
research in a given area. One can change 
his categories of analysis if he has such 
rich protocols to work from. This is 
helpful in exploratory work but dan- 
gerous in hypothesis testing, unless the 
bases of such decisions to change the 
categories are purely measurement 
criteria and the change is not done in 
such a way as to improve the prob- 
ability of upholding the original hy- 
pothesis. It is also possible that in the 
use of an additional analysis phase the 
reliability of the final categorizing may 
be improved, since the task can be done 
under conditions in which speed is not
so important. This depends upon how 
well the coders are trained, of course. 
There is also some reason to hope that 
protocols collected under what we are 
calling a “total recording” set may 
provide some fairly permanent data to 
which a wide array of category systems 
can be applied. We discuss this hope 
further in subsequent sections of this
chapter.

Another virtue of the “total record” 
is that it can provide some kinds of 
data that are rarely achieved with the 
use of preset categories, except perhaps 
when the preset categories are all-inclusive.
Consider a research case in
which a given P is being observed
systematically and in which both “instigations”
(or at least the basis for an
analysis of instigations) and P’s actions 
are recorded. Compared to other P’s, 
this P may be found to initiate many 
helpful acts toward others. According 
to some rules of inference, he would 
then be considered as a highly nurturant 
child. If the protocols include data on 
a wide array of other kinds of actions, 
however, such data may provide the 
basis for a measure of this P’s general 
activity level, as compared to the other 
P’s. It may then be discovered that rela- 
tive to all his initiations his helpful 
initiations are a very small proportion 
indeed—although they are higher in 
absolute frequency than those of others. 
It may then appear to be more reason- 
able to place him in the “low nur- 
turance” group of children for purposes 
of analysis. Furthermore, if the research
is focused on this P’s place in
the group interaction, then his helpful
initiations relative to all the initiations
of all the P’s studied may also place him
high or low, depending on whether
or not these other initiations are avail- 
able for index construction. 
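The contrast among these indices is easily shown with a small computation. The following is a minimal sketch with invented counts; the category labels and the three index names are our own assumptions for illustration, and no particular cutoff for "high" or "low" nurturance is implied.

```python
def nurturance_indices(initiations, category="helpful"):
    """Return three indices per child from counts of coded initiations.

    initiations: dict mapping child -> dict mapping category -> count.
    """
    group_total = sum(sum(counts.values()) for counts in initiations.values())
    indices = {}
    for child, counts in initiations.items():
        own_total = sum(counts.values())
        helpful = counts.get(category, 0)
        indices[child] = {
            # raw frequency of helpful initiations
            "absolute": helpful,
            # helpful initiations relative to all of this child's initiations
            "share_of_own": helpful / own_total if own_total else None,
            # helpful initiations relative to all initiations in the group
            "share_of_group": helpful / group_total if group_total else None,
        }
    return indices


# Invented counts: P1 is very active, P2 is quiet.
counts = {
    "P1": {"helpful": 12, "aggressive": 60, "other": 128},
    "P2": {"helpful": 8, "aggressive": 2, "other": 10},
}
indices = nurturance_indices(counts)
```

With these invented counts P1 leads on absolute frequency (12 against 8), but only 6 per cent of his own initiations are helpful, against 40 per cent for P2. The second and third indices can be formed only if the other initiations were recorded in the first place.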

There is an additional feature to total
records, which, though imponderable
in any case, is sometimes of great value.
If the record of interaction is kept
in this way, it is possible to change a
definition of a particular category in
the analysis stage, thus improving reliability
of coding and the fitness of the
code to the hypothesis under test. This
arises because the cues to be utilized
for categorizing are available for such
reshuffling, and their presence provides 
the researcher with an opportunity to 
report in great detail just what the 
contents of a category are. This may 
facilitate communication and may in- 
crease the possibility of exact replica- 
tion of studies. 

The recording of “cues” so that they
may be perused at leisure is a virtue 
that may, of course, be purchased at 
great cost. How many cues are lost in 
such recording that might be utilized 
by an observer who is set for on-the- 
spot categorizing? Under what condi- 
tions are more cues recorded under a 
“total set” than are noted by a preset 
categorizer? How speedy is the ob- 
server's reception of interpersonal be- 
havior? How important is having time 
to receive information for categorizing 
as compared to “total” recording? 
What are the reliable and valid cues to 
watch for or to record for a given 
class of behavior? Most of these are 
unanswered research questions. Mead 
(1953) has provided a clear example of 
the ethnographic value of the use of 
cameras and tape recorders as well as a 
larger number of observers in attempts
to capture more of the action for later 
analysis. There are still, however, few 
systematic studies of the value such 
devices add to direct observation. (See 
Heyns and Lippitt, 1954.) 

Let us turn to the issue of “inference 
or no inference” in the basic protocols. 
In its absolute form, of course, the argu- 
ment is rather empty, and the discussion 
should rest on a clearer definition of 
such terms as “inferring,” “deducing,” 
and “inducing” than we will attempt 
here. It seems clear, at least, that the 
more complex the judgment required 
of an observer, the more the chance 
for difference to occur in the judgments 
of two observers, unless the observers 
have been trained to differentiate the 
particular complex pattern. This may
mean that the judgments are more in 
error or that the phenomenon being 
judged is too complicated to be en- 
compassed in toto by one human’s span 
of apprehension. Perhaps the two dis- 
agreeing observers are seeing correctly 
two or more facets of the phenomena 
or are responding to different kinds of 
information in the action being ob- 
served. As soon as we put the problem 
in such modern terms as “there is al- 
ways a limit to the momentary capacity 
for information reception,” then we 
commence to see the enormity of our 
ignorance regarding what is going on 
in the observation situation. In short,
only basic theory-directed research on 
the process of observing and recording, 
and on the process of learning to ob- 
serve and record, will ever provide us 
with other than facile verbal solutions 
to this dilemma. How can we specify 
the kinds and contents of “cues” that 
must be recorded or observed so that
we can reliably and validly infer that 
two people are angry at one another? 
How many forms of “anger” expres- 
sion are there? Is the matter one that 
should be viewed in terms of “arousal,” 
for which instruments of a more 
physiological kind might be more 
fitting? Having decided some of these 
issues, we could then turn to a con- 
sideration of how adequately a human 
observer can function as a recording in- 
strument for such events and approxi- 
mate a clear answer to questions of how 
fast and how complicatedly he can 
operate. Such research has hardly be- 
gun, at least in any form of direct use 
for naturalistic observation. 

In the meantime, however, observa- 
tion will continue. When theory is 
adequate to specify what should be 
observed and recorded, then the only 
issue is that of reliability (and attendant 
issues of reportability and replication). 
When theory is not adequate to the 
task (as is so often the case), then all 
the researcher can do is develop a “feel” 
for the phenomena he is interested in 
and develop his observation method ac- 
cordingly, hoping that basic research 
will prove him correct in the long 
run. As Heyns and Lippitt (1954) and 
Gellert (1955) have suggested, recent 
practice has been to let “theory” or the 
“feel” discussed above determine what 
is to be attempted in observing inter- 
personal behavior rather than let the 
matter be decided totally by the re- 
liability obtainable between two in- 
dependent observers. Perhaps the modi- 
cum of success achieved in studies of 
direct observation means that there is 
more information about motives, feeling,
and intentions that is “immanent in
behavior” than we have generally sup- 
posed. This cannot yet be formally 
documented, but it may be that this is 
a constructive assumption to make. 
There are degrees of inference, prob- 
ably; or, in more everyday terms, some 
judgments are more complicated than 
other judgments. It is here that researchers
who have used some form of
total recording have made their com- 
promise. Observers are often asked to 
record “what is said and done by each 
actor” (the overt behaviors of Fig. 
20-1). They are asked also to record ad- 
jectives, adverbs, or other inferences but 
to place them in parentheses (O struck 
P; P smiled (painfully)). The “insights”
are recorded, at least, and can be used as 
the coding rules permit. Sometimes in- 
ference goes further, as when, in the 
Bales technique, an observer is expected 
to make category placements of par- 
ticular communicative acts accord- 
ing to their function in the group 
process—a rather complicated judg- 
ment but one in which a surprising 
degree of observer reliability can be 
obtained under the right conditions 
(Bales, 1950). Most observers of behavior
are still queasy about records
that might read, “O’s libido cathected 
P; P set up a defense (sublimation) and 
talked rapidly.” It is difficult, however, 
to rule out such records on rational 
grounds, particularly if theory calls for 
such inference and if the record is 
reliable and provides indices that relate 
meaningfully to other kinds of indices. 
Is an inference to the “cognitive field” 
of a child so different in kind from an 
inference to an unconscious state? Com- 
mon sense suggests that if protocols are 
to read like this last one, then they
should include “what O said and did” in
some other sense as well as in the sense
outlined above. In short, whenever our
language of description wanders too
far from that mysterious realm of
“everyday discourse,” we should spice
it with terms from “everyday discourse”
for the help of those colleagues
who have not yet learned our more
special theoretical language. Later in
this chapter we turn to this issue and
discuss our best guess as to how far we
might be able to go in the recording
of interpersonal behavior with just
“everyday discourse,” though even this 
can be viewed as hardly more than a 
temporary stop-gap (Heider, 1958) or 
at best as a source of hypotheses for 
basic research of the kind we have de- 
scribed so briefly. 

Field Strategy. There are practical as 
well as theoretical issues in the following
questions, which arise in most research
in interpersonal behavior. How
many observers? Where should they be 
placed to observe? How many children 
should they watch? How long should 
they watch at a time? 

Let us first consider some of the pos- 
sibilities of error in the observer-subject 
relationship itself, avoidance of which 
may sometimes determine the answers
to such questions as those above. One 
source of error is that of the effect of 
a particular observer on a particular 
kind of child; another is the consistent 
differential effect of different observers 
on all the children. The first of these 
can be controlled or studied only by 
the systematic use of a number of 
observers or by the use of a one-way
vision screen. This source of variance
can never be completely ruled out
except by statistical means. The second
is handled in the same way but
with at least one complication. This 
is the fact that shifting observers keeps 
up the children’s interest in these 
novel stimuli longer than if one ob- 
server is used and he becomes part of 
the “background” as Spiro (1958, p. 
465) has described so clearly. It is 
probably worthwhile to spend many 
days or weeks having children become 
habituated to an observer before re- 
cording starts, as recommended in the 
field guide for the Cornell-Harvard- 
Yale Socialization Study (Whiting et 
al., 1955). Even this precaution may 
not be enough. The writer recalls 
vividly how the interest shown in him 
by a group of nursery-school children 
continued to grow for several weeks. 
He never managed to become part of 
the background because he was observ- 
ing the children in an experiment in 
which the children received candy on 
a random reinforcement basis! 

Another source of error is the effect 
of the children, so to speak, on the 
observer. This can take many forms, 
but one of the more dramatic has been 
described by Campbell (1959) as an 
error that arises because of the order in 
which stimuli to be judged are pre- 
sented. Professors who visited one Rus- 
sian city first found it drabber than the 
second one they visited, whereas those 
who visited the second city first found 
it drabber. Such errors unquestionably 
occur in the observation of interper- 
sonal behavior. They can be controlled 
at least in part by planning the order 
with which a given observer watches 
particular children, groups, or settings.
In an unpublished research work done
at Cornell, in which whole families 
were observed, John Harding em- 
ployed a design that required a num- 
ber of observers, a simplified version of 
Bales’ category system, and a time- 
sampling procedure in which each ob- 
server watched one family member for 
two minutes, then shifted systematically 
to other members. A counterbalanced 
design helped to control at least some 
observer errors that arise from order 
effects. Gellert (1955, pp. 185-186) has 
compiled a useful list of 13 additional 
sources of observer error, which the 
careful research strategist will wish to 
peruse before considering the field plan 
a final one. 
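
A counterbalanced rotation of the kind just described can be sketched in a few lines. The details of Harding's actual design are not given here, so the following is only an illustration under stated assumptions: there are as many observers as family members, each observation period lasts two minutes, and the rotation is a simple cyclic Latin square. The observer and member labels are hypothetical.

```python
def counterbalanced_schedule(observers, members):
    """Cyclic Latin square: in period t, observer i watches member (i + t) mod n.

    Assumption of this sketch: one observer per family member.
    """
    n = len(members)
    if len(observers) != n:
        raise ValueError("this sketch assumes one observer per family member")
    return [
        {obs: members[(i + t) % n] for i, obs in enumerate(observers)}
        for t in range(n)
    ]


# Hypothetical labels, for illustration only.
schedule = counterbalanced_schedule(
    ["Observer A", "Observer B", "Observer C"],
    ["mother", "father", "child"],
)
for period, assignment in enumerate(schedule, start=1):
    print(f"period {period} (two minutes):", assignment)
```

Over the full rotation every observer watches every member once, and every member is watched in every period. A cyclic square balances position in the order but not the immediate sequence of members, so carry-over from one member to the next is only partly controlled.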

Let us now return to some of the 
questions raised at the beginning of 
this section. It should now be clear that 
it is difficult to make the job of direct 
observer of interpersonal behavior one 
for a single man. Even if no attempt is 
made to provide interobserver reli- 
ability estimates for more than a small 
portion of the observation, the kinds of 
errors outlined above suggest that it 
may usually be better for at least two 
observers to spend less time at what one 
could do less well with more time. But 
the number of observers needed de- 
pends upon how complex the judgment 
task is, how many categories of be- 
havior are being observed, how many 
actors must be watched in order to 
obtain the necessary information, and, 
of course, on how ambitious the unit 
of observation is, among other things. 
It is strange that animal psychologists, 
who often observe rather simple actions 
or interactions, have developed the
most objective means of recording, al- 
though it is also true that a pigeon in 
a Skinner box can peck away at a spot 
at a rate impossible to count unaided. 
At the other extreme, however, it is 
only fair to say that some anthro-
pologists—particularly Mead (in Car-
michael, 1954)—have developed highly
ingenious and careful methods for re-
cording the rather complex occurrences
at tribal rituals, including the use of a
good many brass instruments. Modern
television reporting may be the current
apex of field recording methods. Tele-
vision recording of the tribal rituals of
more complex cultures may someday be
useful to all students of human be-
havior. Television coverage of even a
baseball game is no one-man job. If
only the frequency of dependent and
aggressive behavior of a particular child
in nursery school is wanted, then one
man may do very well. 

How many children or children and 
adults can one person observe? Many of 
the foregoing considerations apply to 
this question also. The clearer and sim- 
pler the cues to be observed, the more 
available and geographically concen- 
trated the action under perusal; the sim- 
pler the unit of observation, the more 
homogeneous the goals of the action, 
etc., the more people can be observed. 
Whiting reports that one observer could 
reliably record two classes of action of 
several children in the Iowa studies 
directed by Sears et al. (1953), but 
that when instigations were added to 
the task and nine categories with a 
quasi-total recording set were intro- 
duced, as in the Cornell-Harvard-Yale 
Socialization Study, then no more than
one person could be observed at all
reliably (Whiting, 1959). In that
project, we should also report, there
were usually two field personnel in-
volved before a protocol was finally
written up. Although more complex,
this was a procedure somewhat akin to
that of Barker and Wright (1955),
in which the observer is queried on
particular points by another person
while preparing the final draft of the
protocol. This last is a notion that (like
many others) deserves methodologi-
cal research in its own right.

The question of how to place the
observer or observers relative to the
actor (or actors) leads us to suggest the
following analogy with the problem
of defense in the game of basketball.
One technique is to have one man
follow his opposite number wherever
that actor goes. This is called a “man-
to-man defense.” Another strategy is 
to employ a “zone defense,” in which a 
defender watches over whatever goes 
on in a given area of the playing field, 
regardless of who is doing it. Barker 
and Wright have employed both 
strategies, at least in principle. The One 
Boy’s Day (Barker and Wright, 1951) 
approach is a man-to-man observation; 
their later work, with its heavy em- 
phasis on “setting,” includes some 
analyses that are akin to “zone” meth- 
ods, in which, ideally, an observer 
stands in Kane’s drug store and records 
what goes on regardless of who does it. 
Either of these placements of the 
observer can achieve an adequate sample 
of the behavior of the children in a 
group in the sense of sociological single 
acts or sociological interactions. Zone
observation, however, may make analy-
sis of personality differences more dif- 
ficult, since the observer must know 
so many names and because much re- 
shuffling and cross-filing of data would 
be necessary. It provides data, how- 
ever, in a form that may facilitate an 
analysis in which an inference to the 
group may be made easier—since a 
given observer’s protocols may provide 
a fair time sample of “what instigations 
and/or reactions occurred at ‘Kane’s 
Drug Store’ (or in this area of the play 
school).” 
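
The reshuffling and cross-filing that zone observation demands can be pictured with a small sketch. The record layout, the settings, and the children's names below are all invented for illustration; the point is only that the same zone protocol can be read by setting, as recorded, or re-filed by actor when personality differences are at issue.

```python
from collections import defaultdict

# Hypothetical zone-protocol entries: (setting, minute, actor, act).
zone_records = [
    ("drug store", 1, "Tom", "asks for help"),
    ("drug store", 2, "Ann", "pushes Tom"),
    ("play yard", 1, "Tom", "offers toy"),
    ("play yard", 3, "Ann", "asks for help"),
]


def cross_file_by_actor(records):
    """Re-file zone records so that each actor's acts can be read together."""
    by_actor = defaultdict(list)
    for setting, minute, actor, act in records:
        by_actor[actor].append((setting, minute, act))
    return dict(by_actor)


for actor, acts in cross_file_by_actor(zone_records).items():
    print(actor, acts)
```

The re-filing is trivial once every act carries an actor's name. The practical burden falls on the observer, who must identify each child correctly at the moment of recording.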

Many more strategies are possible, of 
course, many of which have not even 
been tried seriously. It is possible, for 
example, to divide labor by having one 
observer look for instigations and 
another for resultant or contiguous 
action. Another would be to have one 
observer specialize in one class of in- 
stigation, while the other watches for 
something else. Still another variation 
would be to have a pair of observers 
watching a pair of actors, with one of 
the observers trying to watch the things 
that P watches and to record what P 
does and the other trying to look where 
O does. Coordination of such protocols, 
their statistical analysis, and the re- 
search designs that would be necessary 
in order to rule out various observer 
errors would present many fascinating
problems.

Possibly a definition of efficiency in 
the observation of interpersonal be- 
havior will arise by maximizing the 
values of different possible strategies for 
a given research problem. In the Cor- 
nell-Harvard-Yale study it was hoped 
that the field strategy would permit
the sampling of both instigation and
action of nine behavior systems, with
man-to-man observation within pre-
arranged (and representative) settings.
It was further hoped that these settings
were representative of fairly clear sub-
units (PSU's) of strategically selected
cultures. It will be interesting if this
was achieved. Spiro (1958) has reported 
on observations obtained by use of a 
rather complicated strategy in which 
man-to-man observation was inter- 
spersed with observation of particularly 
relevant instigation situations, both of 
these set within a rather constant use 
of ethnographic observation. His re- 
marks on the differences between sys- 
tematic observation and observation of 
the usual ethnographic sort are par-
ticularly valuable (see Appendix).


Clues Used in Observing 
Interpersonal Behavior 


At the heart of most definitions of 
the categories into which observed be- 
haviors are sorted lies the issue of the 
cues observers should be attuned to 
while observing. Considering the case 
of the recording of single acts is fruit- 
ful in clarifying some basic distinctions, 
which may also prove useful when more 
complicated units are being dealt with. 
We have made a place for these distinc- 
tions in Fig. 20-1 by the use of the 
entities “P’s response—environmental 
change—goal response” in describing 
P's (and the others’) action. Let us 
outline some of the implications of 
these entities for the choice of more or 
less ambitious aims in the cues to be 
utilized in observation of single acts.


MOVEMENTS DIRECTED
TOWARD THINGS AND 
OTHER PEOPLE 


Some words in our everyday lan- 
guage refer to movements in space-time 
of an actor, such as “kick,” “strike,” 
“push,” and “pull.” These terms refer, 
with some vagueness or ambiguity, only 
to the movements in space-time co- 
ordinates. Such movements are the 
classical ideal data of some forms of 
behaviorism, and the coordinates repre- 
sent for some psychologists the basic 
coordinates for the description of action 
in the sense that “observable behavior” 
is limited to movements that, in prin- 
ciple at least, can be reduced to such 
terms. This has been an idea of great 
importance in the development of 
modern psychology, holding out as it 
does the lure of an objective and strictly 
public referent for our behavior cate-
gories.

The history of this idea should be 
written and its present place evaluated, 
but we shall not attempt such a task 
here. There are two remarks, however, 
that we would like to make on the 
topic, even at the risk of superficiality. 
In an important sense this idea has 
been applied more superficially than 
wisely to observation work, and in 
some respects it has only recently been 
applied at all. Let us expand briefly. 

In work on behavior observation we 
often interpret the notion of movement 
(and the scientific principle regarding 
its objectivity) as counseling us to 
“record the actual movements and noth- 
ing more.” This is a very simple render- 
ing of the notion we have outlined, 
since we have suggested that our rec-
ords should in principle be reducible
to movement terms in space-time, not
that the protocols be rendered in noth-
ing but space-time terms. It is interest-
ing that the purists in this regard have
never worked out a serious, generally
useful language of “merely movement,”
nor have they approached movements
of people with the kind of analytic
intention toward developing basic units
in the way linguists have brilliantly
done in their treatment of the sounds
emitted by people. Short of having such
a pure movement language, or a set of
generally accepted units such as the 
linguists have, it has been sensible, 
perhaps, to try to keep to terms that re- 
fer as much as possible to “what the 
subjects do.” The hope has been that 
such a set in the observer will call out 
in him his naturally coded terms for 
describing movement. He will record
“P struck O in the face with the flat 
of his hand” rather than “P struck O 
in the face with clear intention to hurt.” 
The trouble is, of course, that within a 
given “behavioral language” it may be 
quite reasonably possible that there 
are “movement components,” which 
might validly (either conceptually or 
directly) distinguish between a hurting 
act that was intended to hurt and one 
that was not so intended. Primitivizing 
the set of the observer may merely rule 
out the recording of solid empirical 
facts, which our natural movement 
language code may be inadequate to 
capture. Perhaps the nearest we could 
come would be something like this: 
“P struck O in the face with the palm of 
his hand, but with that characteristic 
decrease in speed, as the palm ap- 
proached the face, which usually means
that a modal P’s intention is to be play-
fully aggressive.” The statement would
ideally end with a reference to some un- 
written “dictionary of movement” in 
which the correlation between the de- 
crease in speed and the intention 
(directly or conceptually measured) is 
quoted. In short, we have lost much in 
the observation of movements because 
we have hardly begun to face the 
complexities and the fascinating re- 
search problems involved in untangling 
our language code from the phe-
nomenon we wish to code.

Two factors have probably en- 
couraged our naivete on these matters. 
The first is the fact that in important 
experimental work we can focus on a 
fairly limited set of movements or ac- 
tions, properties of which can be auto- 
matically measured and used as de-
pendent variables. Second, in this way
it is possible (by the use of some im- 
plicit assumptions) to work with move- 
ments or actions in such a clearly 
objective manner that the broader im- 
plications of the relation of the position 
to our usual language codes need not 
be faced. The observation of inter-
personal relations in everyday or even
laboratory settings brings all these im- 
plications clearly into the open. For 
such purposes even the purest be- 
haviorist must “botanize,” despite 
Skinner's well-taken worries on this 
score (1938), and such botanizing will 
either be done well or it will be done in 
a naïve manner, which may cover up a 
good many of the phenomena that 
occur in a replicable and ordered way 
even in our “usual” space-time co-
ordinate system.

So we have argued that the implica-
tions of a behaviorism of the observa-
tion of interpersonal behavior have 
hardly begun to be investigated, much 
less “used.” There may well be limits 
to how much of the action of children 
can be caught by protocol sentences, 
which are, in principle at least, re- 
ducible to sentences having to do with 
movements in space-time. We just do 
not know. It may, of course, turn out 
to be rather impractical to make such 
an attempt with some categories or 
properties of interpersonal behavior, 
since the complexity of the movement 
pattern corresponding to the property 
may be very great and may change 
greatly from child to child. There is 
some hope, however, in the linguistic 
approach by researchers like Hockett 
(1958), who suggests that at least for 
English speakers there may be fairly 
reliable psychophysical aspects of 
speech pauses, which may reflect such 
subtle matters as the fact that someone 
has “changed his mind” in the middle of 
a sentence.

Past attempts to keep observation 
categories close to movements have 
often been accompanied by a relentless 
search for high reliability in the records. 
Two such criteria used too stringently 
can serve to lead research up a path 
toward highly specific indices and cate- 
gories that predict only themselves and 
that do not satisfy legitimate curiosity 
regarding other aspects of the inter- 
personal behavior of children. Such em- 
phases, although they should not be lost 
sight of, may well be held in abeyance 
for a time in order that the complexities 
of interpersonal interactions and such 
matters may be explored.

Bott (1933) is quoted by Gellert
(1955):


In surveying the results presented (in
various studies), it is evident [that attention]
has been focussed, first on the [training]
of observers, which is regard[ed as the]
basic condition of reliability [...]
studies. Constancy of the data has re-
ceived the next amount of attention, but
the difficult question of validity has been
passed over with comparatively little
study. We would like to suggest a re-
versal of this order; that is, that attention
should be focussed on discovering what
units of behavior have representative and
predictive significance in relation to the
whole of the child’s behavior. When this
has been done, it will next be in order to
proceed with detailed studies of the con-
sistency of the behavior items. The train-
ing of observers can then be per[...]
till a sufficient reliability has been re[ached].
If interest is focussed first on the training 
of observers, there is danger that the ele- 
ments of observation are so reduced in 
the interest of reliability that they are 
largely emptied of significance. On the 
other hand, if significance can be demon- 
strated with observations which have only 
a modicum of reliability, it is safe to 
conclude that a more rigorous control of 
procedure will only heighten the sig- 
nificance of what has already been 
demonstrated to have meaning in relation 
to the whole of the child’s behavior (p. 
184). 


It may be valuable at this point to 
make a distinction that seems to be 
rather important in any future search 
for movement-pattern correlates to 
theoretically important aspects of inter- 
personal behavior of children. To some 
this reduces to a problem in the “per- 
ception of action or behavior” by an 
observer, calling for experiments of 
the usual perceptual variety, ideally
psychophysical in method. Such an in-
terpretation of the problem may be a
costly error because of the limited
resources of information and procedure
usually permitted a subject who is
“perceiving” someone else’s action. The
subject in an experiment needs to be
granted the status of a “scientific ob-
server” before such psychophysical re-
lationships become directly useful to
the improvement of behavior observa-
tion techniques. Regardless of the logi-
cal or doctrinal issues in this distinction,
the “scientific observer” is usually per-
mitted to record copiously, to make
systematic (often statistically compli-
cated) inferences, and to use all modern
technical means to get information
about what the “object of judgment”
is doing. It is the rare study in the
“perception of persons” or “perception
of behavior” that permits such exten-
sive behavior on the part of the subject
or observer. It is just such procedures
that are at issue in our present problems
—not merely simple percepts based on
relatively little information. Psycho-
physics must become more complicated
in its operations in order to become use- 
fully “global” in this area. 


EXPRESSIVE ACTS 


Some of the energy transformations 
or movements that children and adults 
engage in or find themselves doing 
have little to do with changing the en- 
vironment or with obtaining resources. 
In fact, probably more often than not, 
these things are not even meant to be 
communicated to others but are often 
the crucial cues that provide others
with an interpretation of large segments
of the interpersonal behavior of the 
actor who emits them. We shall refer 
to these events as “expressive acts” 
merely because this term comes closest 
to the meaning we intend, even though 
it is far from accurate. Perhaps our 
very ignorance of these events makes it 
difficult to find a more useful word. 
One of the main troubles with the term 
“expressive movements” is that it has 
been used in personality study to refer 
vaguely to the personal style that sets 
a person apart, often uniquely, from 
others. With the broadened perspective
of our modern social-psychological or 
communication approach to person- 
ality, such usage is too limiting and 
constraining. Some expressive acts or 
movements are possibly as much shared 
and equally as objective and general as 
“forms of request” in language or 
modes of chopping down trees in motor 
behavior. 

Consider, for example, Hockett’s 
statement that grows out of his in- 
tensive experience in trying to use 
linguistic and other tools to name some 
of the interactional modes that occur 
in a psychiatric interview. 


In one brief episode—about three sec- 
onds long—in a taped psychiatric inter- 
view which I have been studying, the 
patient goes through an observable process 
of emotional editing. She manifests, in 
turn, the signs of three different nameable 
emotional reactions to what has gone 
before. The only way I can understand 
this episode is to assume that the patient 
cannot know what emotion she “feels”—
that is, which emotion is culturally ap- 
propriate to the setting—without trying 
a bit of each and judging it by feedback. 
The manifestation is at a lower power
level, giving the impression of a sort of
pastel kaleidoscope of emotionality; but
the power level is not so low as to render
the sequence indetectable (1958, p. 11).


Although, like any class of behavior, 
“editing behavior” can be studied in 
terms of individual differences or even 
unique patterning, there is no necessity 
to do so. What is needed is the descrip-
tion of the linguistic and other move-
ment patterns that correspond to the 
subtle activity of editing (or at least 
vary the way they should according 
to some theory of editing); this cate- 
gory can be studied, presumably, in 
any person and with all the methods 
available to any general behavioral
phenomenon. Such research as that of
Hockett and the attempts of Birdwhistell
(1952) to develop methods for a purely 
movement-based study of interaction 
may provide us with some important 
categories for the study of interpersonal 
behavior of exactly this kind. 

Another problem with our term “ex- 
pressive acts” is that it is often used 
solely in the context of a trait or dis- 
positional psychology, in which we 
study a person’s expressive actions dur- 
ing interactions solely to obtain in- 
dices of supposedly deeper intervening 
traits. We wish merely to point out 
that, here again, this is not the only 
service to which such indices can be 
put. They may serve as independent or 
dependent variables in their own right. 
They may also serve to enhance the 
reliability and validity of more complex 
interpersonal categories into which we 
may wish to place movements and 
actions. 

The potential value of expressive acts 
or movements as dependent, nomothet-
ically viewed variables in their own
right is clearly shown in a study by 
Levin et al. (1959). They tape-recorded 
speeches made by children under 
various audience conditions, then had 
two coders, working independently, 
score the following categories: (1) 
length of speaking in seconds; (2) num- 
ber of words uttered; (3) number of 
stutters; (4) prolongations (like
“speeeech”); (5) shifts (starting a word 
or sentence and shifting before the 
utterance is finished, as “the b-girls 
want”); (6) sighs; (7) pauses (any 
cessation of speech for two seconds or 
more was scored, and the actual length 
of the pause was timed); (8) word 
whiskers (these are the ubiquitous “uhs” 
around words); and (9) catches in the 
voice. They found that the lowest correla-
tion between the scores of the two
coders was .80 and the highest was 
above .90. They were able to show that 
these indices were sensitive to system- 
atic changes in audience conditions and 
that they were also sensitive to dif- 
ferences in certain personality prop- 
erties of P’s, measured independently. 
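
The kind of intercoder check reported above can be illustrated with a small sketch. The figures below are invented and are not data from Levin et al.; we assume only that each coder supplies one score per child for each category and that agreement is summarized by a product-moment correlation computed separately for each category.

```python
from math import sqrt


def pearson_r(x, y):
    """Product-moment correlation between two equally long lists of scores."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


# Invented scores for five children on two of the nine categories.
coder_1 = {"pauses": [3, 7, 2, 9, 5], "sighs": [1, 0, 2, 4, 1]}
coder_2 = {"pauses": [3, 6, 2, 9, 4], "sighs": [1, 0, 3, 4, 1]}

reliability = {
    category: pearson_r(coder_1[category], coder_2[category])
    for category in coder_1
}
for category, r in reliability.items():
    print(f"{category}: r = {r:.2f}")
```

A correlation of this sort is indifferent to a constant difference between coders, so one coder who habitually counts one pause more than the other would still appear perfectly reliable. Where absolute agreement matters, the raw differences deserve a separate look.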

Acts Which Have Certain Visible 
Effects on Others. The basic units of 
many category systems for the observa-
tion of interaction include reference to 
actions of P that have certain visible 
effects on others. Such units are a 
subclass of what are usually termed 
“instrumental acts,” and we have made 
a place for this category in Fig. 20-1 
by noting the “change of environment” 
that follows an action. Some observers 
have even tried to be systematic in 
constructing categories that have as 
their referents such observable changes 
in the environment. These categories
have a certain elegance from a neobe-
haviorist point of view, since they per-
mit definitions of actions that do not
require a detailed description in terms
of movements but still permit an im-
mediately observable datum that de-
fines the class of actions under study.
This is the case in which responses are
defined as “the number of times an
animal managed to push down a bar”
or “the speed with which the rat ran
to the goal box at the end of the maze.”
It is also the case in which observers
define aggression as “that class of acts
wherein the behavior of P causes signs
of pain in O” or in which dependency
is recorded as “that class of acts of P
that elicit helpful actions from O.”
A great many of our everyday terms
for action refer at least in part to the
effects of behaviors. This means that
categories defined in this way are close 
to our usual language codes, at least 
in English, and represent one level 
of refinement above everyday usage. 
This may have both advantage and dis- 
advantage in training observers. Ob- 
servers need not be so well trained to 
record these kinds of behaviors as they 
must to record expressive actions, but 
they must practice ignoring some of the 
cues they habitually use in observing 
behavior in everyday life. 

The visible effects of a child’s actions 
on others have considerable theoretical 
importance in some conceptual schemes. 
If they happen also to be the effects that 
are visible, or at least apprehended by 
the child, then they may be the things 
that will become important aims of the 
child’s striving behavior (learned goals 
or subgoals), or they may provide a 
fairly objective referent in analyses of
the reinforcements that maintain the
child’s relationships with particular or 
generalized others. If a child, for ex- 
ample, plays often with children who 
do provide help when he asks for it 
(or when he merely gestures for it or 
shows some signs of need) but tends 
to drop out of diads in which such 
effects are not forthcoming, then we 
have information about the child and 
also about his interpersonal relation- 
ships. 

This example points up one of the 
problems of categories defined in terms 
of the effects of P’s behavior on others 
(or the physical environment). This 
problem is that the observer would 
often like to have a record of the 
attempts at succorance as well as the 
failures and successes in order to make 
ratios of attempts to successes and 
failures to predict the development of 
interpersonal associations based on 
partial or total reinforcement of “de- 
pendency needs.” To achieve such ad- 
ditional recorded information, at least 
one and possibly two further kinds of 
categories of actions must be con- 
sidered. 
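
The ratios mentioned above are easily computed once attempts are recorded along with their outcomes. The following sketch assumes a hypothetical protocol in which each entry names P, the other child O, and whether O gave help; the names and counts are invented.

```python
from collections import Counter

# Hypothetical protocol entries: (P, O, outcome), where "success" means O helped.
attempts = [
    ("Tom", "Ann", "success"),
    ("Tom", "Ann", "success"),
    ("Tom", "Ann", "failure"),
    ("Tom", "Ben", "failure"),
    ("Tom", "Ben", "failure"),
]


def reinforcement_ratios(records):
    """Proportion of P's attempts at succorance that each O rewarded with help."""
    tried = Counter()
    helped = Counter()
    for p, o, outcome in records:
        tried[(p, o)] += 1
        if outcome == "success":
            helped[(p, o)] += 1
    return {pair: helped[pair] / tried[pair] for pair in tried}


for (p, o), ratio in reinforcement_ratios(attempts).items():
    print(f"{p} -> {o}: {ratio:.2f} of attempts reinforced")
```

A category defined only by its visible effect on O would have recorded the two successes and nothing else, and neither ratio could then be formed.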

Acts Which Usually Have a Certain 
Effect on Others. One step toward the 
fuller information called for in the fore- 
going example is to set the observer to 
look for acts that usually have a certain 
effect on others, whether they succeed 
in any particular case or not. There is 
a limited, but fairly large, number of 
ways, for example, by which P can 
usually hurt another. One is to hit the 
other physically, and this is a clear and 
objectively recordable class of complex 
movements. But suppose a very young 
child strikes a very much bigger and
older child and merely achieves a laugh-
ing retreat from the other? In such 
cases the important thing is that in 
terms of the usual “language of action” 
there is an interesting disjointedness in 
this interaction, which requires that we 
distinguish attempt from effect if we 
are to describe the matter with fidelity. 
P’s act of hitting can only be placed in 
the category “aggressive,” in which it 
probably belongs (particularly if P 
continues to try to hurt and says angry 
things at the same time, despite O's 
laughter and retreat), if we refer it to 
the aggression category on the basis that 
such acts (or movement patterns) 
usually achieve hurt in others.

There are at least two ways of 
providing a rationale for placing the in- 
effectual but usually hurting act into 
the aggression category. One is to 
refer to a table that shows that such acts 
usually (statistically) are followed by 
visible signs of hurt in most others. The
other is to refer to the conventional 
“language of action” for which, un- 
fortunately, no dictionary has yet been 
written but on which most experienced 
members of a culture might well agree. 
The code for such a language is prob- 
ably pretty much a learned thing, and 
it would be very helpful indeed to have 
age-norm data on this learning, since 
very young children may be behaving 
in such a way that the adult or mature 
code can serve merely as a source of 
observer reliability but provide no 
validity. 
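
The first of these rationales, the statistical table, might be built along the following lines. No such table exists in the sources cited here; the act types, the observations, and the cutoff of one half for "usually" are all assumptions made for the sake of illustration.

```python
def hurt_rates(observations):
    """observations: iterable of (act_type, visible_hurt) pairs, visible_hurt a bool.

    Returns, for each act type, the proportion of occurrences that were
    followed by visible signs of hurt in O.
    """
    totals, hurts = {}, {}
    for act, hurt in observations:
        totals[act] = totals.get(act, 0) + 1
        hurts[act] = hurts.get(act, 0) + int(hurt)
    return {act: hurts[act] / totals[act] for act in totals}


def usually_hurts(act, rates, threshold=0.5):
    """Assumed rule: an act 'usually' hurts if its rate exceeds the threshold."""
    return rates.get(act, 0.0) > threshold


# Invented observations: hitting was followed by visible hurt 4 times in 5,
# teasing once in 4.
observations = (
    [("hit", True)] * 4 + [("hit", False)]
    + [("tease", True)] + [("tease", False)] * 3
)
rates = hurt_rates(observations)
print(rates)
print("hit counts as aggressive:", usually_hurts("hit", rates))
```

With such a table the ineffectual blow of the very young child would still be filed under aggression, because it belongs to a class of acts that usually hurts, whatever its effect in the particular case. Separate tables for separate age levels would meet the point about age norms.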

Acts Which Achieve (or Usually 
Achieve) an Inferred Effect on Others. 
In the last two categories we have dis- 
cussed we have assumed that there is 
(or usually is) a visible effect on O
following P’s action, which can be
recorded and which can even serve as 
the defining characteristic of the cate- 
gory. One of the main problems in 
the observation of interpersonal be-
haviors is the brute fact that often
there are effects that are not immediately 
visible or that may never become visible. 
If, for example, a very large boy strikes 
a rather small one with all his strength, 
we would expect that even if the small 
chap laughs he has been “hurt.” His 
laugh may, of course, have a different 
intonation or stress pattern, which may 
be a reliable sign of the otherwise in- 
visible “hurt,” or he may laugh first 
and then grimace or cry, providing 
visible indices of the class of “visible 
hurtings.” Fortunately, young children 
often carry their feelings “on their 
sleeves,” so to speak, but it would be 
naive to hope that this would always be 
the case. The observer then must have 
recourse to the mysterious process of 
“inferring that O is hurt,” and this is 
often done with considerable interrater 
reliability. The mysteriousness of the 
process arises because the observer often 
cannot state the objective basis of the 
inference. The trouble with the process 
arises when the basis of inference 
shifts, thus destroying the systematic 
quality of the report in the protocol. 
Suppose the beleaguered younger child 
in our example gives no cues in his reac- 
tion that he is “hurt,” other than the 
visual signs that some of his skin has 
been temporarily displaced by the fist 
of the older boy. The observer may 
decide that the young fellow is hurt 
because of the very strength of the 
blow, because some onlookers gasp in 
apparent sympathy, because he feels
that he himself would feel hurt by such
a blow, or because of the clear signs
provided by the older boy that he
clearly intended to hurt the younger.
Any or all of these sources may provide
a tentatively valid contextual basis for
inferring the hurt of the brave young
[...]; there is probably no way of
knowing for sure. But it would be most
uncomfortable if the observer later ran
a correlation between “inferred
achievement of hurt in interaction” and
“inferred intention to hurt in interac-
tion” and discovered a high intercor-
relation that was in large part due to
the fact that the information regarding
the intention of the large boy was also
used in judging the hurt in the other. 

A good deal of the time spent in
training observers should be spent in 
differentiating the classes of informa- 
tion to be used for making, for example, 
inferences of hurt. Usually the direc- 
tion of such specification will be toward 
becoming sensitive to the expressive 
acts of the other or to higher-order 
patterns or even “higher-order vari- 
ables” that are involved in such action. 
Unfortunately, we lack terms for most 
of these or the knowledge on which to 
base a scientific language. Until such 
language has been developed, or until 
the knowledge has been gained, prob- 
ably the second best strategy is to train 
observers to be sensitive to their own 
empathic response, assuming that they 
share the same codes with the other 
about whom they are making an in- 
ference. Such empathic methods place 
great stress, of course, on the observer 
and the persons observed having the 
same cultural and language background 
and argue for the use of trained native
observers in doing work in foreign cultures.

Interacts in Which the Inferred In- 
tent of P Has an Inferred Effect on O. 
The full complexity of the operations 
involved in recording interpersonal be- 
havior becomes clear when we analyse 
a unit in which the observer reports, 
for example, that “P responded with a 
(hidden) anxiety to the aggressive in- 
tention of O.” There is no doubt that 
we make judgments of this double- 
pronged, two-person, double-inference 
kind in everyday life and that we do it 
with some (unknown) degree of 
fidelity. But it is also quite true that we 
make a good many errors in often un- 
known places. It may also be true that 
we make fewer errors than some people 
think we do; such, in fact, would be 
the present writer’s bias. Such con- 
siderations, however, provide only a 
background for a scientific strategy but 
can never replace a strategy in which 
the degree of error, as well as the in- 
formation used for the inference, is 
known. 

In a sense, the scientifically trained 
observer is given more to go on than 
the participant in interaction has. He 
is, for example, permitted to remain 
detached from the ongoing action. He 
is, however, also called upon to provide 
more than the participant usually has 
to do. He must state the basis for 
his inferences and, preferably, the 
further evidence that can provide a 
check on the name he gives to the in- 
ference—the further implications of its 
substantive content, so to speak. The 
raw data in a behavior protocol are 
only as good as the information used in 
recording them, and this in turn has
import only insofar as the inference
is shown to be useful in ordering in- 
terpersonal phenomena so that they 
bear on our theories and assumptions. 
It is probably the best strategy, there- 
fore, to have the observer record as 
fully as possible the information avail- 
able about P and about O from which 
an inference is made. He can then also 
record, probably in parentheses, the 
inference that comes to mind. The final 
inference can be made more system- 
atically by a coder, who can apply the 
kinds of rules that can, for example, 
make sure that the information regard- 
ing the intent of O is not also the only 
source of the inference to the effect 
on P and vice versa. The final judg- 
ment on the entire procedure is left to 
the outcomes of the research in which 
the theoretical import and degree of 
ordering of the data can be evaluated. 
The extra coding operation may be 
extremely costly, as we have mentioned 
before, and calls for costly verbatim 
records, which, in their turn, may keep 
the observer from noting some of the 
subtler cues that should go to make 
up the record. This may merely mean 
that nature happens to be a rather diffi- 
cult task-mistress, who requires much 
in payment for relatively little. Inven- 
tiveness is desperately needed in order 
to cut down the exorbitant cost of 
obtaining good behavior records. 
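
The coder's rule described above can be sketched in a few lines. This is only an illustration under stated assumptions: the `Inference` record, the `shares_only_source` function, and the evidence labels in the usage note are hypothetical names introduced here, not part of any published coding scheme.

```python
from dataclasses import dataclass, field


@dataclass
class Inference:
    """One inference in a protocol and the classes of information it rests on."""
    label: str                                  # e.g. "intent of O" or "effect on P"
    evidence: set = field(default_factory=set)  # items of recorded information


def shares_only_source(intent: Inference, effect: Inference) -> bool:
    """Return True when one inference has no evidence beyond what the other uses.

    A True result flags the pair for the coder: a later correlation between
    the two judgments could reflect nothing more than the shared information.
    """
    if not intent.evidence or not effect.evidence:
        # An inference with no recorded basis cannot be checked, so flag it.
        return True
    return (effect.evidence <= intent.evidence
            or intent.evidence <= effect.evidence)
```

For instance, an effect inference resting only on "force of blow" would be flagged when the intent inference also used "force of blow", whereas adding an independent cue such as "P grimaces" to the effect inference (and keeping a separate cue for intent) would clear the pair.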

Quantification of Single Acts. Any
one or all of the foregoing distinctions 
may be used to attempt to define a 
unit single act. Once this is done in a 
clear way, then additional properties 
of the act may be scored. If the act is 
defined, for example, in terms of “its
visible effect or its usual effect or meaning”
[Heyns and Lippitt (1954) refer
to this as recording while taking the
role of the “generalized other”], then
the additional properties may include 
degrees or kinds of intensity, latency, 
intention, inferred effect, probability, 
etc. This is one approach to the quan- 
tification of action and often leads to 
the characterization of the single-act 
unit in ways similar to Gellert’s defini- 
tion: 


Any scorable action, terminated by in- 
terruption by O, or by changes of intent, 
direction, or additions of new ideas by 
P (i.e., “Let’s put on the shoes, and then
let’s play house,” is two units). If a series 
of interacts turn into a ritualistic, repeti- 
tive game, only the first two scorable units 
are recorded for each subject (1956, p. 
12). 


The last point is, of course, arbitrary, 
but it is clear, and the number of repe- 
titions may be recorded and explored 
as to its usefulness as a personality, 
diad, or group property. 

There are many other approaches to 
quantification. One may, for example,
take very short time samples, as did 
Bishop (1951), who recorded the state 
of mother-child interaction every five 
seconds regardless of whether a change 
such as that required by Gellert had 
occurred. She therefore emerges with 
a measure of the percentage of total 
time filled by a certain kind of activity 
by mother or child. This has the advan- 
tage of clarity in terms of time but 
makes it more difficult to rule out ritu- 
alistic repetitions as Gellert wishes to 
do. In the long run, issues of this kind 
must be decided by the scientific utility 
of the scores obtained. On this score, 
both Bishop and Gellert, as we shall see,
were able to obtain high reliability and
were also able to test important hypotheses
with their measures.
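
The time-sampling measure can be illustrated with a short sketch. It assumes that one state label is recorded at each five-second mark; the function name and the state labels in the usage note are hypothetical and are not taken from Bishop's category list.

```python
from collections import Counter


def percent_time_by_state(samples):
    """Return the percentage of sampled instants at which each state was recorded.

    `samples` is a sequence of state labels, one per sampling instant
    (for example, one label every five seconds).
    """
    if not samples:
        raise ValueError("no samples recorded")
    counts = Counter(samples)
    return {state: 100.0 * n / len(samples) for state, n in counts.items()}
```

Given the four hypothetical samples `["directing", "directing", "cooperating", "no contact"]`, the function reports "directing" for half of the sampled instants and each of the other two states for a quarter. A ritualistic repetition simply fills more instants with the same label, which is why this measure cannot by itself rule such repetitions out.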

We have reiterated in this chapter 
that considerations such as those in this 
section, which have to do with the
definition of the informational basis
for single-act scoring, are related to the 
problem of scoring interacts, but they 
do not solve it by any means. We shall 
discuss some recent attempts to deal 
more directly with this complex prob- 
lem. 




Typical Sets of Categories for 
Recording Interpersonal 
Behavior 


So far in this discussion we have at- 
tempted to create a sense of problem 
regarding the study of interpersonal 
behavior, we have discussed some of 
the possible kinds of interaction units 
that have been or could be studied, we 
have outlined some strategic questions 
involved in planning research of this 
kind, and, finally, we have looked at 
close quarters at the kinds of units out 
of which interaction categories can or 
have been built. It is now time to list 
some of the categories with which re- 
searchers in child development have had 
experience. Our aims here are not only 
to provide useful categories for further 
research but also to show how some 
of these categories represent the dis- 
tinctions we have made so far in this 
chapter. 

It is not our aim to give an exhaustive
list of categories but rather to illustrate
the issues involved in using
them in research. We shall also point
out in passing the kinds of criteria that
should be considered in using the cate- 
gory systems we shall outline or in de- 
veloping new category systems. 

In our discussion of the kinds of con- 
crete information for which observers 
may be set to record, we focused upon 
particular kinds of information. We 
discussed expressive acts or movements 
as distinguished from the visible effect 
of an act on someone else. Some kinds 
of interpersonal or personality theory 
might require that a category set should 
be consistently defined in terms of only 
one or another of the kinds of infor- 
mation available to the observer. In 
that case consistency is certainly a vir- 
tue. We see no reason, however, to 
criticize empirically derived or, for 
that matter, theoretically derived cate- 
gory sets merely because they do not 
remain consistent in this sense. In fact, 
it will be interesting to comment on 
the mixtures of such information that 
creep into various categories in the sets 
we shall describe. The close and inti- 
mate contact with interpersonal be- 
havioral phenomena, which comes from 
hundreds of careful observations, is a 
positive source of possible inductive 
truth that any empirical scientist must 
respect. In fact, theories about what 
information should be accepted or re- 
jected in this realm are probably still 
less powerful than the detailed infor- 
mation that emerges from direct obser- 
vational experience. As we have men- 
tioned above, it is still incumbent upon 
a scientist to state as fully as possible 
the classes of information used in a 
study. 

Gellert (1955) has listed seven major 
divisions into which observational research
in child study has fallen in approximate
order of the emphasis they
have received. Her breakdown is in- 
teresting: (1) affiliation (sociability, 
friendship, popularity, ethnocentric- 
ity); (2) aggression (conflicts, anger); 
(3) ascendency (rivalry, demands); 
(4) conformity and cooperativeness 
(compliance, helpfulness); (5) fear 
and insecurity (tension, clinging tend- 
ency); (6) extraversion-introversion; 
(7) sympathy and social sensitivity. 
She suggests that the emphasis in re- 
search to date on these particular di- 
mensions of interpersonal behavior has 
been determined by such matters as the 
relevance to broad theoretical interests, 
the pressure of educational, social, and 
clinical problems, by the fact that these 
behaviors command attention, and 
finally by expediency. She ponders the 
further question of such uniformity in 
the variables that have been investi- 
gated and bemoans the lack of imagi- 
nation so far displayed in this sphere. 
We feel similarly and wish to make 
clear that the category systems that 
follow should not be assumed in any 
sense to be tools around which even 
a major portion of future research 
should revolve. The area of interper- 
sonal behavior in children has hardly 
been tapped, and we hope our pre- 
ceding discussion has helped to make 
this fact more salient. 


THE RELATION OF STRATEGY 
TO CATEGORY SYSTEMS 


Many of the situations and strategies 
we have discussed are those in which 
a child’s interpersonal behavior might 
be studied. We have distinguished between
the use of a priori categories and
those used to code a fairly verbatim 
protocol. Presumably, either of these 
categories could be used to record ag- 
gressive single acts, aggressive interac- 
tions, aggressive activities, or aggres- 
sive relationships. Presumably, each of 
these might be used to study sociologi- 
cal (or cultural) aspects of aggressive 
interpersonal behavior (in which P and 
O are replaceable in principle, except 
that they may have some property such 
as a role or status relation to one an- 
other or a friendship relation) or the 
more directly interpersonal aspects (in 
which at least P, and possibly O and 
TP as well, is in an important sense not 
replaceable). The last category would 
also include the very frequent case in 
which a set of categories has been 
worked out for the purpose of record- 
ing the interaction of a child with his 
parent or with his teacher or psychi- 
atrist but under the conditions that 
both remain unreplaceable during the 
observations. Finally, we have outlined 
two possible tactics of observation, the 
man-to-man method as compared to 
the zone method, and we have named 
a number of kinds of information for 
which an observer might be set. 
These distinctions can help to classify 
studies in which children are observed, 
and they may even prove useful to 
those who are planning research in this 
area by helping them to make the data 
fit more closely their theory or ques- 
tion of interest. They are not neces- 
sarily useful in toto for classifying 
category schemes. This is because a 
given category scheme (a priori or a 
posteriori) might be used for a good 
many or even all of these purposes. If
we permute some of the major distinctions
in the foregoing paragraph, we
emerge at one end with a study “focused
on sociological single acts based
upon an inference regarding the visible
or usual effect of the acts of P with
the use of a zone observation tactic,”
whereas at the other end we emerge
with a study “focused on interpersonal
interactions based upon information regarding
the instigations directed toward
P, his response (visible or usual
effect of same), and the response of
O (or of TP) to the behavior of P
recorded by a man-to-man tactic.” It
is quite conceivable that the same cate- 
gory set could be used in both of these 
otherwise quite different studies. If, for 
example, the content of interest were 
aggression, then the same categories 
could be used for recording the be- 
havior of the two kinds of P’s in these 
studies. It might also be the case that 
the categories for recording instiga- 
tions directed at P would be the same, 
except for a residual category for non- 
aggression, because of the broadly re- 
ciprocal quality of aggression interac- 
tion. Finally, the effect acts (the coun- 
ter-response of O or the intervention 
of TP) would be largely the same ex- 
cept for the same residual category. 
There is considerable independence 
between the kinds of categories one 
uses and the other parameters that de- 
scribe a study. Consider the category 
system of Bales (1950). It is basically a
system established on information regarding
the usual effect (i.e., meaning)
of the actions of P and is most often
used to record the frequency of the
single acts of a group of essentially
replaceable P’s. It is possible, however,
to record interpersonal single acts by
the simple expedient of keeping track
of who “sends” the single act and who
“receives” it, and this is often done in
fact. The category sets could also probably
be used in some cases to record
units of interacts by keeping the records
so that instigation-reaction-effect
reaction could be retained as a unit.
Perhaps some changes would be necessary
for this last purpose. We shall therefore
discuss some category systems in
terms of the content of the phenomena
recorded, but we shall also point out
some of the uses to which the category
system has been put according to our
other dimensions. We shall be particu- 
larly careful to point out the content 
and strategy limitations of present ex- 
isting category systems. 
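
The expedient of noting who “sends” and who “receives” each single act amounts to a who-to-whom tally. The sketch below is a minimal illustration; the record layout, the function name, and the category labels in the usage note are assumptions made here and do not reproduce Bales's category set.

```python
from collections import Counter


def tally_acts(records):
    """Count single acts by (sender, receiver, category).

    `records` is an ordered sequence of (sender, receiver, category) triples,
    one per scored act.
    """
    return Counter((sender, receiver, category)
                   for sender, receiver, category in records)
```

With the hypothetical records `[("A", "B", "asks"), ("A", "B", "asks"), ("B", "A", "answers")]`, the tally holds 2 for `("A", "B", "asks")` and 1 for `("B", "A", "answers")`. Because the input order is kept in the record list itself, the same records could in principle be regrouped into instigation and reaction sequences rather than simple frequencies.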


OBSERVATION OF AGGRESSIVE 
INTERACTION 


Faigin (1958) has used a set of categories
to study the frequency of aggressive
behaviors displayed by young
children of two different Kibbutzim
in Israel. The categories were marked
out prior to the beginning of observation,
and time samples were taken of
the behavior of each individual child
in ten- or fifteen-minute time units.
She obtained, in all, three hours of systematic
observation on each child, the
time periods being allotted to the various
activities in which the child spent
his day: free play, organized play,
meal time, and getting up in the afternoon
period. The observer reliability
of the categories of aggressive responses
ranged from 75 to 95 per cent, using
this formula: per cent agreement is
equal to the number of agreements of
two observers divided by the number
two observers divided by the number 
of agreements plus the number of dis- 
agreements. When the subcategories 
were totaled to provide a total score, 
the agreement was found to be 92 per 
cent. 
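
The agreement formula just stated is simple enough to write out directly. The sketch below follows the wording in the text; the function name is introduced here for illustration, and the counts in the usage note are hypothetical, not Faigin's data.

```python
def percent_agreement(agreements: int, disagreements: int) -> float:
    """Per cent agreement = agreements / (agreements + disagreements) * 100."""
    total = agreements + disagreements
    if total == 0:
        raise ValueError("no scored units to compare")
    return 100.0 * agreements / total
```

For example, two observers who agree on 46 scored units and disagree on 4 would obtain `percent_agreement(46, 4)`, which is 92 per cent. Note that the formula takes no account of agreement expected by chance.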

Faigin’s study is of interest partly 
because of the relatively young age of 
the children, the range being 19 to 38 
months at the time when the study 
was begun. The categories she used 
were (1) physical aggression, which 
she defined as hitting for the purpose 
of hitting or hurting, apparently uti- 
lizing information regarding the inten- 
tion of the child; (2) instrumental ag- 
gression, which is defined in terms of 
hitting or pushing in order to obtain 
some other goal such as a toy, play 
space, or attention, apparently utiliz- 
ing information regarding the visible 
effect or usual effect of hitting or push- 
ing movements; (3) defense of toys; 
(4) defense of self; (5) aggression to 
objects; (6) aggression to smaller chil- 
dren; (7) verbal aggression; (8) dis- 
obedience. In her research report Faigin 
does not go into the definitions used 
for more than the first two categories, 
partly because the frequency of any 
but the first two was so small that the 
main effect was their lack of occur- 
rence, possibly due to the extreme 
youth of most of her subjects. She also 
found that half the children expressed 
very little or no physical aggression 
and that 18 out of the 31 children dis- 
played almost no instrumental aggres- 
sion. These categories produced a con- 
siderable range of individual differences 
but showed no differences between 
Kibbutzim, which led to the interpretation
that the instigation to aggression
is due to factors other than, or in
addition to, the frustrations inherent
in group living, since the social and
cultural environment of the Kibbutzim
differed considerably. It is also interesting
that among children of the age
Faigin studied there was a significant
negative relationship between the total
aggression scores and those for dependency.
Other studies, with older
children, tend to find a low or medium 
positive relationship. This leads Faigin 
to the suggestion that at this earlier age 
dependency and aggression are alter- 
native responses to frustration and that 
the children who have been successful 
in using one or the other type of re- 
sponse seem to persist in its use. 

An important study of children’s interpersonal
aggression is the Iowa study
of Sears et al. (1953). Among other
measures taken on the 40 children (21
boys, 19 girls) involved in the study, 
each child was observed for 16 fifteen- 
minute periods and his aggressive be- 
havior was recorded in six categories. 
These categories are fairly homogene- 
ously defined as being in “the nature 
of instrumental acts designed by the 
child to place himself in such contact
with another person as to provide 
gratification of the acquired drive of 
aggression.” The categories utilized for 
aggression were (1) injury to person 
(including hitting someone, getting 
someone punished, damaging another's 
property, or derogating status). This 
category appears to refer to the physi- 
cal effects of a child’s actions, except 
for the last example in which judgment 
must be made of verbal behavior on 
the basis of its usual effect or meaning
to the recipient. (2) Discomforting another
(including threatening gestures
and language); (3) insuring compliance 
with demands; (4) destruction of in- 
animate objects or constructions (in 
the form of displaced aggression or 
nondirected aggression); (5) having 
object in own possession (taking things 
away from another); (6) removal of 
immediate or anticipated frustration 
(removing barriers, minimizing injuries 
from another person, maintaining 
status, as in “saving face,” taking back 
an object that has been previously taken 
away by someone else, and retaliation). 
It is interesting to note that these cate- 
gories are delicately worded in terms 
of the change in the environment that 
follows behavior and that this corre- 
sponds to a basic element in interac- 
tion analysis, as displayed in Fig. 20-1. 
The attempt here is to capture the num- 
ber of times the child maneuvers a 
change in the environment, which puts 
him close to a theoretically important 
goal response. There is little doubt that 
a great deal of information from vari- 
ous aspects of the context is necessary 
for making the judgments called for in 
these categories. However, by using 
the same formula as Faigin did to cal- 
culate reliability, Sears et al. were able 
to report that the mean agreement on 
the six aggression categories was 86 
per cent, with a range of 67 to 100 per 
cent. The agreement on the total aggression
score was 91 per cent.

The interest of the observers in the 
Sears study was not sociological, but 
individual and psychological. They de- 
sired to use the frequency of aggres- 
sion as an index of the strength of a 
child’s aggression drive. The categories
do not systematically take instigation
into account, nor is any additional scor- 
ing of the length of an act or of its 
intensity attempted. The authors, how- 
ever, were not unaware of the need for 
control of instigation to aggression. 
They attempted to make the condi- 
tions of observation as comparable for 
each child as possible. They propor- 
tioned the observations according to 
the total time spent by preschool 
groups in settings of free play, story 
and music, and art. They also at- 
tempted to ensure that the conditions 
would allow for maximal interaction 
among the children and teachers, with 
minimal interference on the part of 
the teachers on the action of the chil- 
dren. 

Two additional methodological re- 
marks seem called for with regard to 
this category system. The first is that 
a correspondence of +.64 was reported 
between the total frequency of ob- 
served aggression and the combined 
scores on ratings of aggression provided 
by teachers. The second is that the 
aggression scores correlate highly with 
a rating of the activity level of a child, 
and the authors report that the direct 
observations (since they were pure fre- 
quency measures) were more influenced 
by activity level than were the teach- 
er’s ratings. The last two points are 
based on the data for the boys in the 
sample, but they also hold with some- 
what less clarity for girls. These re- 
marks provide a basis for the sugges- 
tions we made earlier in this chapter 
with regard to providing residual cate- 
gories in observation, which will give 
an index of activity level, or to achieve 
somewhat the same effect with a rating.
The Sears study had the positive
value of being theoretically guided so 
that it is possible to report that the 
validity of the observations is some- 
what strengthened by the fact that 
these indices behaved, at least to some 
degree, the way the theory called for. 


OBSERVATION OF DOMINANT AND
SUBMISSIVE BEHAVIOR


Dominant or ascendant behavior has 
been studied a number of times by di- 
rect observation methods. Some of the 
important studies in this regard have 
been those of Jack (1934), Page (1936), 
and Anderson (1937, 1943). But our 
present discussion will be limited to 
the more recent work of Gellert 
(1956), both because she built upon 
the work of past students of these 
classes of behavior and because the cri- 
teria by which she judged the fruitful- 
ness of her category system can pro- 
vide some useful hints for future work 
in other areas of interpersonal behavior. 
She outlines the criteria she used for 
selecting the variables for her study, 
suggesting that behavior systems should 
have the following properties: 


1. The behavior systems should be 
universal. They should not be peculiar 
to a particular kind of social situation 
nor to a special group nor to a stage of 
genetic development. 

2. The behavior systems should have 
attained a demonstrable degree of con- 
solidation. 

3. It should be possible to divide the 
behaviors described into comparable 
units.

4. Instigations and opportunities for
exhibiting the acts to be studied should 
be numerous and, if possible, control- 
lable. 

5. The behaviors in question should 
be overt and maximally unequivocal. 

6. For the sake of economy, the be- 
haviors studied ought to have theo- 
retical interest beyond the study of 
variability, per se. 


She evaluates the dimensions for 
variables of dominance and submission 
systems in the light of each of these 
criteria. Gellert’s treatment of the sec- 
ond and third criteria in the foregoing 
list is particularly interesting, as is the
fact that her category systems have 
been used to record both free play ob- 
servations and observations of children 
under experimental conditions. We 
shall return to these points after out- 
lining her category systems for domi- 
nance and submission. Her preliminary 
a priori category systems follow, being 
partially quoted: 


Domination Categories 


Mandate, positive. Spontaneous order or 
suggestion: “Come here,” “Let me take 
it,” “Put it down.” 

Mandate, negative. “Don’t,” “Stop,” 
when they are used as an order rather 
than in the context of an altercation or as 
defense against clear domination on O's 
part. 

Dominates play; instructs. Attempts to 
gain precedence in play; pushing O out 
of the way; changing what O has just 
built; grabbing toys O is using in order to 
use them; initiating rules or referring to 
real or contrived “adult established rules” 
in order to direct play; also verbal threats 
to gain control; also instructions; also giv- 
ing permission or assigning roles in play. 

Suggests orientation. This is a gentler,
or more genteel, form of domination than
the preceding ones. It includes suggestions
to which qualifications such as “please,”
“Would you like,” or “Let’s”
are appended or prefixed.

Countermandate. Domination in response
to domination, noncompliance, or frustration
occasioned by O. Any scorable form
of domination or noncompliance on O’s
part may provide a stimulus to a countermandate.
Also, conditional surrender: “I
. . . you . . . ,” “I’ll give it to you
. . .”; also, counter-suggestions; also
counteraggression, counterteasing, counterboasting,
etc.

Calls attention. “Look at . . . ,” “See
what I made,” “Hey. . . .”

Boasts. Favorable comparison of self
with other child; also of self’s prowess,
property, or creations.

Teases. Initiating a threatening gesture
“in fun”; name calling; derogation
of O or of O’s property or efforts.

Aggression, per se. Physical aggression,
when it seems to be spontaneous or mainly
with intent to hurt rather than in the
course of conflicting aims in play, etc.

Noncompliance; self-defense. Resistance 
to O's domination, per se, not in conjunc- 
tion with attempts to counterdominate. 
Persistence in the direction of own activity 
in the face of O’s attempt to change it; 
defense of self, property, or rights. 


Submission Categories 


Comply; submit. Complying in response 
to O’s directions or domination or bid 
for attention; obeys or accepts O’s imposi- 
tions passively, desists, gives in; concedes 
actively in response to O’s domination; 
does not defend self, property, or rights. 

Agreement, verbal. Verbal concession, 
not accompanied by active compliance of 
any other sort, in response to domination. 

Asks permission, directions, or orientation.
Almost any question is included here.

Imitates. Spontaneously copies what O
is doing or saying. 

Withdraws. As a result of conflict or 
dominance of O, P leaves play area, or 
what he is playing with, without expressing
objections.


Having devised these categories for
the observation of children, Gellert 
considered the next logical task, that 
of testing the psychological coherence 
of each of the sets of categories, a cri- 
terion that relatively few sets of cate- 
gories have ever had to face. She did 
this in the following way: 


The best test of common motivation for 
a series of acts is to vary the conditions 
that are thought to evoke them. For 
instance, if increased domination by O 
raises the degree of P’s submission, there 
should tend to be an increase, for P, of 
unit acts in all the categories listed under 
submission (1956, Appendix). 


In the case of dominance, one of the 
main findings of her study was that as 
O's relative dominance decreases P’s 
relative dominance increases. She there- 
fore compared the scores of P’s when 
paired in a partly structured experi- 
mental situation with more dominant 
O’s with the scores of P’s when paired 
with less dominant O’s and demon- 
strated a positive direction of change 
in all the subcategories of dominance. 
A similar test was applied to submis- 
sion. Another of her findings was that 
“as O’s dominance decreases P’s sub- 
mission score will decrease.” She found 
a consistent decrease in all subcatego- 
ries of submission when P was paired 
with a less dominant O than when P 
was paired with a more dominant O. 
The second method, a weaker one, 
which she used to check on the psy- 
chological coherence of her categories, 
was to correlate each category with
the total score of its behavioral system
taken as a whole. Although all the
correlations were in the expected direction,
there was a great deal of variation
found in the extent of the relationship.
She therefore retained only
those categories that correlated at a 
level of significance of 5 per cent or 
less with the over-all scores. This led 
her to drop calls attention and aggres- 
sion and countermandate and noncom- 
ply from the categories that made up 
her composite dominance score. Coun- 
termandate and noncomply were con- 
sidered together under a third heading, 
resistance, and viewed as a separate behavioral
system in testing the substantive
hypotheses of her study. On the
same grounds, Gellert dropped the 
category imitate from the submission 
heading and also deleted withdraw be- 
cause it was too rare to be useful. 
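
The category-to-total check described above can be sketched as follows. This is an illustration only: it assumes that the total for each child is the plain sum of that child's category frequencies (so each category is itself part of the total it is correlated with), and the function names are introduced here. Whether a given correlation reaches the 5 per cent level would still have to be judged against a significance table for the number of children observed.

```python
import math


def pearson_r(xs, ys):
    """Product-moment correlation between two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 3:
        raise ValueError("need two equal-length sequences of at least 3 values")
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("a constant sequence has no defined correlation")
    return sxy / math.sqrt(sxx * syy)


def category_total_correlations(scores):
    """Correlate each category with the summed score of its behavior system.

    `scores` maps a category name to a list of per-child frequencies,
    with every list in the same child order.
    """
    n_children = len(next(iter(scores.values())))
    totals = [sum(values[i] for values in scores.values())
              for i in range(n_children)]
    return {name: pearson_r(values, totals) for name, values in scores.items()}
```

A category whose correlation with the total is low or unreliable is a candidate for removal from the composite or, as with resistance, for treatment as a separate system.
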
Gellert made a consistent effort 
throughout her observations to dis- 
criminate between the intent and the 
effect of each act. Behavior was always 
scored from the point of view of the 
actor rather than the recipient of an 
interact; thus an attempt on the part 
of P to take away O’s toy was scored 
as dominates play, whether it was successful
or not. The average interobserver
reliability between Gellert and her two
assistants was .81 and .84, respectively,
the interobserver reliability being computed
by this formula: twice the number
of agreements divided by twice
the number of disagreements, plus
A’s omissions, plus B’s omissions. The
omissions were unit acts recorded by
one observer but not by the other.
Agreements were the number of scores
per category that were recorded by
both observers. These reliability data 
were obtained when observers were 
watching two children in the experimental
play situations. The category
system was also designed to record the
behavior of only one child at a time
in samples obtained in the free-play
period in the children’s nursery school.
Since only 15 of the 50 children who
took part in the experimental observations
were also recorded in the free
play, the comparison of findings is unfortunately
not very meaningful. However,
the systematic observations in
conjunction with ratings and with the 
systematic experimental pairing of chil- 
dren for comparison purposes per- 
mitted Gellert to test several hypothe- 
ses concerned with general principles 
of interaction. 

These hypotheses have much to do 
with the implications of Fig. 20-1 in 
that they show quite clearly and explicitly
some of the ways in which the
overt or scoreable behavior of P is a 
function of social-setting factors and 
of the properties of O and TP. Since 
Gellert worked within a very restricted 
cultural group (unless we view families 
or groups of families as having differ- 
ent cultures, as Roberts (1951) has sug- 
gested), she was unable to vary these 
factors systematically. 


OBSERVATION OF DEPENDENT 
BEHAVIOR 


A number of category sets have been 
developed for the observation of de- 
pendent behavior in children in either 
laboratory or everyday-life situations. 
Sometimes these categories have been 
developed for observations of diads; 
for instance, when the child is inter- 
acting with one parent or with a teacher 
or psychiatrist. Dependency categories 
of this kind are illustrated in a subsequent
section.

The Iowa study of Sears et al. (1953)
called for the observation of dependency
according to five categories,
which are analogous to those discussed
for aggression. In fact, observation was
done with both sets of categories as
part of the set of the observers. The
dependency categories were also “of
the nature of instrumental acts designed 
by the child to place himself in such 
contact with another person as to pro- 
vide gratification of the acquired drive 
of dependency.” The five categories 
were as follows: (1) touching or hold- 
ing, in which the child is conceived as 
displaying evidence of dependency 
drive when he behaves in such a way 
as to secure touching or holding from 
another person; (2) being near; (3) 
securing positive attention (including 
praise, talking, playing such an insti- 
tutionalized dependent role as the baby, 
inviting cooperative activity, asking 
questions unrelated to ongoing activity, 
giving unasked for information); (4) 
securing reassurance, information, or 
consolation (including apologizing, 
asking permission, voluntary conform- 
ity, asking protection, asking for help 
or guidance); and (5) securing nega- 
tive attention (by disruption and ag- 
gressive activity and defiance). A to- 
tal dependence score for each child was 
secured by summing the frequencies of 
the foregoing categories for 16 fifteen- 
minute periods. The mean interjudge 
agreement on the five dependency cate- 
gories was 91 per cent, with a range 
of 76 to 100 per cent. The agreement 
on the total dependency score was 94 
per cent. 
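
The summation described above can be sketched in a few lines of Python. This is only an illustrative sketch: the category keys, the function name `total_dependency_score`, and the sample frequencies are assumptions introduced here and are not taken from Sears et al.

```python
from collections import Counter

# Short keys for the five dependency categories named in the text.
# The key spellings themselves are hypothetical.
CATEGORIES = (
    "touching_or_holding",
    "being_near",
    "positive_attention",
    "reassurance",
    "negative_attention",
)


def total_dependency_score(periods):
    """Sum category frequencies over all observation periods for one child.

    `periods` is a list of dicts, one per fifteen-minute period, mapping a
    category key to the number of times it was scored in that period.
    Returns the grand total and the per-category totals.
    """
    totals = Counter()
    for period in periods:
        for category in CATEGORIES:
            totals[category] += period.get(category, 0)
    return sum(totals.values()), dict(totals)


# Hypothetical record: 16 identical periods, purely for illustration.
example_periods = [{"being_near": 2, "positive_attention": 1}] * 16
score, by_category = total_dependency_score(example_periods)
```

Keeping the per-category totals alongside the grand total makes it possible to examine a single category, such as negative attention seeking, separately from the composite.
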

In this group, in which the ages
ranged from 3 years, 4 months to 5 
years, 5 months, there was a significant 
difference in the frequency of depend- 
ency behaviors, more dependency to- 
ward teachers being shown by girls 
than by boys. A similar difference was 
found for the rated dependency meas- 
ures. The correlations between ob- 
served total dependency and the rat- 
ings of dependency were only +.44. 
The observed dependency score cor- 
related, as did the aggression scores, 
with the rated activity level of the 
child. In the case of the girls, there 
was a zero correlation between the 
total rated dependency and observed 
dependency. The problem in this case 
appears to reside in the judgment of 
the teachers with regard to negative 
attention-seeking on the part of girls. 
To quote the authors’ interpretation: 
“It is as if the teachers, who were
women, saw negative attention-getting 
girls as being aggressive but not de- 
pendent, while they saw such boys as 
being both aggressive and dependent.” 
(Sears et al., 1953, p. 158.) Although 
the authors worry about the inexactness 
of the dependency observations when 
viewed as indices of the drive con- 
struct to which the theory referred, 
the results based on these measures were 
to some extent consistent with what 
the theory predicted. 

Gewirtz (1956) has developed more 
pointed categories for categorizing 
specifically the attention seeking of 
young children in a free-play experi- 
mental setting in which only one adult 
was present and the children were en- 
gaged one at a time in the activity of 
painting. Despite the fact that the categories
were developed to some extent
for use in this limited situation, we in- 
clude them here because they repre- 
sent one of the directions in which 
interests in child-observation study will 
go in the future—toward more careful 
specification of more restricted aspects 
of behavior systems. Gewirtz devel- 
oped six observational categories for 
scoring the attention-seeking behaviors 
of the children, three of them for the 
purpose of capturing verbal behavior 
and three for the capturing of non- 
verbal behavior. We include here brief 
definitions and examples of the cate- 
gories. 


1. Comments. Included in this cate- 
gory were casual remarks (never ur- 
gent), which usually required no re- 
sponse from the adult. “Typically the 
child made his remark and either just 
continued on to another remark or re- 
sumed his painting.” 

2. Questions. This category included 
casual questions, “never urgent, which 
required only brief replies from the 
adult.” 

3. Attention seeking. “Included were 
responses characterized by urgency, ap- 
parently designed for active notice 
from the adult. The adult’s overt re- 
sponses indicated, in a manner appro- 
priate to the child’s request, that he 
was attending to the child.” 

4. Glances. “Included were the 
child’s glances and stares at the adult, 
which involved no verbal accompani- 
ment, required no responses from the 
adult, nor initiated sequences of inter- 
action when it was apparent to the 
child that his glance had been noticed 
by the adult.” 

5. Paintings. “When a child informed
the adult that he had completed
a painting, the adult removed it from 
the easel, attached a blank sheet of 
newsprint in its place, and wrote the 
child’s name in a corner of that sheet. 
The number of paintings on which the 
child worked during the session was 
recorded.” 

6. Time. This category refers to the 
time that elapsed from the moment the 
subject first arrived at the easel until 
he stated that he no longer wished to 
paint. 


Gewirtz reports the usual indices of 
percentage observer agreement for 
these categories, and he also reports 
computations of percentage observer 
agreement when disagreement included 
those of omission as well as of com- 
mission. A disagreement of commis- 
sion is indicated when two judges as- 
sign a particular event to different cate- 
gories, whereas a disagreement of omis- 
sion refers to the case in which one 
observer assigns a particular event to 
a given behavior category and the other 
omits that event entirely. The per- 
centage-agreement figures are not ex- 
tremely high but seem adequate for 
the purposes of Gewirtz’ study (Ge- 
wirtz, 1956, p. 21). He further reports 
the correlations of the scores obtained 
for the six dependence variables for the 
56 children in his sample. The corre- 
lations show two clusters of measures, 
the first involving reliable intercorre- 
lations among the three indices of 
verbal attention seeking and the sec- 
ond involving reliable intercorrelations 
among the three nonverbal indices. The 
only index showing reliable correlation 
in both clusters was the number of
paintings done by a child in a given 
sitting. It may also be useful to point 
out that category 3, attention seeking, 
proved to be most sensitive in reflecting
the effects of the independent variables
employed by Gewirtz in his experimental
design.
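
The distinction between disagreements of commission and of omission can be made concrete with a small Python sketch. The data layout (each observer's record keyed by an event identifier) and the two percentage formulas are assumptions made for illustration; the text does not give Gewirtz's exact computation.

```python
def agreement_summary(record_a, record_b):
    """Compare two observers' records keyed by event identifier.

    Each record maps an event id to the category that observer assigned.
    An event present in only one record is a disagreement of omission;
    an event present in both with different categories is a disagreement
    of commission.
    """
    agreements = commissions = omissions = 0
    for event in set(record_a) | set(record_b):
        if event not in record_a or event not in record_b:
            omissions += 1
        elif record_a[event] == record_b[event]:
            agreements += 1
        else:
            commissions += 1

    def pct(numerator, denominator):
        return 100.0 * numerator / denominator if denominator else float("nan")

    return {
        "agreements": agreements,
        "commissions": commissions,
        "omissions": omissions,
        # Assumed index: omissions are ignored.
        "pct_commission_only": pct(agreements, agreements + commissions),
        # Assumed index: omissions count as disagreements.
        "pct_with_omissions": pct(
            agreements, agreements + commissions + omissions
        ),
    }


# Hypothetical records for four events in one painting session.
observer_a = {1: "comment", 2: "question", 3: "attention", 4: "glance"}
observer_b = {1: "comment", 2: "comment", 3: "attention"}
summary = agreement_summary(observer_a, observer_b)
```

Counting omissions as disagreements can only lower the percentage, which is why the second index is the more conservative of the two.
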

These categories may not be directly
useful in all studies of dependency,
since they were constructed for an extremely
limited interaction situation.
However, it is just such categories as
these that have reasonable reliability and
are related in a reasonable way to experimental
variations in settings that
may provide rich categories in future
studies.


OBSERVATION OF SYMPATHETIC 
BEHAVIOR 


One of the classical studies in inter- 
personal behavior, and a study of great 
yield, is the analysis of sympathetic be- 
havior in children’s play groups by 
Lois Barclay Murphy (1937). This 
study ranged from extensive explora- 
tory observation of children, mostly 
between 2 and 4 years of age, to ex- 
periments or “framed situations,” 
which increased the frequency per unit 
time of the phenomena of interest. In- 
terviews with parents, parental records, 
and ratings were also used at one point 
or another. 

The observation method used was 
that of objective diary records of re- 
sponses of P to distress situations oc- 
curring spontaneously in the play 
groups. Because of the low frequency 
of occurrence of “distress situations,” 
it was felt that it was possible to obtain
protocols of all occurrences of
these situations and to record the “to- 
tal setting” of the distress, the behavior 
of the “stimulus children,” the response 
of the other children, and teacher sug- 
gestions or interference that might 
have a bearing on the behavior of the 
children who responded to distress sit- 
uations. In short, there was an attempt 
to record almost all of the elements out- 
lined in Fig. 20-1, which were relevant 
to “distress” in O and the subsequent 
behavior of P and TP. We do not know 
to what extent other instigating situa- 
tions, which also lead to “sympathetic” 
behavior on the part of P, were over- 
looked because of the focus on “dis- 
tress” instigations. But the long quali- 
tative exploratory procedure probably 
led to a broadening of the meaning of 
“distress situation” and possibly some 
refinement of the notion of “sympathy 
response” to provide a scientifically 
valuable denotation of an important in- 
stigation-response unit, at least for some 
purposes. 

Murphy gives us the following clas- 
sification of “the most characteristic 
stimuli for sympathetic responses 
among the preschool children.” 


Physical cause. (1) Accident; (2) at- 
tacked by child; (3) physical discomfort 
(cold, uncomfortable position, caught in 
play apparatus, etc.). 

Mental distress. (1) Toy snatched or 
threatened; (2) play hampered or in- 
truded upon; (3) disciplined; (4) separated 
from mother, father, etc.; (5) fear. 

Emotional expression without knowl- 
edge of stimulus. (1) Crying; (2) holds 
hands over face; (3) pained expression— 
inhibition of crying. 

Evidence of injury without evidence 
of present pain. Sore lip, mercurochrome, 
bandage, etc. 

Wishes, needs. (1) Expressed wish (“I
want... ,” “Help me,” or some action 
that reveals want or need); (2) unex- 
pressed wish or need, assumed or implied; 
(3) precarious situation (need of care). 

Adult in distress or want. Chiefly physi- 
cal danger or physical need. 

Animal in distress or want. Physical 
danger, need such as hunger, real or sup- 
posed attack by another animal or human 
being. 


Murphy’s corresponding list of responses
to the “sympathy stimuli” follows:


Helps. (1) Helps out of physical distress 
situation—picks child up after falling; (2) 
helps with play—gives toy, plays with 
child, pushes swing, helps climb. 

Removes or attempts to remove cause 
of distress. (1) “Stop it!” “Don’t!” to 
aggressor of other child; (2) pushes away 
child attacking another (when this solves 
difficulty); (3) explains child’s attitude to 
adult causing resistance; (4) removes cause 
of physically distressing situation—re- 
moves a tricycle from fallen child, gets out 
of child’s way, uncatches child’s foot; (5) 
stops play when it is in opposition to other 
child’s need—e.g., stops jumping on board 
while child climbs off.

Comforts. (1) Pets, pats, hugs, kisses; 
(2) reassures, e.g., “I won’t hurt you”;
(3) expression of solicitude, e.g., “That’s 
too bad,” “That hurts, doesn’t it?” 

Punishes cause. (1) Attacks the aggres- 
sor of the other child before the aggressor 
has withdrawn, but not thereby remedy- 
ing the difficulty; (2) attacks aggressor 
of other child after difficulty; (3) speaks 
sternly to aggressor (after difficulty); (4) 
retaliates on aggressor (does to aggressor 
what aggressor did to first child). 

Protects, defends. (1) Defense of other 
child’s material; (2) “It belongs to... ,” 
“... was using that,” “You can’t have
that”; (3) helps other child hold toy 
against aggressor; (4) “Don’t hurt . . . ,”
“Don’t touch . . .” (before difficulty occurs
but is imminent); (5) retrieves
snatched property for other child; (6) 
explains other child’s wants to aggressor;
(7) verbal defense of other child: “He can 
get in,” “That’s not Jenny—she’s crying”; 
(8) protects from hurt—catches child as 
he falls. 

Warns. (1) Warnings of doubtful 
motive—“Look out,” “Watch out,” etc.; 
(2) sympathetic warnings: “You might 
hurt yourself,” “You might fall.” 

Tells adult or other child. (1) “Look 
at ...”; (2) “He’s crying”; (3) “He
wants. . . .” 

Questions. (1) “Why is he crying?” (2) 
“Did it hurt you?” (to child); (3) “What 
are you going to do about it?” (to 
teacher); (4) “What’s the matter?” (to 
animal or to a child). 

Anxious, disorganized. (1) Stares with 
anxious expression; (2) evidence of worry, 
shakes head, frowns, lips pressed together, 
etc.; (3) cries, whimpers. 

Suggests or effects solution. Tells adult 
to keep other child out of danger; tells 
child how to get out of the situation. 


She also categorizes certain “non- 
sympathetic” or egocentric responses 
to distress situations: 


Laughs. (1) Scoffs, criticizes; (2)
laughs, smiles; (3) contrasts own behavior 
(“I don’t cry”). 

Dramatic. Uses situation to play own 
role, usually dominating. 

Sadistic-destructive. (1) Attacks child 
in distress or his material; (2) attacks child 
who helps child in distress; (3) teases, e.g., 
makes loud noise at. 

Ambiguous. (1) Stares; (2) comments 
on situation (usually not addressed to 
anyone). 


To some analysts, of course, these 
categories cut across other ways of 
categorizing these behaviors. In case 
of controversies of this sort, only re- 
course to the kinds of criteria quoted 
from Gellert can lead to a suspension 
of argument. Besides, that which cross- 
cuts other ways of thinking, along with 
the categories that are so crosscut, may
continue to provide useful data. The 
main problem in the Murphy study is 
a limitation set up by the fact that 
although the main interest was in sym- 
pathetic behavior as a category of in- 
teraction, or at least as a phenomenon 
of interaction, it was basically observed 
as “merely” a stimulus-response entity,
“sympathy-expressing” behavior,
which was a response to “distress” in 
someone else. The limitations set up 
are, first, that only limited aspects of 
individual differences can be analyzed 
with this kind of unit and, second, that 
the full possibilities of interaction 
analysis cannot be explored. Murphy 
states the first limitation. 


Of course, inferences regarding the 
degree of sympathy of the children are 
difficult to draw, since we have no measure 
of the opportunities for sympathetic re- 
sponses; all we have is a rough statement 
of the proportion of the child’s nursery- 
school life which is occupied by the sorts 
of behavior we have grouped under this 
heading (1937, p. 122). 


Despite this limitation, Murphy de- 
veloped “observation scores” on sym- 
pathy by weighting the total number 
of sympathetic episodes recorded for 
each child for the hours observed. It 
is very interesting that these scores cor- 
related around +.80 with the (highly 
reliable) teacher’s ratings. Furthermore,
the odd-day, even-day reliability co- 
efficient for the observation scores was 
+.94 and +.99 for two different groups 
of children. Murphy herself is led to 
remark in a footnote (p. 163) that the 
“ratings on an inventory (of objective 
items of behavior) by teachers or ob- 
servers who have watched the children 
over a period of months may be more
valid than some of the procedures now 
in use of obtaining samples of behavior 
by so-called objective recording tech- 
niques.” Such a conclusion appears 
rather problematic. The high correla- 
tions between the rating scores and the 
observed (categorized) behavior may 
have been due to the fact that each was 
based on the raw frequency of the em- 
pirical occurrence of the sympathetic 
response class (itself highly correlated 
with activity level and therefore with 
“visibility”) rather than on some ratio 
of “opportunity” to be sympathetic to 
the number of sympathetic responses. 
In short, Murphy was very sensitive to 
the conditions for the instigation of 
sympathy, but “distress” was used as 
an aid in the recording of sympathy 
rather than as another parameter in it- 
self of the total situation. It is to the 
latter problem that some recent explo- 
rations have been turned. 
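
The contrast drawn here, between a raw frequency weighted for hours observed and a score that takes “opportunity” into account, can be illustrated with a brief Python sketch. The function names, the two hypothetical children, and the choice to express the ratio as responses per opportunity are all assumptions made for illustration; Murphy’s own weighting procedure is not specified in this passage.

```python
def rate_per_hour(sympathetic_episodes, hours_observed):
    """Raw frequency of sympathetic episodes weighted for hours observed."""
    return sympathetic_episodes / hours_observed


def opportunity_ratio(sympathetic_episodes, distress_situations_witnessed):
    """Sympathetic responses per distress situation the child witnessed.

    Returns None when the child had no recorded opportunity to respond.
    """
    if distress_situations_witnessed == 0:
        return None
    return sympathetic_episodes / distress_situations_witnessed


# Hypothetical children: one highly active and "visible," one quiet.
children = {
    "active_child": {"episodes": 12, "hours": 6.0, "opportunities": 40},
    "quiet_child": {"episodes": 4, "hours": 8.0, "opportunities": 8},
}

scores = {
    name: {
        "rate_per_hour": rate_per_hour(c["episodes"], c["hours"]),
        "opportunity_ratio": opportunity_ratio(
            c["episodes"], c["opportunities"]
        ),
    }
    for name, c in children.items()
}
```

In this invented example the active child has the higher hourly rate but the lower ratio of responses to opportunities, which is the kind of reversal that a frequency-based score cannot reveal.
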

Murphy’s study is extremely rich in 
information and description (as well as 
wisdom) regarding the behaviors that 
were “caught” in her categories. In all 
her considerations of sympathetic epi- 
sodes in what we have termed a so- 
ciological manner (comparison of fre- 
quencies in groups, according to age, 
intelligence, etc.) there are rich sug- 
gestions if the report is read with a 
sensitivity to the possible effect of ac- 
tivity level. The study is a landmark 
in the area of interpersonal relations. 


OBSERVING PARTICULAR DIADS 
OR TRIADS 


In our highly selective outline of 
typical category systems we have so 
far dealt mostly with systems that have
been constructed with a particular class 
of behavior in mind. In recent years, 
however, there has been an interesting 
tendency to develop category systems 
for the study of particular diads or 
triads, in which content of the cate- 
gories is important, of course, but in 
which the outstanding characteristic is 
that they are developed for handling 
intensively the behavior of two people 
at a time. A very impressive work of 
this kind is that of Barbara Merrill 
Bishop (1951), done under the direc- 
tion of Robert Sears at the Iowa Child 
Welfare Research Station. Bishop’s aim 
was to develop a category system for 
the study of parent-child relationships 
under varying degrees of controlled 
experimental conditions. The most 
elaborate set of categories is those for 
recording mother-child interaction, 
which are probably also useful for re- 
cording a child’s interaction with any 
other adult as well, if the hypotheses 
call for the contents that Bishop is in- 
terested in. But whether one is inter- 
ested in Bishop’s contents or not, the 
categories and their apparent research 
success provide an example of our in- 
creasing sophistication in handling sub- 
tle aspects of behavior in an objective 
manner. 

Bishop states the aims of her cate- 
gories: 

The categories had originally been 
selected partly on theoretical and partly 
on empirical grounds. From a theoretical 
standpoint it was desired to have cate- 
gories which would reflect accurately 
the different amounts and kinds of in- 
fluence the adult’s behavior had on the 
child, i.e., the stimulus properties of her 
behavior. From an empirical standpoint 
it was necessary to select categories that
were clearly definable, easily recorded in 
the intricacies of social interaction, and 
sufficiently comprehensive to express all 
possible behavior incidents that would 
appear during the play session (p. 8). 


Despite her general worries about 
rating methods, she has included rat- 
ing scales in several of the behavior 
categories in order to capture the sub- 
tler concomitants of the objective be- 
havior. We shall not include all of her 
rating distinctions, but in the following 
brief description of her behavior cate- 
gories we mention the kinds of dis- 
tinctions she made. 

Bishop’s categories for describing the 
mother’s behavior begin with three 
that describe the behavior of the child 
as well, then go on to eleven additional 
major categories reserved for only the 
mother’s behavior. The categories are 
(1) out of contact, where the mother 
is sitting apart from C and is either 
doing something on her own or silently 
watching; (2) contact, where M is in 
contact with C either verbally or 
physically or both; (3) playing inter- 
actively, where the mother plays as 
though she were another child; (4) 
teaching, where M gives information 
to C for the purpose of increasing his 
knowledge; (5) helping, where M 
gives physical help to C; (6) struc- 
turizing, where M facilitates activity 
on the part of C in a manner that 
stimulates independent thinking in C 
and leaves the responsibility of deci- 
sion to C (a rating is used here to dis- 
tinguish between a suggestion for 
which C has the option of acceptance 
and the giving of incomplete cues that 
call for some individual thinking on 
the part of C); (7) directing, where
the mother states specifically the course
she wants C to follow (ratings are used
here to distinguish five points along a
continuum from M anticipating noncompliance
through matter-of-fact directing
to angry demand); (8) interference,
where M interferes with C’s
activity with intent of stopping it (rat- 
ings are also used here to distinguish 
similar points from those under direct- 
ing); (9) restriction, where M modifies 
C’s behavior but does not stop it; (10) 
interfering by structurizing, where M 
points out undesirable consequences of 
a course of action, but the final decision
is left to C; (11) criticism, where
M criticizes plans or punishes C (here
ratings are used to distinguish intense
or mild statement from actual physical
punishment); (12) praise or affection,
where M praises or gives encouragement
to C, including patting or
hugging C (ratings of two kinds are 
used here, the first to distinguish prais- 
ing C’s activity as compared to pro- 
viding reassurance and the second to 
distinguish points on a dimension, from 
mother complying to C’s demand, 
through submission to enthusiastic re- 
sponse to C’s stimulation in a spirit of 
having fun with C); (13) noncoopera- 
tion, where M ignores or refuses to 
comply or accept stimulation from C 
(ratings are used again to distinguish 
five points on a continuum, from sim- 
ply ignoring through noncooperation 
to angry nonacceptance). 

We list here only the major category 
totals for the 15 categories that are 
used specifically for recording the 
child’s behavior: (1) seeking information,
(2) seeking help, (3) suggestions,
(4) seeking attention, (5) seeking contact,
(6) directing, (7) interference,
(8) restricting, (9) criticism, (10) 
seeking praise, (11) praise, (12) asking 
permission, (13) indications of anxiety 
(where C expresses concern over con- 
sequences of his activity), (14) coop- 
eration, and (15) noncooperation. Rat- 
ing scales or further subcategories were 
also used for all but six of these cate- 
gories. 

Bishop’s method of recording was to 
use one block square for each five sec- 
onds of thirty-minute play sessions. The 
observer was behind a one-way screen, 
and five-second intervals were signaled 
by an automatic light. The observer re- 
corded the status of the interaction (or 
at least of the first stimulus-response 
that occurred) in the five-second in- 
terval. The time interval was short 
enough so that there were very few 
instances in which one or two symbols 
did not completely describe the ob- 
served behavior as far as the categories 
were concerned. The observer reliability
was calculated by dividing the number
of agreements between two observers
by 360, which is the number of
five-second intervals in thirty minutes. 
The total reliabilities averaged .87, and 
Bishop has reported the reliability for 
each category as well (1951, p. 12). 
The reliabilities for the ratings aver- 
aged .88. On the whole, the experi- 
mental setup and the categories pro- 
vided a situational framework to the 
mothers, which evoked and recorded 
behavior in a half-hour session that was 
rather highly consistent under repeti- 
tion in another session on a different 
day. The average correlation for the 
34 mothers (computed by major categories)
was .71. A similar statement
can be made with regard to the repe- 
tition of the child’s behavior in two 
sessions, in terms both of frequency 
and reliability. 
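
The reliability index described above, agreements divided by the 360 five-second intervals of a thirty-minute session, can be sketched in Python as follows. The function name, the list-of-symbols representation, and the sample records are assumptions introduced for illustration, and the sketch assumes one symbol per interval.

```python
INTERVALS_PER_SESSION = 360  # thirty minutes of five-second intervals


def interval_agreement(observer_a, observer_b):
    """Proportion of five-second intervals scored identically by two observers.

    Each argument is a list with one category symbol per interval.
    """
    if (
        len(observer_a) != INTERVALS_PER_SESSION
        or len(observer_b) != INTERVALS_PER_SESSION
    ):
        raise ValueError("each record must contain exactly 360 intervals")
    agreements = sum(1 for a, b in zip(observer_a, observer_b) if a == b)
    return agreements / INTERVALS_PER_SESSION


# Hypothetical records: the observers differ on 47 of the 360 intervals.
record_a = ["contact"] * 360
record_b = ["contact"] * 313 + ["helping"] * 47
reliability = interval_agreement(record_a, record_b)
```

Because the denominator is fixed by the clock rather than by the number of scored acts, an interval missed by one observer lowers the index in the same way as a disagreement over the category.
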

Using this procedure, Bishop was 
able to investigate hypotheses regard- 
ing correlations existing between 
mother and child behavior, individual 
differences among mothers and chil- 
dren, and indications of stimulus gen- 
eralization on the part of the child 
when he was tested in a similar way 
in the company of a neutral adult fe- 
male who was a stranger to him. This 
work is of the kind that leads to new 
developments in a field, and we shall 
report subsequently on two further 
category-system developments, which 
have grown out of Bishop’s excellent
start. We have mentioned that this ap- 
proach characterizes one way in which 
interpersonal interaction can be quan- 
tified, while at the same time retaining 
by the use of ratings some of the quali- 
tative dimensions of action. The scor- 
ing technique provides a kind of con- 
tinuous record; it displays a careful 
sensitivity to the problem of instiga- 
tion and presages later attempts at the 
development of an interact unit that 
retains characteristics of the successive 
flow of interaction. 

Henrietta Smith (1958) has recently 
used some of Bishop’s categories for a 
study of the dependency behavior of 
children in interaction with their 
mothers, which is interesting on two 
counts: it is interesting as a method- 
ological study, and it included a dif- 
ferent manner of scoring that brings 
up the possibility of studying sequen- 
tial aspects of mother-child interaction. 
This last aspect, although not fully utilized
in her analyses, may be one of the
new important developments in scoring 
interpersonal behavior. In order to 
achieve a parsimonious and precategor- 
ized record of the sequence of theoreti- 
cally relevant action, Smith dropped 
the five-second time record of Bishop 
and substituted for it a scoring sheet 
containing a column for each category. 
The behavior of both mother and child 
was recorded, the units being set down 
in rows in the order of occurrence. 
This recording procedure makes it pos- 
sible to indicate number and nature, as 
well as sequence, of the scored elements 
for each person. Smith recorded the
interaction of thirty mothers with their
children (18 girls and 12 boys). The children’s
ages ranged from 3 to 4.1 years.
The mothers were observed for thirty 
minutes from behind a one-way screen 
and then for an additional fifteen min- 
utes during which the mother was re- 
quested to fill out a questionnaire sheet 
while still in the presence of her child. 
This permitted an experimental meas- 
ure of her behavior towards the child’s 
dependency solicitations while she was 
busy. 
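
A record of this sequential kind, with units set down in order of occurrence, lends itself to a simple Python sketch. The representation as a list of (actor, category) pairs, the category labels (borrowed from Bishop’s list for illustration), and the function names are assumptions; Smith’s actual scoring sheet is not reproduced in the text.

```python
from collections import Counter

# Hypothetical sequential record: one row per scored unit, in order.
record = [
    ("child", "seeking help"),
    ("mother", "helping"),
    ("child", "seeking attention"),
    ("mother", "noncooperation"),
    ("child", "seeking attention"),
    ("mother", "praise or affection"),
]


def category_counts(rows):
    """Number of scored units per (actor, category)."""
    return Counter(rows)


def child_mother_pairs(rows):
    """Count adjacent pairs in which a child's act is followed by a mother's act."""
    pairs = Counter()
    for (actor1, cat1), (actor2, cat2) in zip(rows, rows[1:]):
        if actor1 == "child" and actor2 == "mother":
            pairs[(cat1, cat2)] += 1
    return pairs


counts = category_counts(record)
pairs = child_mother_pairs(record)
```

The same record thus yields both the frequency totals that a time-sample sheet would give and the adjacent stimulus-response pairs that a time-sample sheet cannot recover.
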

The main content interest of Smith’s 
study was to compare the usefulness 
of the observation procedure with rat- 
ings obtained from open-ended inter- 
views with the mothers on comparable 
category topics. She reports interob- 
server reliability varying from .75 to 
1.00, depending upon the category, 
with the over-all average over all cate- 
gories used in five sessions of .94. She 
also reports significant correlations in 
a number of the mother-behavior cate- 
gories from the first fifteen minutes to 
the second fifteen minutes of the observed
sessions. The scores on the third
fifteen-minute session (when the mothers
were busy) were studied to determine
which mothers changed the most.
Only six mothers changed their scores
more than one standard deviation from
the mean in 40 per cent or more of
the categories. The responses of these 
mothers on the open-ended interview 
were studied, and it was discovered 
that these six mothers were consistently 
within one sigma of the mean on ten 
interview questions, five of which were 
judged to involve some anxiety and 
five, which were considered to be neu- 
tral, selected at random. Smith’s analy- 
sis along these lines led her to suggest 
that these six women who changed so 
much in their behavior toward the chil- 
dren when they were experimentally 
caused to be busy are “defensive moth- 
ers,” who had entered the observations 
with a strong set to behave in a cer- 
tain way while under observation but 
who were forced to reflect some of 
their more usual behavior when their 
set was experimentally altered. Smith 
concludes that such “defensive moth- 
ers” can possibly be more easily dis- 
criminated by the change in their be- 
havior under observation conditions 
than by the answers to interview ques- 
tions, since they were all close to the 
mean in the interview answers, though 
discrepant from the mean in behavior 
when they were busy. On other 
grounds, however, the interview was 
found to be equally useful in reflecting 
early findings with regard to the imme- 
diate instigating conditions to depend- 
ency behavior as well as to some of 
the home-condition antecedents. 
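
The criterion used to single out the mothers who changed most (a shift of more than one standard deviation from the mean in 40 per cent or more of the categories) can be sketched in Python. The sketch rests on several assumptions not stated in the text: that each mother has a change score in every category, that the mean and standard deviation are those of the group’s change scores, and that the population standard deviation is used. All names and data are hypothetical.

```python
from statistics import mean, pstdev


def flag_large_changers(changes, threshold=0.40):
    """Return the mothers whose change scores are extreme in many categories.

    `changes` maps each mother to a dict of category -> change score.
    A score is "extreme" when it lies more than one standard deviation
    from the group mean for that category.
    """
    categories = sorted({c for scores in changes.values() for c in scores})
    stats = {}
    for category in categories:
        values = [scores[category] for scores in changes.values()]
        stats[category] = (mean(values), pstdev(values))

    flagged = []
    for mother, scores in changes.items():
        extreme = sum(
            1
            for category in categories
            if stats[category][1] > 0
            and abs(scores[category] - stats[category][0]) > stats[category][1]
        )
        if extreme / len(categories) >= threshold:
            flagged.append(mother)
    return flagged


# Hypothetical change scores for five mothers in two categories.
changes = {
    "M1": {"directing": 10, "noncooperation": 10},
    "M2": {"directing": 0, "noncooperation": 0},
    "M3": {"directing": 0, "noncooperation": 0},
    "M4": {"directing": 0, "noncooperation": 0},
    "M5": {"directing": 0, "noncooperation": 0},
}
flagged = flag_large_changers(changes)
```

With these invented figures only the first mother exceeds the criterion; the others lie within one standard deviation of the group mean in every category.
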

Smith used 15 categories drawn from
Bishop’s categories described above, 
plus one category, which was used to 
record positive social interaction of a 
verbal sort, that was not concerned 
with dependency behavior. She used 
four categories for children’s behavior 
and 11 categories for the behavior of 
the mother. The mothers’ behaviors, 
which were considered theoretically 
relevant, were those that served as 
either stimulus or response to the 
child’s dependency behavior or that 
were related to forms of control of 
the child’s behavior. We shall not list 
Smith’s categories here, since the major 
points of interest in this study are (1) 
that portions of someone else’s cate- 
gory systems can be used effectively 
for more limited theoretic purposes; 
(2) that by giving up the small time- 
sample feature of Bishop’s approach 
sequential aspects of interaction can be 
recorded, including the possibility of 
drawing from the record higher-level 
units, such as interacts or interact sequences
in the sense of these terms discussed
in the first part of this chapter; and
(3) that the study provides a nice example
of how observation records can be in- 
tegrated with a study that has ante- 
cedent-consequent as well as experi- 
mental features. 

The most ambitious attempt at the 
development of objective scoring cate- 
gories to grow out of Bishop’s work is 
reported in a paper by Moustakas, 
Sigel, and Schalock (1956). The aim 
again is to develop categories for re- 
cording adult-child interaction. Bish- 
op’s categories were used as a point of 
departure but were then refined, the 
definitions clarified, and numerous cate- 
gories added “to make our schedule
more inclusive, detailed and widely ap- 
plicable” (p. 110). A scale for record- 
ing anxiety and hostility was added. 
Since much thought, experience, and 
hard work have gone into the develop- 
ment of this set of categories and since 
the reliability seems to be quite ade- 
quate, anyone planning observation 
work on child-adult diads should seri- 
ously consider using this system to pro- 
vide continuity and comparability in 
research. 

Such a decision, however, must be 
made with the knowledge that consid- 
erable time will have to be spent in 
training observers, since the system in- 
volves a total of 89 adult and 82 child 
categories as well as anxiety-hostility 
ratings. Furthermore, the categories are 
of various kinds, calling for attention 
to all of the classes of information that 
we have discussed in outlining different 
ways of defining individual acts. For 
example, several categories involve an 
attempt to elicit a particular kind of 
response or behavior by the adult or 
the child. These “stimulus categories” 
are (1) seeking information, (2) seek- 
ing help, (3) seeking reassurance, (4) 
seeking recognition, (5) seeking praise, 
(6) seeking affection, (7) seeking re- 
ward, and (8) seeking permission. This 
requires attentiveness on the part of the 
observer to information regarding the 
observable intention or the “usual in- 
tention” of the actors. Other catego- 
ries, such as those involving “orienting 
and directing,” appear to be judged 
more in terms of the visible effect or 
usual effect of overt actions. The at-
tempt to “objectify” in defining the
categories appears to lead to a focus on 
verbal social behavior, and although
nonverbal matters are probably a help
in categorizing, they are not specifically
included in the definitions. The authors
suggest that meaningful behavior of the
kind that involves tone, gestures, pos-
tural signals, and facial expressions
might be the subject of further con- 
struction of categories. 

The actual recording procedure used 
by Moustakas et al. is modeled after 
that of Bishop. The basic unit is a 
five-second time interval, which is re- 
ported by putting symbols on a printed 
square on the recording sheet. Symbols 
referring to the various categories of 
behavior are placed in different posi- 
tions in the square or above the square, 
according to set rules. For example, 
the first act by either person in a five- 
second interval is placed in the upper 
left of the box. It is not quite clear 
whether or not this act is always an 
“instigating” one. This seems in places 
to be implied by the writers, but since 
they are not focused conceptually upon 
what we have called an interact, in- 
volving an instigation-reaction unit, this 
question might be further clarified in fu- 
ture work for at least some theoretical 
purposes. The location in the square of 
behavior subsequent to this initial entry 
depends upon whether the adult or the 
child provides the initial behavior. If 
the adult provides the initial behavior, 
the child’s behavior is recorded in the 
lower right corner of the square. If the 
child provides the initial behavior, the 
adult’s behavior is always recorded 
above the square. Only the adult’s be- 
havior can be recorded in the space 
above the square, and only the child’s 
behavior can be recorded in the lower 
right corner. Such conventions provide
the eye with easy access to informa-
tion that facilitates plucking out five-
second intervals in which the child or
the adult is the initiator, but they sharply
limit the number of persons whose
behavior can be recorded. Such a scor-
ing procedure is obviously not set up
for solving the problems of recording
interpersonal interacts in a playground
situation such as that which interested
[...]. Squares do not have enough
[...] dealing with three or more
people, and the complexity of the
placement code could quickly become
a deterrent to interobserver reliability.
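
The placement rules just described can be made concrete with a small
sketch. The Python below is an illustration under assumptions, not the
authors' own notation: the names IntervalSquare and record_interval and
the slot labels "upper_left", "lower_right", and "above" are hypothetical.
Only the rules stated in the text are encoded: the first act goes in the
upper left, a child's response in the lower right, and an adult's response
above the square.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple

ADULT, CHILD = "adult", "child"


@dataclass
class IntervalSquare:
    """One five-second square; the slot names are hypothetical labels."""
    slots: Dict[str, str] = field(default_factory=dict)

    def initiator(self) -> str:
        # The position of the response shows who acted first.
        if "lower_right" in self.slots:
            return ADULT
        if "above" in self.slots:
            return CHILD
        return "unknown"  # no response was recorded in this interval


def record_interval(acts: List[Tuple[str, str]]) -> IntervalSquare:
    """Place the first act and the other person's response in one square."""
    square = IntervalSquare()
    for actor, _ in acts:
        if actor not in (ADULT, CHILD):
            # The placement code has no position for a third person.
            raise ValueError(f"no position in the square for {actor!r}")
    if not acts:
        return square
    first_actor, first_category = acts[0]
    square.slots["upper_left"] = first_category
    for actor, category in acts[1:]:
        if actor != first_actor:
            position = "lower_right" if actor == CHILD else "above"
            square.slots[position] = category
            break  # this sketch records a single response per interval
    return square
```

Because the initiator is read off from where the response sits, the sketch
also shows why the scheme cannot be extended to a third person without
inventing new positions.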

The procedure here permits the re- 
cording of more than one sequence 
within a five-second interval; for in- 
stance, when the mother is engaged in 
a common activity with the child and 
also chats about something else. It has 
not been elaborated, of course, to cap- 
ture the double or triple functions or 
the nuances involved in a particular act. 

When an interaction takes more than 
five seconds to complete, the same sym- 
bols will appear in the next five-second 
square. Although the present writer has 
not actively worked with the Bishop 
scoring procedure, nor that of Mous- 
takas et al., he feels that there might 
occasionally be confusion in interact 
analysis in deciding whether a particu- 
lar series of squares, which may retain 
the same scored form for three five-
second intervals, is in fact the same
series of acts. That is, there seems to
be some need for symbols that would 
differentiate when one interact of the 
same form leaves off and another iden- 
tical-looking interact begins (as called 
for in Gellert’s unit rules). Such con-
siderations would arise only when
someone was interested in using this 
kind of record for analyzing the prop- 
erties of interacts (such as their extent 
in time) or the antecedent situational 
or cultural conditions for a particular 
class of interacts (i.e., instigation-reac- 
tion units or instigation-reaction-effect 
act units). What is needed here per- 
haps is further movement in the direc- 
tion of placing within Bishop’s form 
of analysis the possibilities for the de- 
scription of the sequence of events in 
an interaction situation that we found 
in Smith’s approach. Moustakas et al. 
have done something in this direction 
for the case of diad observation. A 
truly generally useful set of categories
may call for assuring the virtues of 
Smith’s approach along with the virtues 
of Bishop’s elements or time units but 
at the same time, solving problems that 
will permit observations of larger 
groups or at least of individuals studied 
by the man-to-man strategy acting 
within larger groups, such as class- 
rooms, playgrounds, or family settings. 

Moustakas et al. present data regard- 
ing the reliability of the adult and the 
child categories, and except for cases 
in which a category has not yet been 
observed to occur these reliabilities gen- 
erally work quite adequately (Mous- 
takas et al., 1956, pp. 129-30). They 
have reported data on therapist-child 
interaction and mother-child interac- 
tion in a playroom as well as mother- 
child interaction in the home. Although 
we would feel better about these cate- 
gories if they had been shown to pro- 
vide the basis for constructing cate- 
gories to measure entities in systematic 
theory, there is little doubt that such 
attempts as this will soon be available
because these categories obviously re- 
flect much experience and wisdom in 
the observation of interpersonal be- 
havior. 

Strodtbeck (1958) has developed an 
interesting use of the Bales scoring 
categories (Bales, 1950) in connection 
with the problem of quantifying the 
power relationships in a family. In this 
case, the experimenter and his assistant 
visited the homes of a sample of school 
children from Jewish and Italian ethnic 
backgrounds. A questionnaire was ad- 
ministered to the mother, father, and 
son, the answer sheets were collected, 
and the assistant compiled a set of items 
that were then discussed by the three 
members of the family. He selected 
items for family discussion with an eye 
to creating three coalitions of the fol- 
lowing types: (1) mother and son 
agree, father disagrees; (2) father and 
mother agree, son disagrees; and (3) 
father and son agree, mother disagrees. 
The family members then moved into 
position around a tape recorder and 
were asked to try to agree on one po- 
sition on the issue that would best rep- 
resent the family’s opinion, if this were 
possible. The family members were re- 
minded of their individual answers on 
the previous questionnaire. The experi- 
menter and the assistant then retired 
to an adjacent room and worked at the 
controls for the recording equipment 
until the family had come to some de- 
cision. For most families, it was pos- 
sible to go through this procedure nine 
times, providing three replications of 
each coalition type. All protocols were 
scored directly from the recordings 
and were not transcribed; the catego-
ries developed by Bales were used for
interaction recording. 
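
The selection of discussion items by coalition type can be sketched in a
few lines. This is only an illustration: the text does not say how answers
were coded, so the sketch assumes each member's answer to an item is a
simple label, and the names isolate_on_item and select_items are
hypothetical.

```python
from typing import Dict, List, Optional

MEMBERS = ("mother", "father", "son")


def isolate_on_item(answers: Dict[str, str]) -> Optional[str]:
    """Return the member who disagrees with the other two, if any."""
    for isolate in MEMBERS:
        first, second = [m for m in MEMBERS if m != isolate]
        if answers[first] == answers[second] != answers[isolate]:
            return isolate
    return None  # unanimous, or all three answers differ


def select_items(questionnaire: Dict[str, Dict[str, str]],
                 per_type: int = 3) -> Dict[str, List[str]]:
    """Pick up to `per_type` items for each coalition type, in item order."""
    chosen: Dict[str, List[str]] = {m: [] for m in MEMBERS}
    for item, answers in questionnaire.items():
        isolate = isolate_on_item(answers)
        if isolate is not None and len(chosen[isolate]) < per_type:
            chosen[isolate].append(item)
    return chosen
```

With per_type set to 3, a family that yields enough two-against-one items
supplies the nine discussions (three replications of each coalition type)
mentioned in the text.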

These categories are discussed in an 
illuminating way by Heyns and Lip- 
pitt (1954). Despite the name Bales has 
given the method, the basic unit re-
corded is a communicative act of the
kind that we have called a single act, 
and it is scored on the basis of its 
meaning to the observer “in the role of 
the generalized other.” The scoring 
also calls for the observer to judge the 
function of the act with reference to 
the group problem. These requirements, 
plus the fact that the scoring system is 
limited almost completely to verbal be- 
havior in a discussion situation, place
limits on its usefulness for child-devel- 
opment study. The category system 
has some elegant properties, however, 
and observers who are well trained can 
achieve considerable reliability. The 
number of categories (12) is extremely 
parsimonious compared to the large 
number involved in the foregoing sys- 
tem. 

Strodtbeck’s method of obtaining a 
quantified index of family power was 
to assign arbitrary scores to the final 
family decision, so that winning or 
holding one’s position when in the mi- 
nority is weighted more heavily when 
one is an isolate than when one is a 
member of the larger coalition. An- 
other way to devise a power score is 
to analyze the total number of acts 
initiated by each member of the family 
during the discussion. According to 
Strodtbeck, “Previous research leads 
one to expect that persons who talk 
most should have most power in the 
sense of winning the most decisions” 
(p. 165). In addition to this kind of
index, the use of the Bales categories
enables one to arrive at an index of 
“supportiveness,” in which “we utilize 
jointly the information concerning the 
originator and target of each act, as 
well as the category in which it is 
placed, to form an index which reflects 
the tendency of a particular actor, 
number one, to give positive responses 
to the attempts at problem solution by 
another actor, number two” (1958, p. 
167). This index is constructed some- 
what indirectly (since the basic unit is 
not that of the interact in our sense) 
by subtracting the sum of the “negative
acts” from person one to person two
from the sum of the positive acts and
dividing the result by the number of
acts from person two to person one in
the problem-solving categories of
Bales’s system. We shall not attempt
to summarize the findings in Strodt- 
beck’s study that are related to these 
indices except to say that the indices 
act in such a way that they may be 
very valuable additions to our tools in 
the study of interpersonal behavior. 
They might be even more valuable, of 
course, if the technique were used to 
retain sequence to provide direct clas- 
sification of interact units for construc- 
tion of indices. 


OBSERVATION OF “INTERACTS” 
IN EVERYDAY SETTINGS 


The category systems described 
above were selected because they il- 
lustrate a range of problems and pro- 
cedures in the technical problems of 
observation method. Observations have 
been made in free-play periods (the
Iowa study, Gellert’s work, Murphy’s
study), in homes (Strodtbeck, Mous-
takas et al.), in a range of everyday
life situations (Faigin), and under labora-
tory conditions (Bishop, Smith, Ge-
wirtz). We have also shown how cate- 
gory systems vary from those that 
focus upon very specific behaviors of 
the simple act or movement kind 
(Levin et al., Gewirtz) to general pur- 
pose categories that may serve several 
theories and may also serve to measure 
more than one kind of unit (Moustakas 
et al., Bishop). 

It is to this point that we wish to 
turn for a moment: there seems to be 
an interesting sequence of development 
in our observation techniques over the 
last twenty years or so, and the devel- 
opment seems to center particularly 
around attempts (1) to solve the prob- 
lem of the control of external instiga- 
tions; (2) to record explicitly such in- 
stigations; (3) to search for efficiency 
by the use of precoded categories; and 
(4) to acquire an increasing sophisti- 
cation with regard to issues of sam- 
pling. The first two tendencies are in- 
terwoven with the challenge of learn- 
ing how to record the interact as a 
unit in an efficient manner. Murphy 
used the relatively inefficient diary 
method to capture “interaction,” but 
the notion was not achieved opera- 
tionally because instigations were 
merely recorded as (largely) a help in 
defining the single-act class of “sym- 
pathy behavior.” The Sears group 
(Iowa studies) were very sensitive to 
instigation control but worked mainly 
on controlling the social structural (and 
cultural) sources of external instiga- 
tion. They did not control the inter- 
personal sources, as in the case in which 
“P acted aggressively in retaliation to
O’s instigation,” nor did they vary it
systematically. The laboratory experi-
menters who also observe interpersonal 
behavior have, of course, achieved 
much more control or variation of in- 
stigation sources but usually within the 
limits of the “rules of standardized test- 
ing,” which, as we have suggested, have 
their limits of usefulness. The first real 
approach to a solution occurs, as we 
have seen, in Bishop’s work, but here 
the unit act used was really a unit time 
measure, and changes would have to 
be made in order to achieve a clear and 
interpretable interact record. We sug- 
gested some of these changes in our 
discussion of the intriguing categories 
of Moustakas et al. Smith’s work sug- 
gested the possibilities of an efficient 
sequential record, although she has not 
yet displayed what could be done with 
such added power. Finally, in Mous- 
takas et al. we see the near realization 
of an interact sequential record for the 
diad. 

But it is only for the diad that these 
advances have been partly achieved. It 
is interesting that the final category 
system, which we shall discuss—that 
for the Cornell-Harvard-Yale Sociali- 
zation Study—attempts to utilize and 
broaden these gains by trying to cap- 
ture interacts (in the sense of instiga- 
tion-reaction-effect act units) as they 
occur in a sample of a child’s life in a 
number of important “settings” and 
for a number of different cultures 
(PSU’s). In order to do this, it was 
necessary to go back to a somewhat 
improved version of the old diary 
method (“total record” set). 

The total project (see Whiting et al., 
1955) was organized around a concep-
tualization of nine behavior systems or
sets of behavior dispositions, each of
which was defined as integrated around 
a particular motive. These behavior 
classes are aggression, sociability, nur- 
turance, succorance, independence, 
achievement, obedience, responsibility, 
and self-reliance. The study design in- 
cluded other data obtained in a num- 
ber of ways, but major emphasis was 
placed on systematic behavior sampling 
of 12 five-minute periods spread over 
several weeks for 24 carefully sampled 
children (12 boys, 12 girls; 12 younger, 
12 older) in each of five (later six) so- 
cieties. 

The field workers spent consider- 
able time before leaving the country in 
becoming trained in observation meth- 
ods and worked over the definitions of 
the behavior systems by use of multi- 
ple examples, much discussion, and 
practice in observation with children 
in the United States. Tentative defini- 
tions of the major instigators were 
worked out that were consistent with 
theory and with the contents of the 
interview used with the mother. The 
instigating situations of particular im- 
portance were O hurts P, O insults P, 
O hurts self, O has difficulty, O asks 
for help, P hurts self, P has difficulty, 
O tries to dominate P, P breaks rule, 
O tries friendly approach, O repri- 
mands P. 

Arriving in the foreign culture, the 
field personnel worked with bilingual 
assistants for several months before at- 
tempting observations and watched for 
examples of the response classes and of 
the instigating classes. The classifica- 
tion problem was in a sense an iterative 
one. They were “set” to record all in-
stances of the response classes and to
capture the sequence and context of
events in order that the instigation
might be reconstructed in case it was
not of the theoretically expected class.
A child might display aggression, for
example, not because he had been
“picked on” or frustrated but as a re-
sponse to a friendly advance from a
younger child. Sometimes when O is
hurt, P hits him (because O had been
careless), etc. The records of verbatim
verbal and nonverbal behavior included
a “cast” of all people who were pres- 
ent and a sketch of the physical setting. 
Interpretations and inferences were in- 
cluded but placed in parentheses. 

The record was typed and sent to a 
central office for coding, and the cod-
ing was done by people who had been
intensively trained and who read the
field notes for the culture and other
relevant material. The coder then
“mapped” out the interaction sequen- 
tially, using a column for each actor 
and a row for each act. This permitted 
the coder to abstract out those inter- 
acts that were of any theoretical rele- 
vance. The coder was in search of any 
act on the part of P or O or of any 
TP that was theoretically relevant and, 
in the case of the last two, also had 
something to do with P. In other
words, if any instigation, any response 
of P, or any effect of any response of 
P, or any combination thereof, was of 
theoretical interest, it was retained for 
purposes of analysis. 
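
The coder's two steps, mapping the record into a grid and then retaining
the relevant interacts, can be sketched as follows. This is a hedged
illustration only: the class names Act and Interact, the functions
map_sequence and retain, and the category labels used with them are
hypothetical, and the sketch assumes the interacts have already been
assembled by the coder.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Set

ACTORS = ("P", "O", "TP")  # the child observed, the other person, a third person


@dataclass(frozen=True)
class Act:
    actor: str
    category: str  # hypothetical category label


@dataclass(frozen=True)
class Interact:
    instigation: Optional[Act]  # None stands for "no external instigation"
    response: Optional[Act]     # None stands for "no act of P"
    effect: Optional[Act]       # None when no effect act was observed


def map_sequence(acts: List[Act]) -> List[Dict[str, str]]:
    """One row per act, one column per actor; cells for other actors stay blank."""
    return [{name: (act.category if act.actor == name else "")
             for name in ACTORS} for act in acts]


def retain(interacts: List[Interact], relevant: Set[str]) -> List[Interact]:
    """Keep an interact if any of its three elements is theoretically relevant."""
    kept = []
    for interact in interacts:
        elements = (interact.instigation, interact.response, interact.effect)
        if any(e is not None and e.category in relevant for e in elements):
            kept.append(interact)
    return kept
```

The retention rule is deliberately inclusive: a single relevant element
keeps the whole interact, which is why categories are also needed for the
otherwise irrelevant elements.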

It was necessary, of course, in cases
in which, for example, only the re- 
sponse of P was relevant, to provide 
categories for the (otherwise) irrele- 
vant instigation and effect acts. Thus
the category system grew toward a 
rather broad and possibly fairly “gen- 
erally useful” system. It was elaborated 
particularly in the direction of forms 
of control and of discipline. 

The information regarding the insti- 
gation (including the category “no ex- 
ternal instigation”), the reaction of P 
(including the possibility of “no act of 
P” or “P ignores”), and the effect act 
(if any) were coded for placement on 
an IBM card. Such a card—an “inter-
act” card—is the first approximation to
this level of analysis, exploratory and 
costly though it may be, and the use 
of these cards may provide a stimulus 
to our theoretical and practical imagi- 
nation regarding new possibilities in the 
study of interpersonal behavior. 

The information mentioned above is 
merely the nucleus of “fact” around 
which the interact card is organized. 
The card is literally filled with addi- 
tional information—eighty columns of 
it. Some of this information is, in fact, 
the basis for new kinds of exploration. 
It includes, for example, the place in 
the sequence of interacts of the par- 
ticular interact on “this card”—its num- 
ber, so to speak, in the five-minute ob- 
servation time. 

It was also early recognized that a 
particular act (on the part of P or O 
or of TP) might deserve more than one 
classification. Therefore, the IBM in- 
teract card includes such considerations 
in the form of adverbs, which, so to 
speak, “modify” the main classification 
of an act. Suppose P initiates a request 
for help in an angry and demanding 
manner. It is, then, a “succorant re-
sponse,” but it is also done “aggres-
sively” or “dominatingly” or both of
these (there is provision on the card 
for “triple adverbs”). These adverbs 
were quite important, for as long as the 
first adverb on at least one basic act in 
the interact was theoretically relevant 
then the total interact was kept for 
analysis. 
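
The adverb rule can be stated compactly in code. The sketch below is an
illustration under assumptions: the names CodedAct and
keep_by_first_adverb are hypothetical, and the limit of three adverbs per
act is read from the text's mention of “triple adverbs.”

```python
from dataclasses import dataclass
from typing import Sequence, Set, Tuple


@dataclass(frozen=True)
class CodedAct:
    category: str                  # main classification of the act
    adverbs: Tuple[str, ...] = ()  # modifiers, in the order coded

    def __post_init__(self) -> None:
        if len(self.adverbs) > 3:
            raise ValueError("the card provides for at most three adverbs")


def keep_by_first_adverb(interact: Sequence[CodedAct], relevant: Set[str]) -> bool:
    """True if the first adverb of at least one act is theoretically relevant."""
    return any(bool(act.adverbs) and act.adverbs[0] in relevant for act in interact)
```

Under this rule the order in which adverbs are coded matters: an act coded
“aggressively, dominatingly” keeps the interact for a study of aggression,
but not for a study concerned only with dominance.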

Let us review briefly the other kinds 
of information included on the card. 
There is information about P him- 
self (age, sex, etc.); the setting of the 
action is there (play, work, authority 
present or not, at home, etc.); the size 
of the group P is in is there; the identity 
of the instigator (at least in relation to P 
or to the society—but he is replaceable 
in any other sense); the act of O is 
coded; the purposive nature of the in- 
stigating act is separately coded (was 
the act accidental, instrumental, imi- 
tative, retaliatory, etc.); the adverbs for 
the instigating act are coded; the in- 
tensity of the act is coded. All of this 
information is present for each of the 
basic elements of the interact, that is, 
not only the instigating act but the 
response of P and the effect act that 
flows from P’s act. Repetitions of the 
same interact within the five-minute 
period were weeded out in order to 
keep independent the interacts retained. 

The purpose of all this is to provide, 
in effect, the possibility of analyzing 
the impact on the behavior of P of 
aspects of all of the major elements in- 
volved in our diagram in Fig. 20-1: cul- 
ture, setting, identity of O (or TP), im- 
mediate social structure, instigating act 
of O (if any), and (for some analyses) 
the effect act. The IBM card will even- 
tually contain background information 
on P, providing at least one independent
check on some of the internal instiga- 
tions that might be acting in P to help 
determine his behavior. Given such a 
card, all of the kinds of analyses that 
were discussed in the first two sections 
could, in principle, be done and a num- 
ber of different kinds of hypotheses in- 
vestigated. All, that is, except one kind 
of analysis, as we shall see. Further, 
these “exploratory” cards have been 
obtained at a great price in energy and 
money because of the costly recording 
and coding procedures that had to be 
used at this stage in the technical history 
of observation methods. Thirty thou- 
sand IBM cards emerged from this 
analysis of the data from six societies 
or PSU’s (one each from the United 
States, India, Mexico, Okinawa, the 
Philippine Republic, and Africa). And 
much must be done before we will 
know what these particular cards can 
tell us. 
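
A little arithmetic helps to convey the scale of this record. The sketch
below assumes that the planned design (24 children and 12 five-minute
periods per child) was fully carried out in all six societies and that
“thirty thousand” is a round figure; neither point is confirmed in the
text, so the result is an order of magnitude rather than a reported
statistic.

```python
SOCIETIES = 6
CHILDREN_PER_SOCIETY = 24
PERIODS_PER_CHILD = 12
MINUTES_PER_PERIOD = 5
TOTAL_CARDS = 30_000  # "thirty thousand", presumably a round figure

# Observation periods and observation time planned for one society.
periods_per_society = CHILDREN_PER_SOCIETY * PERIODS_PER_CHILD
hours_per_society = periods_per_society * MINUTES_PER_PERIOD / 60

# Average number of interact cards per five-minute period, all societies pooled.
total_periods = SOCIETIES * periods_per_society
cards_per_period = TOTAL_CARDS / total_periods

print(periods_per_society, hours_per_society, total_periods,
      round(cards_per_period, 1))
```

On these assumptions each society contributes 288 periods, or 24 hours of
observation, and each five-minute period yields about 17 cards on the
average.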


Summary and Discussion 


In this chapter we have attempted to 
create a “sense of problem” regarding 
the strategies and techniques involved 
in the direct study of interpersonal be- 
havior. Having provided a scheme 
(Fig. 20-1) for analyzing an interaction 
situation, we outlined a number of 
kinds of unit that might be abstracted 
from such a situation for study, ranging 
from a single act of one actor through 
an interact encompassing aspects of the 
energy displacements of two or more 
actors, to interpersonal activities and 
relationships. The focus of the re-
mainder of the chapter was on the first 
two of these. 

A number of aspects of strategy in 
this research realm were discussed, as 
were some of the practical problems. 
This led to a discussion of some of the 
different ways in which single acts can
be defined in terms of the source of
“information” used for classification
and to a discussion of the further
quantification of unit acts and of in-
teracts.

A discussion of a selected group of
category systems followed. An attempt
was made to outline a historical de-
velopment, so that new inventions in
technique, which had been tried first
with inefficient methods, could then 
be consolidated into efficient techniques 
that usually involve precoded category 
systems. Historical development toward 
operational recognition of momentary 
instigations as a parameter or variable in 
its own right was emphasized. Recent 
developments appear to achieve an ap- 
proximation to this end but at a cost 
in efficiency and with problems still 
present regarding the independence of 
informational sources (Whiting et al., 
1955; Moustakas et al., 1956) and the 
fact that one technique (Moustakas et 
al.), though efficiently precoded, is too 
limited in the number of people who 
can be recorded. The other, though 
broader in its grasp, still leans on the 
more costly method of post-recording 
coding. 

As we move toward efficiency in 
these regards, it should be pointed out 
that there will be an increasing need for 
better statistical models to help solve
problems. It will be necessary to solve
problems inherent in multiple con- 
tingency (see, for example, Sutcliffe, 
1957) and to investigate the implica- 
tions of the concepts of experiment- 
wide error (see Ryan, 1959) when 
applied to studies with a number of 
different dependent variables. Progress 
must be made in the direction of 
comprehending the implication of 
Kish’s (1958) analysis of the problem 
of estimating confidence limits when 
simple random sampling is given up in 
favor of more complex designs. 
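
One familiar way to see why experiment-wide error matters is to compute
the chance of at least one spurious “significant” result across several
tests. The sketch below assumes the tests are independent, which dependent
variables drawn from the same observations rarely are, and it shows the
Bonferroni adjustment as one simple remedy; it is not offered as the
treatment given in the works cited, and the function names are
hypothetical.

```python
def experimentwise_error(alpha: float, k: int) -> float:
    """Chance of at least one false rejection in k independent tests at level alpha."""
    return 1.0 - (1.0 - alpha) ** k


def bonferroni_alpha(alpha: float, k: int) -> float:
    """Per-test level that holds the experiment-wide rate at or below alpha."""
    return alpha / k


# Illustration: nine dependent variables, each tested at the .05 level.
print(round(experimentwise_error(0.05, 9), 2))  # about 0.37
print(round(bonferroni_alpha(0.05, 9), 4))      # about 0.0056
```

With nine dependent variables, then, better than one study in three would
show at least one “significant” difference by chance alone if each
variable were tested separately at the .05 level.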

At the risk of being oracular, let us 
end on a note regarding future de- 
velopments. Once we get to know more 
about the possibilities involved in the 
analysis using interacts, we may then 
wish to analyze these units (or those 
more complex) in their own right. This 
will lead to the solution of how to in- 
clude the immediate instigations (work- 
ing on two or more people) that pre- 
dispose a diad or triad toward a 
particular class of interacts. This may 
be the next big step beyond the compass 
of present methods. 

The step prior to that, however, will 
probably be to render interact record- 
ing, itself, efficient by the use of pre- 
coded categories, while at the same 
time retaining the additional informa- 
tion needed for making inferences to 
the personality of P, the settings of P’s 
action, the modal personality for a 
group, and the social or cultural con- 
tributions to “external instigation.” 

Let us hope that these developments 
can be achieved in conjunction with 
some agreement on one, or a series, of 
generally useful categories, so that the 
values of replication and of cumulation
in research can be enhanced. 
The contributions of anthropology to
the study of child rearing may be di- 
vided into three categories—concep- 
tual, methodological, and substantive. 
On the conceptual side, the field of an- 
thropology has contributed the concept 
and analysis of culture. On the meth-
odological side, anthropologists have
developed the ethnographic interview, 
participant observation, and the gene- 
alogical methods. On the substantive 
side, the major contribution of anthro- 
pology has been to report a range of 
variation in many child-rearing prac- 
tices that goes far beyond any found 
within Western European society and 
thus provides a framework for defining 
cultural variables that are not bound to
a single society. It is our purpose to
discuss each of these classes of contri-
butions in order.


CONCEPTS 


Anthropologists, like other social 
scientists, have been concerned with 
human behavior and, like other social 
scientists, have reported norms and 
variations in the behavior of people be- 
longing to various societies the world 
over. They have even at times been 
concerned with the behavior of indi- 
vidual members of these groups, as evi-
denced by the numerous biographies
and life histories they have produced.
Their methods of observing, recording,
and assessing human behavior are not
strikingly different from those of other
behavioral scientists, such as sociolo-
gists and social psychologists. In fact,
they generally have not been nearly 
as systematic in sampling and counting 
or describing the norms and variations 
of behavior. Where they have, how- 
ever, made a unique conceptual contri- 
bution is in the formulation of the 
shared symbolic determinants of be- 
havior, which, in our view, is the es- 
sential feature of the concept of culture. 

Although, as we explain below, this 
concept is often used to include overt 
behavior, and even the products of be- 
havior such as artifacts, we feel that its 
essential contribution lies in providing 
a method for coding and classifying the 
shared ideas of the members of a so- 
ciety or group, rather than in describ- 
ing norms of behavior, and it is this 
view of culture that we would like to 
present here. 

The concept of culture as it is used 
today is generally attributed to the 
British anthropologist E. B. Tylor in
his classic work Primitive Culture 
(1871). His much cited definition pro- 
vided the original impetus for a way of 
looking at social life that has persisted 
among anthropologists for the last 
seventy-five years. Many books and 
articles have been written to suggest 
modifications and elaborations of his 
formulation, the latest and most ex- 
haustive being Culture, A Critical Re- 
view of Concepts and Definitions, by 
Kroeber and Kluckhohn (1952). It is 
not appropriate in this chapter to re- 
view the various points of view and 
nuances that have been given to the 
concept but rather to describe the fea- 
tures of the concept of culture to be 
used. These features are technology, 
ethnoscience, and ethics. Each of these 
symbolic or cognitive systems is pre- 
sumed to influence the customary be- 
havior of the members of any society. 
The elements of these systems consist
of cultural units, shared habits, or cus- 
toms. Each of these units, however, has 
different attributes, and we have chosen 
to label them, respectively, techniques, 
beliefs, and values. 

A technique is a recipe for action. 
Techniques are found in cook books, 
books on infant care, and manuals on 
how to operate a car. If you wish to 
bake a cake, “take three eggs, a cup of 
flour, etc.” “To wean a child, first, 
withdraw the two a.m. bottle, feed 
orange juice from a spoon, remove the 
bottle at the noon feeding, and feed 
milk from a cup, etc.” A technique does
not imply moral compunction. There
is no implication that anyone should 
bake a cake or ought to drive a car— 
simply that this is the way people in 
this society do drive cars and bake
cakes. The manual on child care, how- 
ever, is likely to contain not only tech- 
niques but also beliefs and values. A 
technique should also be distinguished 
from a skill. The fact that a given in- 
dividual knows how to bake cakes or 
drive a car is of a different order. It is 
a statement on the behavioral rather 
than on the cognitive level. A technique 
is something that is or can be stated in 
the language of the society in which 
it exists. Some techniques are covert; 
that is, the members of the society are 
not constantly aware of them. How- 
ever, a deviation from a technique 
would be recognized immediately, and 
the technique could be formulated and 
stated by a member of this society if 
it were brought to his attention. The 
recognition and definition of techniques 
and their transmission to the next gen- 
eration is, of course, one of the most 
important aspects of the process of 
socialization. 

Belief, the second major type of cus- 
tom, may be defined as a statement of 
the relationship between events. For 
instance, greenheaded flies bite, the east 
wind brings rain, sinners will suffer in 
hell, and 6-year-old children will get 
sick with less than ten hours of sleep 
a night are examples of beliefs. Whether 
or not they are true is irrelevant. Every 
culture has a set of beliefs about the 
social and the spiritual environment that 
may be defined as the ethnoscientific 
and religious dogma of the culture. 
This set of beliefs is the second major 
aspect of the culture that must be trans- 
mitted to the children of the new gen- 
eration. 

Values comprise the third basic type
of custom. A value, as it is used in this
chapter, may be defined as a statement 
that attributes goodness or badness to 
any event. Thus murder is bad; honesty 
is good, people should be successful, a 
man ought to obey the law, and chil- 
dren should love their parents are all 
statements of values. All of the values 
held by the members of a given society 
may be said to comprise the ethical sys- 
tem of that culture. The transmission 
of values to children is generally held 
to be one of the most important duties 
of a parent and the one most difficult 
to accomplish.

Each of these three types of custom 
is internally organized in a somewhat 
different manner. Techniques are or- 
ganized so that the responses they 
specify usually achieve the desired re- 
sult in the most efficient manner and 
are thus subject to pragmatic test. A 
technique that does not work is often 
abandoned, but many magical rituals, 
such as rainmaking or water dowsing, 
have surprising persistence in the face 
of failure.1

1 The resistance of culture to change, particularly
the persistence of belief systems and
value systems, is presumed and not discussed
in detail here, largely because the precise
reasons for such persistence are not well
understood.

There are two primary principles
that govern the organization of a belief
system: that of coherence and that of
correspondence. In other words, beliefs
in any given system ordinarily do
not contradict one another, nor are
they in general stated so that they are
clearly false in terms of empirical
reality. They may, however, be stated
in such a way that it is difficult to subject
them to empirical test. This is particularly
true with respect to beliefs
about the spirit world. Statements
about the supernatural are, by definition,
such that they are not directly
subject to empirical test. In all societies
ghosts and spirits are incorporeal. Perhaps
because such beliefs cannot be
tested, they tend to be all the more
logically consistent with one another,
and thus magical belief systems generally
are beautifully coherent.

Values systems are still further re- 
moved from empirical fact. Although 
coherence characterizes a values sys- 
tem, correspondence with empirical 
reality does not. Whether an event is 
good or bad cannot be tested in the 
same way that a belief is. In other 
words, values cannot be false. They can 
only be accepted or rejected, usually 
on the basis of their consistency or inconsistency.
Again, in contrast to beliefs,
values are generally given priority
with respect to one another.* Although
one may be skeptical with respect to 
some beliefs and hold them in doubt, 
there is a great number of beliefs about 
which the members of any society are 
certain. Values, however, even those 
most strongly held, are continuously 
being opposed to one another and re- 
viewed with respect to the strength 
with which they are held. For example, 
in our society, it is held to be wrong 
to lie and to be bad to hurt another’s 
feelings. What does one do in a situa- 
tion in which one must either lie or 
hurt somebody’s feelings? 

*Beliefs do have, however, a priority of a
somewhat different order, that is, the certainty
with which they are held.

The pragmatic test is thus applied to
techniques, the test of coherence to
both techniques and values, the empirical
test to beliefs, and, finally, values
are ordered with regard to the strength
with which they are held. Thus the customs
that comprise a culture are not
a simple list of items but systems of
techniques, beliefs, and values, each integrated
in accordance with
its own principles.

Not only is each of the foregoing
systems internally integrated, but each 
type of custom is systematically related 
to the others and each performs a dif- 
ferent function in guiding action. If one 
wishes to sail a boat, the acceptance in 
the first place of a large number of be- 
liefs is required. First, that the boat will 
not sink or capsize; that the sail, if held 
at a certain angle will propel the boat; 
that the wind blows laterally to the 
earth rather than up and down; that 
the rudder will guide the boat; etc. 
Certain techniques are also accepted. 
“If you wish to turn to starboard, push 
the tiller to port.” “In sailing into the 
wind, maximum speed is obtained when 
the sail is sheeted in just so that it 
doesn’t luff.” Comparably, in the field 
of child training one may believe that 
physical punishment administered at 
the moment of transgression teaches a 
child to perform desired acts or, con- 
trariwise, that such punishment teaches 
only the avoidance of transgression but 
not the desired habit; or, again, it is 
useful only as a teaching technique 
when the child is more than 2 years 
old. Values are also involved. In the 
first instance, the act of sailing itself is 
generally considered to be good by 
those who sail, although, of course, 
others may feel that it is an inefficient
means of getting somewhere. In the
child-training example the acts desired 
or not desired involve a decision re- 
lating to value judgments. Further- 
more, one may have values as to 
whether physical punishment is good 
or bad for the child in general. These 
values may outweigh the belief that 
physical punishment is an effective way
of teaching. 

Quite clearly, the techniques, beliefs, 
and values described here are very 
closely related and are often in practice 
very difficult to distinguish one from 
another. In fact, all three are often im- 
plied, if not explicitly stated, in any 
given sentence. It is, we think, useful
to distinguish them not only because of 
the different integrating principles but 
because they form in combination a 
blueprint for action that has been called 
the custom complex (Whiting and 
Child, 1953). 


THE CUSTOM COMPLEX 


For the visually minded, the organi- 
zation of the customs that comprise a 
custom complex, together with the way 
in which they are related to one 
another and to behavior, is presented 
in the diagram on p. 922. 

To elucidate the diagram, let us take 
the example of the influence of a 
custom complex upon the responses of 
an individual. Let us suppose there is 
a 4-year-old American middle-class 
boy who is playing with a toy car in 
the living room at 6 o’clock in the 
afternoon and that his mother enters 
the room and says, “Put away your 
toys now and wash your hands for
supper.” This situation calls forth a
number of values for the child: it is
fun to play; it is bad to wash your 
hands; it is good to obey. Let us sup- 
pose that obedience is the strongest 
value and is therefore selected. The 
next problem for the child is to for- 
mulate a plan whereby he can obey 
and at the same time maximize his 
value that hand washing is a nuisance. 
He then calls upon his belief system— 
that if he goes into the bathroom and 
runs the water and makes splashing 
sounds his mother will think that he 
has obeyed her. This belief selects his
technique, and he goes into action. 
The effect of this response, however, 
may be either that his mother is taken 
in by his behavior, which confirms his 
splashing technique, his belief that he 
can fool his mother, and his value 
compromise, or that she will say to him, 
“Look at your hands. You didn’t really 
wash them.” This puts a strain on his
cognitive system. Something went
wrong. He may say to himself, “I guess 
I didn’t splash loud enough—my tech- 
nique was bad.” Or he may say to 
himself, “I guess I can’t fool my mother 
that way,” and modify his belief 
system. He may say, “I guess it’s best
to do just what mother says after all,” 
and obedience moves up in his value 
system and not washing hands, rel- 
atively speaking, moves down. 

Now, suppose the mother asks him, 
“Why did you wash your hands that 
way?” If the boy replies, “I thought 
I could fool you,” this would be, in 
our scheme, a reference to his belief 
system and rationalization of his action. 
If, however, he says, “But, I don’t like 
to wash my hands,” this would be a 
reference to his value system and a 
justification in our scheme.

It may seem from this example that 
we are talking just about the cognition
of an individual child, and in fact we
are. Culture as we see it operates in
just this way. It is only the extent to 
which the members of a society share 
their systems of values, beliefs, and 
techniques and the extent to which 
these are customary rather than idiosyncratic
matters that carries them
beyond the realm of individual psy- 
chology. 

Culture thus provides a cognitive 
map or blueprint, which governs the 
action of the members of every society, 
and the transmission of this blueprint 
is the rather imposing task faced by any 
parent if he wishes to bring up his 
child to be an adult who can operate 
effectively in his society. 

The custom complex, however, is 
but a beginning. It describes the blue- 
print for but a single action of a single 
category of persons in a single situa- 
tion. Custom complexes are, however, 
organized into roles that are played 
in institutional settings. Most readers 
will be familiar with the concept of 
role, and it need not be elaborated 
here. We would simply define it as all 
the customs that a given society expects 
an individual to accept and perform in 
a given status, which is in turn a part 
of a given institution. We would adopt, 
with slight modification, Malinowski’s 
(1944) definition of an institution as 
being a group of people occupying 
different statuses (personnel), who are 
expected to perform certain roles de- 
fined in terms of techniques (norms and 
rules), agreeing upon certain values, 
and accepting certain beliefs (charter). 

Roles and institutions thus bring a 
new principle of integration into culture.
They refer to individuals working
together toward a common end and
assume agreement among the individ- 
uals as to certain beliefs, values, and 
techniques. In other words, these con- 
cepts refer to the effect of society in 
integrating the customs that comprise 
a culture. In sum, a culture consists of 
a set of customs, which may be divided 
into techniques, beliefs, and values that 
are in turn integrated into the systems 
of ethnoscience, ethics, and pragmatics. 
Customs are combined into roles that 
are combined into institutions. We feel 
that anthropologists, in isolating the 
concept of culture and by studying its 
manifestations in societies the world 
over, have brought a perspective that 
has enabled them to make a contribu- 
tion to a more precise understanding 
of that which is transmitted to a child 
in any society by those responsible for 
child rearing. Furthermore, the concept
of culture focuses attention not only on 
the overt behavior of parents in train- 
ing their children but on the cognitive 
map that influences the content of what 
is transmitted, the techniques the 
parents employ, and their behavior as 
role models. 

This view of culture as essentially 
symbolic or cognitive will not be 
agreed upon by many nor perhaps by 
most anthropologists. They say quite 
legitimately that anthropology is con- 
cerned with behavior as well as ideas, 
and this is, of course, true. It is fruit- 
less to quibble over labels, and if one 
wishes to use the term culture to refer 
both to the symbolic and the be- 
havioral, this is also legitimate. This 
analysis would suggest only that the 
symbolic and behavioral aspects of culture
be distinguished. We would suggest
the term customary behavior to
refer to action that has been selected in 
the manner indicated in our chart— 
that is, by reference to the cultural 
symbolic systems or cognitive maps. 
Both, together with the relationship 
between them, may be taken as the 
ethnographic domain. Idiosyncratic, or 
noncustomary behavior, could then 
refer to action in which this process of 
selection has not taken place or is not 
used as an explanatory principle. In 
the examples to be given, both cultural 
data, e.g., beliefs, values, and tech- 
niques, and customary behavior are re- 
ported. 


CULTURE 
AND CUSTOMARY BEHAVIOR: 
ILLUSTRATIONS 


Consider, for example, the beliefs 
concerning the nature of a child and 
how they may influence the teaching 
techniques employed by parents. 
Rajput mothers in a northern Indian 
village believe that the fate of a child 
is written on his brow at birth—that 
not only his physique but his tempera- 
ment and behavior are predetermined 
(Triandis and Hitchcock, ms.). Mothers 
in a small New England town, on the 
other hand, report the belief that an in- 
fant is born “a bundle of potentialities” 
and that his personality and behavior 
are the result of the molding of these 
potentialities by parents, teachers, 
and peers (J. and A. Fischer, ms.). 
Mothers should seek to discover the 
latent potentialities and should be at- 
tuned to every indication the child 
gives, should look to the school psychologist
and his battery of tests, and
to any other available information so as 
to know how to help the child to 
realize these potentialities. It is not 
surprising that the child-training techniques
employed by the Indian mothers
differ from those of the New England 
parents nor that the interest in and 
anxiety about child rearing is 
greater among the New England 
mothers. The Indian mothers showed 
little interest in the questions asked by 
the ethnographer and found it hard 
to believe that anyone would travel so 
far to do research about such a subject. 
They were laconic in their answers and 
had few rationalizations or justifications 
for their actions as compared to their 
New England counterparts, who were 
eager to discuss the subject and anxious 
for new insights and advice. 

In the field of learning mothers also 
held different beliefs. Indian mothers 
believe that their children learn from 
observation and that direct tuition or 
verbal communication is not necessary. 
New England mothers put great faith 
in the spoken word and in demonstrat- 
ing with constant verbal statements and 
explanations. Related to these belief 
systems is the greater use of reasoning 
and lecturing by the New England
mothers and the more frequent use of 
physical punishment and threats by the 
Indian mothers. 

Okinawan mothers make a sharp 
distinction in the nature of children 
according to age. They consider any 
child under five as “senseless” and in- 
capable of learning right from wrong. 
They excuse many of the transgressions 
of such “senseless ones.” Once the child
is sent off to school, however, and becomes
a first grader, he is expected to
learn to behave and is constantly told
that he now is capable of “knowing
better” (T. and H. Maretzki, ms.).
This belief in the changing nature 
of a child with maturation is shared 
by parents in many societies. It is 
paralleled in our society by those who 
believe that the superego is not developed
until around 6, when the
oedipal complex is resolved. 

It is sometimes the case that seem- 
ingly identical behavior on the part 
of parents is rationalized by different 
beliefs and justified by different values. 
In a study of child-training practices 
among three culturally divergent com- 
munities in the Southwest of the United 
States it was found that most nursing 
infants in the Texan and Mormon com- 
munities slept in the parental bed 
(Whiting et al., ms.). The Mormon 
mothers stated that they kept the in- 
fants in bed to keep them warm. The 
rationalizations and justifications of the 
Texan informants were more varied; 
some stated that they were afraid that 
the child would roll out of bed if he 
were alone; others that it was easier to 
nurse during the night. Crowded living 
conditions seemed, however, to account 
for the Texan sleeping patterns, since 
all families living in houses with three 
rooms or less took their infants into 
the parental bed, whereas three of the 
four families which had four rooms or 
more did not do so. Furthermore, 
several Texan informants reported dis- 
liking the lack of privacy. It appears 
that the Texan ideal would be for the 
infant to sleep alone. Mormon mothers
evidently have the value that it is good 
to sleep with the infant and do so even 
though they have adequate space for 
the baby to have his own crib. 

A similar contrast can be made be- 
tween physical punishment as admin- 
istered by the Rajput Indian mothers 
and the Mixtecan mothers of Juxtla- 
huaca, Mexico (Romney, ms.). In dis- 
cussing such punishment, the Rajput 
mother is apt to justify striking the 
child because of her own anger at 
misbehavior. The Mixtecan mother, on 
the other hand, more often justifies the 
punishment in terms of teaching the 
child how to behave. Furthermore, the 
Mixtecans believe that the emotion 
of anger causes illness. The Indian 
mothers, although claiming to disap- 
prove of the expression of anger, are 
constantly fighting among themselves 
and yelling at their children. The dif- 
ferences in beliefs, and the consequent 
difference in expressiveness, undoubt- 
edly explain to some degree the dif- 
ferences in the aggressive behavior of 
both Mixtecan and Rajput children and 
adults and suggest that isolated be- 
havior out of the context of the total 
custom complex may be less predictive 
of behavior. 


METHODOLOGY 


Since description of culture has been 
one of the major interests of anthro- 
pologists, they have developed methods 
for obtaining data that are somewhat 
different from those employed by 
other social scientists. Whereas the 
sociologist and social psychologist are 
concerned with describing the frequencies
with which certain types of
customary behavior are manifested in 
a given society and presenting them in 
terms of their distribution, the anthro- 
pologists, as indicated above, are con- 
cerned more with determining the 
typical and agreed-upon beliefs, values, 
and techniques. Since their realm has 
been the ethnographic description of 
people little known to the western 
world, they have been less concerned 
with variations among the individual 
members of the societies that they have 
studied. To say that middle-class 
American children are usually brought 
up by their own parents is not a par- 
ticularly interesting or dramatic state- 
ment. Social scientists who have studied 
our society take such a fact for granted 
and have concerned themselves with 
the effect of such matters as “broken 
homes.” To say, however, that an 
Ojibwa child is brought up by his 
grandmother from the time he is 
weaned and may live in a different 
village from his own parents’ from that 
time forth is of interest, even though, 
and this is reported to be the fact, this 
custom is not practiced by all or even 
a majority of Ojibwa families (Barn- 
ouw, 1950). That it is a custom, and 
that any Ojibwa family who practiced 
it would not be looked upon as deviant, 
is the sort of statement that is generally 
made by anthropologists and can be 
said to be descriptive of Ojibwa culture.

To obtain such data as this, two tech- 
niques have been developed that repre- 
sent the contributions of anthropology 
to the methodology of social science. 
These are the ethnographic interview 
and participant observation.

The ethnographic interview (Paul,
1953; Maccoby and Maccoby, 1954) 
differs from other interviews in a num- 
ber of respects, perhaps the most strik- 
ing of which is the status relationship 
between ethnographer and informant. 
A doctor interviews his patients, a 
social worker interviews clients, a 
sociologist or social psychologist in- 
terviews respondents. In each of these 
cases the interviewer has a status vis-à-vis
the person interviewed that implies
greater expert knowledge and more 
control of resources. The ethnographer 
is also acknowledged as an expert by 
his informant, but not with respect to 
the subject under discussion. He de- 
scribes himself as knowing about the 
culture of other peoples but as being 
ignorant of the culture being studied. 
He is careful to acknowledge the 
superior expert knowledge of his in- 
formant with respect to the subject 
under discussion. The status relation- 
ship is thus more nearly one of equality 
than that used in other social science 
interviews.* An ethnographer, for example,
having some knowledge of how
canoes are built the world over, may 
go to the most expert canoe-builder of 
the tribe he is studying and tell him, 
“I have come to you because of your 
expert knowledge and wish you to tell 
me how to make a canoe.” This is quite 
different from a doctor asking his 
patient to relate his symptoms or a 
pollster asking a respondent how he 
voted in the last election. 

* This idea was suggested to us by Professor
Cora DuBois in personal communication.

To ask an informant for cultural data
requires that the questions be put somewhat
differently from an interviewer
asking a respondent about his, that is,
the respondent’s, attitudes or opinions.
When the question, “What do you
think about such and such?” is asked of
an informant, the “you” is meant to refer
to the members of his society or
group. When the same question is
asked of a respondent, the “you” refers
to the respondent himself. An informant
replies to a question from an
ethnographer, “We bring up our children
this way,” whereas a respondent
says, “I bring up my children this
way.”

The syntax of the role relationship 
between ethnographer and informant 
requires that the ethnographer be a 
member of a different society from that 
of his informant. For example, if an 
American middle-class interviewer 
asked an American mother how Amer- 
ican middle-class mothers brought up 
their children, the informant would 
look at the interviewer in amazement 
and say, “You know as well as I do.” If
the same question had been asked her
by a Chinese anthropologist, or even 
by an interested Britisher, the question 
would make sense, and she would 
reply accordingly. It is quite under- 
standable then that the ethnographic in- 
terview was developed by anthro- 
pologists working in unknown cultures 
rather than by social scientists working 
within the framework of their own 
society. 

In addition to having the status of 
expert to expert, asking questions
phrased in terms of the generalized 
“you,” and being an outsider, the 
ethnographer often defines his status 
outside the society in which he is 
interviewing. It is understood that a
doctor, social worker, or pollster seeks
information, and these statuses are well 
known to the members of our society. 
Outside the western world, however, 
the professional seeker of information 
is not a familiar status. This fact is 
exemplified by an anthropologist who 
overheard two Australian aborigines 
talking about him as they sat around 
a campfire believing him to be already 
asleep. Said one to the other in Pidgin 
English: 


What name this fellow man? 

Me no savvy, I think he belong govern- 
ment? 

No got—he no belong government. I
think he belong mission. 

No got—he no belong mission. Him 
no talk all the same man belong mission. 
You think he belong business? 

No, me savvy! Him, he man belong 
nothing, that’s all. 

True too much, him, he man belong 
nothing. 


Not all anthropologists define them- 
selves as “belonging nothing,” but 
many find this a very useful status, 
which, like the stranger at The Cock- 
tail Party, elicits information that 
would not be given even to intimate 
friends. It is well known that many 
anthropologists are “adopted into the 
tribe” and given kinship status within 
it. This, however, is generally rec- 
ognized by both sides to be a pseudo- 
matter to be talked and joked about 
rather than to be taken as a serious 
set of mutual obligations. An anony- 
mous relationship with an informant, 
such as is described above, is difficult, 
if not impossible, to establish if one is 
working in one’s own society; and, 
again, it is understandable that this
should be uniquely characteristic of the
ethnographic interview. 

One final word should be said about 
the ethnographic interview. Although 
this characteristic is not unique, in this 
kind of interviewing an ethnographer 
generally works with an informant over 
a long period of time. He may often 
work with one informant daily for 
weeks or return to him frequently for 
long sessions over a period of years. It 
is not unusual for an anthropologist to 
discuss a topic with an informant for 
an afternoon, to spend the evening 
typing up and checking over his notes, 
and to return the next day to a discus- 
sion of the same topic with the same 
informant, filling in gaps and straight- 
ening out ambiguities that appeared to 
him during the course of working up 
his notes. As a corollary to this pro- 
cedure, the ethnographic interview is 
not standardized in the same manner 
that respondent interviews of sociolo- 
gists and social psychologists are. Such 
standardization as there is takes the 
form of detailed topical outlines, which 
have been developed over the years.
As ethnographic data have accumu- 
lated, the universal problems for which 
every culture has some solution have 
become apparent, and a list of these
topics or problems forms a guide to the
ethnographer as he works with an informant.
Notes and Queries (1951),
first published in 1888, by the Royal
Anthropological Institute
of Great Britain, is one of
the most widely used of such topical 
guides and is now in its sixth edition. 
The Outline of Cultural Materials 
(Murdock, 1950) is another useful 
general guide. In addition to this, there
are many field guides or outlines for
special topics. The Field Manual for
the Cross-Cultural Study of Socialization
(Whiting et al., 1953), a preliminary
mimeographed draft published
by the Social Science Research Council,
is an example of a specialized outline
relevant to the topic of this chapter.

Participant observation is the second
major method that is generally credited 
to the field of anthropology. Florence 
Kluckhohn (1940) discusses this 
method in detail in an article well 
worth reading. It is difficult to define 
this method precisely, and perhaps the 
best general description of it is that
the ethnographer participates in the 
daily life of the people in such a way 
that he learns the roles he is permitted 
to play by actually performing them 
and learns their reciprocals by actual 
interaction with members of the 
society. To use participant observation 
successfully as a technique, the ethnog- 
rapher must be very sensitive to the 
sanctions that are applied to him if
he makes an error or behaves improperly.
He must also be sure that his
status in the society is such that the
sanctions will be applied to him. An
experience that one of the authors had 
while working with Kwoma in New 
Guinea is an example (Whiting, 1941). 
These people wore no clothing, and 
the ethnographer found himself gazing 
at some of the fair maidens of the tribe. 
After about two months, he noticed 
that there seemed to be some rather 
rigid rules of etiquette requiring that 
men, when visiting a house, should 
sit on the edge of the porch facing 
outward; the women sat around the 
fire behind the men’s backs. Furthermore,
he noticed that if a man and
woman met on the path she would
turn aside and stand with her back to
the path until the man had passed. He
would then stop, and they would carry
on any conversation facing away from
one another. The ethnographer then
checked these observations with one of
his informants and verified them. He
asked why he had not been told about 
his impropriety; whereupon his in- 
formant replied, “Well, after all, you’re 
a white man and you don’t know any 
better.” 

The status of an ethnographer as a 
participant observer is similar to that 
of a child. He is putting himself in a 
position to be socialized: that is, to be 
instructed with respect to proper be- 
havior, to practice whatever roles are 
appropriate for him, to err, to be 
successful, to be rewarded and pun- 
ished, so that eventually he may live 
in the society as a full-fledged member. 
This final aim has seldom been ac- 
complished by professional anthropolo- 
gists, but, during the course of living 
in the field for eighteen months to 
two years, many have learned large 
segments of the culture of a society in 
this way. Insofar as he is successful, an 
ethnologist becomes not only a re- 
porter on the society but an informant. 
An anthropologist in the field does not 
rely solely upon participant observa- 
tion or informant interviewing but 
uses them both in conjunction to con- 
struct the “invisible pattern” that 
orders the behavior of the members of 
a society. Malinowski (1935), in an excellent
discussion of ethnographic field
methods, describes this process as follows:

The observer should not function as a
mere automaton; a sort of combined 
camera and phonographic or shorthand 
recorder of native statements. While 
making his observations the field-worker 
must constantly construct: he must place 
isolated data in relation to one another 
and study the manner in which they in- 
tegrate. To put it paradoxically one could 
say that “facts” do not exist in sociological 
any more than in physical reality; that is, 
they do not dwell in the spatial and 
temporal continuum open to the untutored 
eye. The principles of social organisation, 
of legal constitution, of economics and 
religion have to be constructed by the ob- 
server out of a multitude of manifestations 
of varying significance and relevance. It is
these invisible realities, only to be dis- 
covered by inductive computation, by 
selection and construction, which are sci- 
entifically important in the study of 
culture (p. 317). 


This method of combining partici- 
pant observation and informant inter- 
viewing gives the ethnographic inter- 
view a structure all its own. The topics 
to be discussed are often suggested by 
recent events rather than by some pre- 
determined plan. It has been argued 
that the lack of standardization of the 
ethnographic interview brings into 
question its scientific usefulness. Since 
the ethnographic interview is designed 
to get at materials at the cultural level 
rather than at the level of individual 
differences, the problem of stand- 
ardization is somewhat different. It is 
our opinion that internal consistency in 
material gathered from one or two in- 
formants over a period of months and 
then checked with observations of be- 
havior will add to the validity on a 
cultural level of material gathered by 
a carefully designed questionnaire 
given to a large number of subjects in 
a single interview. Such criticisms,
however, should not be taken lightly, 
for if the study of children is to become 
legitimately scientific the question of 
validity of reporting is clearly of prime 
importance. In the early days of anthro- 
pology, in particular, and unfortunately 
even today, we find such statements as 
the “Mbongo-Mbongo never punish 
their children.” All too frequently such 
statements have been refuted by later 
work and were, in all probability, based 
upon the fact that the parents seemed 
kind to their children and the reporter 
could not remember having observed 
any parent spanking her child. It is un- 
fortunate for the field of child develop- 
ment that it is only recently that child 
rearing has become accepted as a legiti- 
mate and necessary part of the descrip- 
tion of a culture; hence standardized 
methods are only now being developed 
for obtaining data on this aspect of life. 
With respect to the gathering of cul- 
tural materials on other aspects of life, 
anthropologists have developed cer- 
tain techniques of checking the validity 
of their data. Similar techniques may 
be borrowed and utilized in the study 
of child rearing. 

Kinship systems have for a long time 
been the most carefully described as- 
pects of the culture and of the cus- 
tomary behavior of primitive peoples. 
This has been in a large part attri- 
butable to the development of the so- 
called genealogical method, which was 
first described by Rivers (1914). The 
method consists of asking an informant 
to name and place on a genealogical 
tree all of his kinsmen. After he has 
named and identified them by sex, age, 
and blood or affinal relationship, then 
and then only does the ethnographer
ask his informant what kin term he 
uses to refer to and/or to address each 
of them and what behavior is expected 
or most frequently occurs in their
interaction. When an informant’s family
tree has been established, that is, when
all his blood relatives and affines have
been named, then the essential features 
of social structure can be explored. 


To which of these people must you 
show respect? 

Which of them must show respect to 
you?

Which do you help when they are in 
trouble? 

Which of them can you call on for help 
when you are in trouble? 

Which of them do you have the right 
to discipline? 

Who can discipline you? 

Which of these people must you avoid? 

Which of these people may you marry? 

Which are you sociable with? 


It can be seen that all questions rel- 
evant to the socialization process, inso- 
far as they affect kinsmen, can be 
concretely described by the use of this 
method, that is, the relatives who are
the caretakers, the disciplinarians, the 
educators, and the playmates. This 
method can be easily extended to cover 
nonkinsmen important in the socialization
process. One would ask an informant
to give the names of neighbors,
colleagues, friends, teachers, and then 
proceed to ask him the same questions 
concerning his interaction with them as 
those described for kin relationships. 
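
A minimal sketch of how answers collected by this extended method might be tabulated is given here in Python. The record layout, the question keys, and the sample persons are our own illustrative assumptions; they are not part of the genealogical method as Rivers described it.

```python
from dataclasses import dataclass, field

# Hypothetical question keys, paraphrased from the questions listed in the text.
QUESTIONS = (
    "respect_owed",      # must the informant show respect to this person?
    "respect_received",  # must this person show respect to the informant?
    "gives_help",
    "may_ask_help",
    "may_discipline",
    "disciplined_by",
    "must_avoid",
    "may_marry",
    "sociable_with",
)


@dataclass
class Person:
    """One person named by the informant, kinsman or not."""
    name: str
    relation: str          # e.g. "mother's brother", "neighbor", "teacher"
    is_kin: bool
    answers: dict = field(default_factory=dict)  # question key -> True/False


def record(person, question, answer):
    """Store the informant's yes/no answer about one named person."""
    if question not in QUESTIONS:
        raise ValueError(f"unknown question: {question}")
    person.answers[question] = bool(answer)


def relations_answering_yes(people, question):
    """List the relations for which the informant answered yes."""
    return sorted({p.relation for p in people if p.answers.get(question)})


# Invented informant data, for illustration only.
people = [
    Person("A", "mother's brother", is_kin=True),
    Person("B", "father", is_kin=True),
    Person("C", "teacher", is_kin=False),
]
record(people[0], "disciplined_by", True)
record(people[1], "disciplined_by", True)
record(people[2], "disciplined_by", False)
record(people[2], "respect_owed", True)

disciplinarians = relations_answering_yes(people, "disciplined_by")
```

Because every person is named before any question is asked, the same tabulation serves for kinsmen and nonkinsmen alike.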

It can be seen that this is essentially 
a reversal of the sociometric method 
as it is generally used. Instead of asking 
a child to name the best friend in his 
class, to name those whom he likes in 
his class, or who like him in the class,
the investigator using the genealogical 
method would ask his child informant 
first to name each member of his class 
and then proceed to question him con- 
cerning his relation to each person 
named. For example, if Joe is one of 
the persons named, he would ask these 
questions: 


Do you play with him? 
Do you fight with him? 
Do you boss him around? 


Do you avoid him? 


It can be seen from this description 
that the method is appropriate to arrive 
at both the symbolic attributes of a 
social system, which we have called 
culture, and to the typical interactions 
of individuals who occupy reciprocal 
kin relationships. In other words, this 
method has been used to arrive at both 
the cultural and behavioral aspects of 
the ethnographic domain. 

Since ethnographers have found it 
useful, when describing kinship and 
social structure, to start with naming 
names and collecting many instances of 
concrete relationships to arrive at the 
general features of the system rather 
than attempting to ask an informant 
about the general features directly, we 
feel that this method might well be 
most profitably employed in a study of 
the socialization process. 

The procedure of arriving at general 
concepts by induction from concrete 
instances has been used by anthro- 
pologists to describe other features of 
the ethnographic domain than that of 
social structure. Although extensive 
work has not been done in this area, the 
method can well be applied to determining
the meaning of emotional
concepts. Instead of asking an in- 
formant to define aggression, let us 
say, one might proceed in the following 
manner: ask an informant to give a list 
of native terms referring to emotions 
such as anger, hurt feelings, fear, shame, 
and happiness and then ask him to give 
concrete instances in which each of the 
native terms could be applied ap- 
propriately to the feeling that the in- 
formant had in these instances. In this 
way, an operational definition of the 
emotional categories recognized and 
distinguished in any culture can be de- 
rived. It may well be that the distinc- 
tions that seem self-evident to us are 
not the same in other societies. 
Although anthropologists have gen- 
erally not been self-conscious with 
respect to the problem of sampling, 
they have been quite consistent in their 
procedures. In the first place, they have 
chosen a group for intensive study, 
which consists of a relatively small 
number of people, generally less than 
50 or 60 families living in a hamlet, 
band, or barrio. A complete census is 
generally taken of this group and a 
complete genealogical chart of their 
kin relationships is drawn up. If the 
group is sedentary, a map and village 
plan is made to show the range of 
fields, buildings, paths, and public 
places. Since it is ethnographic protocol 
to stay in the field for at least a year, 
the field worker generally knows every 
member of the group by name and 
has talked with most of them. Although 
he may have used the majority of adults 
as informants on one or another aspect 
of the culture, he generally works in- 
tensively with but a few of them. Some 
anthropologists have obtained a great
majority of their information from a 
single informant, but more generally 
an anthropologist works extensively 
with five or six informants, using them 
to check one another. Although anthro- 
pologists often report their findings to 
be representative of a larger society 
than the village, band, or barrio they 
have intensively studied, it is our opin- 
ion that their reports should more prop- 
erly be referred to the smaller group, 
and with respect to such a group the 
informal methods of sampling used are 
generally quite adequate (Whiting et 
al., 1953). 

Living as an anthropologist does with 
a small group for a year or more 
provides a validity check that is often 
not recognized. If two members of the 
society are interacting in the presence 
of on-lookers and this audience pays 
little or no attention to the interaction, 
it can be taken as being within the 
bounds of custom. From the cultural 
point of view, there are as many af- 
firmations of the customary nature of 
the event as there are people present. 
Any anthropologist who has been in 
the field can report the strong and 
immediate audience reaction to events 
that are outside the permitted range of 
custom (Whiting, 1941). 

As we suggested in our discussion of 
the complex nature of culture, an 
ethnography must be more than simply 
a record of events. These events must 
be classified and their structural and 
functional relationships explored and 
presented. It is our view that the most 
useful ethnographic reports contain 
both concrete observations and inter- 
pretations. A good example of this is 
Margaret Mead’s reports on the
Arapesh, which range from her cir- 
cumstantial “Diary of Events in Alitoa,” 
published in the American Museum 
monograph series (1947), to her
interpretation of the Arapesh in Sex
and Temperament in Three Primitive
Societies (1935). Anyone writing an
ethnography, whether he wishes to be
mainly circumstantial or largely inter- 
pretive, should in any case be sure 
to make this clear. Nothing is more 
annoying, when abstracting ethno- 
graphic data, than to read a para- 
graph that might contain either the 
carefully checked opinions of a number 
of native informants or the ethnog- 
rapher’s unchecked opinion arrived at 
after he had left the field. We do not 
wish to imply by this that such post 
hoc interpretations are invalid or 
useless. On the contrary, we feel it is 
the responsibility of any good cultural 
reporter to make such interpretations. 
The wisdom of the historian who distinguishes
history from historiography
and interpretation from documentation 
may well be borrowed here. 

Before concluding our discussion of 
method, mention should be made of 
the phenomenon of culture shock. No 
matter how objective an anthropologist 
feels himself to be as a consequence of 
his graduate training or previous field 
experience and no matter how much he 
knows in advance about the culture he 
is studying, there comes a time, gen- 
erally in the first few weeks, when some 
event or events produce a violent emo- 
tional reaction in him. This reaction is 
difficult to describe and probably 
differs from one person to another, 
but it is ordinarily a combination of 
disgust, apprehension, or anger and a
sense of guilt for having such feelings. 

Obviously, if one is to be a successful
field worker, this sense of shock cannot
persist, and anthropologists have
adopted different methods of defending
themselves against it. The relation between
a field worker’s strongly held
personal values and those represented
in the culture he is studying is a matter
of great methodological importance,
which, unfortunately, has never been
systematically studied. Informal and
unsystematic reports by ethnographers
indicate that many different types of
defense may be employed. Rationalization,
repression, avoidance of disturbing
parts of the culture, and projection
have probably all been employed. A 
systematic attempt to keep complete 
personal diaries, especially during the 
first months of field work, and a care- 
ful study of a number of such docu- 
ments would be invaluable. 

Not only does the type of defense 
vary, but also the aspect of the culture
that provides the shock. One field
worker may be disturbed by dirt and 
disease, another by violence, still an- 
other by a feeling of being excluded, 
and yet another by lack of privacy. 
That these defenses and shock areas 
influence the perception and the inter- 
pretation of certain custom complexes 
is not surprising. 


VARIATIONS 
IN CHILD-REARING PRACTICES 


The many studies of child life and 
child rearing that have been made by 
anthropologists in societies all over the 
world represent perhaps the most valuable
contribution of anthropology.
These studies describe a range of be- 
liefs, values, and behavior not found 
in our society. The comparison of 
these accounts suggests variables that 
are not obvious to students who confine 
their work to Western European 
societies. 

Although there were some early 
ethnographic accounts that gave a 
creditable description of child life, for 
example, Junod (1927), the first mono- 
graph to focus on child life was Mar- 
garet Mead’s study of preadolescents in 
Samoa (1927). She, and those anthro- 
pologists who became interested in the 
field of culture and personality, pro- 
duced some of the best early descrip- 
tions. In the last twenty years there has 
been a steady increase in the number 
of accounts of child life. As in the other 
branches of anthropology, each good 
monograph increases the range of 
topics and areas covered by subsequent 
field workers. 

If children are studied within the 
confines of a single culture, many 
events are taken as natural, obvious, 
or a part of human nature and are 
therefore not reported and not con- 
sidered as variables. It is only when it is 
discovered that other peoples do not 
follow these practices that have been 
attributed to human nature that they 
are adopted as legitimate variables. In
addition, even when individual varia- 
tion among parents within western 
society suggests the presence of an im- 
portant variable, the range of variation 
is often very small in contrast with its 
range in the societies of the world at 
large. 

An example of such a variable is the 
age at which a child is weaned. With
our clocks and calendars, it seems ob- 
vious that this is a meaningful transcul- 
tural variable, but it turns out not to 
be so simple as it seems. A cross-cultural 
study of weaning now in progress at 
the Laboratory of Human Develop- 
ment, entitled The Meaning of Wean- 
ing, indicates that western culture is 
relatively unusual in determining the 
age for weaning by the calendar. If the 
age of the child is a factor, this is 
measured in other areas of the world 
by maturational events rather than by 
months. Of 106 societies in the sample, 
30 of them used a maturational event as 
a basis for weaning. Five of these 30 
weaned when the child began to cut 
his teeth, and the other 25 were dis- 
tributed among such maturational 
events as crawling, toddling, walking, 
or talking. Neither the age nor the 
maturity of the child, however, pro- 
vides the occasion for weaning in most 
societies. It is determined, rather, by 
social events, such as the mother’s 
pregnancy or by the birth of a younger 
sibling. Thirty-three of our sample of 
societies weaned the child sometime 
during the mother’s pregnancy and 23 
at the time of the birth of a sibling. If 
societies wean their children on the 
basis of maturational age, a reasonable 
estimate in terms of months can be 
established. If, however, pregnancy or 
the birth of the next sibling is the occa- 
sion for weaning, the conversion is 
more difficult. Even with no conscious 
policy of spacing children, imple- 
mented by abstinence or contraception, 
the time following childbirth in which 
a woman is able to conceive is ap- 
parently quite varied. There is some 
evidence that prolactin, the hormone
related to milk production, is, to some 
extent at least, antagonistic to the 
production of sex hormones, and cross- 
cultural evidence suggests that in 
societies without a post-partum sex 
tabu or contraception the average spacing
of children is about two years, or
approximately nine months after the 
child begins to get most of his suste- 
nance from supplementary foods rather 
than from his mother’s milk. This
interpretation needs much more careful
checking before it is taken as established
in any sense. Even if it were
true, however, there is much greater
variation in the time of the mother’s 
next pregnancy than there is in the 
child’s cutting his teeth or learning to 
crawl. Those societies in which chil- 
dren are weaned at the mother’s next 
pregnancy should have a much greater 
variance in the age of weaning than 
those who wean by some maturational 
event, even though the average age may 
be the same. 
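
The argument can be illustrated with a small worked example in Python. The ages below are invented solely to show two distributions with the same mean and different variance; they are not data from the weaning study.

```python
from statistics import mean, pvariance

# Hypothetical ages at weaning, in months (invented for illustration only).
# Weaning tied to a maturational event such as walking: ages cluster tightly.
maturational = [22, 23, 24, 24, 25, 26]
# Weaning tied to the mother's next pregnancy: ages are widely spread.
pregnancy_timed = [12, 18, 24, 24, 30, 36]

mean_maturational = mean(maturational)
mean_pregnancy = mean(pregnancy_timed)

# Population variance: average squared deviation from the mean.
var_maturational = pvariance(maturational)
var_pregnancy = pvariance(pregnancy_timed)

print("mean age:", mean_maturational, "vs", mean_pregnancy)
print("variance:", round(var_maturational, 2), "vs", round(var_pregnancy, 2))
```

In this invented case the two groups share a mean age, yet the pregnancy-timed group has much the larger variance, so a report of the average age alone would conceal the difference between them.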

Those societies that have a long post- 
partum sex tabu, and nearly half of the 
societies in the world have such a tabu 
lasting for a year or more, generally 
wean their children at the moment the 
post-partum sex tabu terminates. This
conjunction is a nice illustration of the 
effect of beliefs upon customary be- 
havior. In many of these societies, par- 
ticularly those situated in the tropics, 
there is a belief that sexual intercourse 
sours a mother’s milk, and she therefore 
abstains in order to insure the health 
of her nursing infant. As soon as the 
child is weaned, however, this reason 
no longer obtains, and sexual intercourse
is resumed.

Thus the age of weaning, looked at
cross-culturally, may depend upon
various beliefs and upon either maturational
or social events. The range of
variation within any society and be- 
tween societies is also much greater 
than could be imagined if one studied 
it within Western European society 
alone. The effect of this extended range 
may have unexpected consequences, 
such as the curvilinear relationship with 
the emotional disturbance of the child 
that was pointed out by Whiting 
A child, for example, is most
disturbed if he is weaned around 18
months. To be weaned earlier or later
than this is progressively less disturbing.

It is impossible in the space of this 
chapter to present all the variables
reported by anthropologists that might 
be relevant and important to the study 
of child rearing. We would like to 
select those that at present seem to be 
the most neglected because of the 
nature of Western European society. It 
is our belief that they suggest variables 
for isolation and study that will be use- 
ful in the understanding of human be- 
havior. These variables stem primarily 
from a comparative analysis of social 
structure.


HOUSEHOLD COMPOSITION


Although there is some variation in 
household arrangement in our society, 
by far the majority of children are 
brought up in a home that consists 
solely of a father and mother and un- 
married sons and daughters—the so- 
called nuclear family household. This 
arrangement of people under a single 
roof is by no means universal. A recent
examination of households of a sample 
of 565 societies, representing a sample 
of world cultures (Murdock, 1958), 
indicates that only slightly more than a 
quarter of them have this arrangement. 
Various forms of extended family 
households, that is, in which either the 
married sons or the married daughters 
remain living in the house of their 
parents, are commoner, and the mother-child
household in which the father
sleeps either in a hut or at the men’s 
clubhouse is nearly as common. The 
following table indicates the distribu- 
tion of household arrangements in this 
world sample. 


Extended        161*
Nuclear         141
Mother-child    123
Polygynous       89
Communal         11


*This includes large extended families 
with three generations of nuclear families, 
lineal households with two generations, 
and stem family households, in which only 
the oldest or youngest married son or 
daughter lives in the household. 
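
As a check on the statement that slightly more than a quarter of the sample have nuclear households, the proportions can be computed from the table. The sketch below, in Python, assumes that the entry for communal households reads 11 and takes the sum of the five tabled counts as the denominator. That sum is 525, not the 565 societies cited for the sample, and we do not attempt to account for the difference here.

```python
# Counts as given in the table; the Communal entry is read as 11 (assumption).
counts = {
    "Extended": 161,
    "Nuclear": 141,
    "Mother-child": 123,
    "Polygynous": 89,
    "Communal": 11,
}

# Denominator: the sum of the tabled counts, not the cited sample size.
table_total = sum(counts.values())

# Share of each household type among the tabled societies.
shares = {name: n / table_total for name, n in counts.items()}

for name, share in shares.items():
    print(f"{name:<13}{counts[name]:>4}{share:>8.1%}")
```

On this denominator the nuclear household accounts for a little under 27 per cent of the tabled societies, which agrees with the wording of the text.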


The infant’s world and its effect on 
socialization should be profoundly different
for households with such different
groupings. A Siriono child
growing up in a communal household 
in which a whole band of 20 or 30 
families sleep (Holmberg, 1946) has 
quite a different experience from a 
Gusii child who grows up in a hut with 
only his mother and his sisters and 
brothers under 7 (LeVine, ms.), or, 
again, the Zuni child who, in addi- 
tion to his mother and father and 
siblings, shares a roof with his mother’s 
parents and her married sisters and their 
children (Roberts, 1956). Since he calls
his maternal aunts by the same term as 
he calls his mother and since they share 
with the real mother the responsibility 
of care and discipline, it is as though 
a Zuni child were brought up by many 
mothers. 

Perhaps the most curious arrange- 
ment in the eyes of one brought up in 
our society is the polygynous house- 
hold. In its commonest form, this con- 
sists of two sisters married to the same 
man and living in the same house. Al- 
though the rivalries that we expect 
from our vantage point are apparently 
held in check to make such an arrange- 
ment viable, there is evidence that 
hostility among co-wives is not natu- 
rally absent and is often projected in 
the form of a strong belief in witchcraft.

The importance of household com- 
position is more obvious when we con- 
sider that it defines both the number of 
people with whom the child interacts 
and the physical nature of the setting 
in which he is socialized. Status rela- 
tionships among members of the house- 
hold and their roles in socialization are 
important variables for child rearing 
for two quite different reasons. In the 
first place, the persons with whom a 
child has intimate contact and who con- 
trol resources for him are the most 
important models for identification 
(Whiting, 19592). Thus in a mother- 
child household the mother is the sole 
model for identification and the sole 
object of cathexis. The problem of 
cross-sex identification for a male 
brought up in this kind of household 
is great, and these societies find it necessary
to develop techniques for changing
the boys’ object of identification
before adulthood (Whiting et al., 1958).
The frequency of mother-child households
in our own society, particularly
in lower-class groups, in which divorce
and desertion are very frequent, is
just beginning to be recognized and its
effects are being studied. Preliminary
findings indicate that here also there
are problems of cross-sex identification
(Miller, 1958; Rohrer and Edmonson,
1960).

The influence of status
in the household on one’s role in
socialization is exemplified by a cross-cultural
study of the function of grandmothers
in child rearing (Apple, 1956).
A grandmother who is titular head of 
a household is likely to function as a 
disciplinarian, whereas one with less 
status is more likely to function only 
in a nurturant caretaking role. The 
locus of responsibility for caretaking 
and the relative authority within the 
household is a variable which child 
psychologists have recognized. The
variations in patterns, however, are
greater in non-Western European
societies. In some of these, for example,
women are the primary caretakers and
disciplinarians but have low status in the
society as a whole, and they may have
difficulty in controlling their male
children, especially in latency and
preadolescence. This is a problem for
the Rajput mothers (Triandis and 
Hitchcock, ms.). 

In three communities in the south- 
western United States differences in 
authority patterns have been shown to 
have an interesting effect (Whiting et 
al., ms.). In one group of Mormon
origin the father is clearly in authority.
In a second group of migrants
from Texas matters of import are dis- 
cussed between father and mother, 
and thus authority can be said to 
be shared. In the third group, the 
Zuñi Indians, the authority in the 
household is vested in the grandmother. 
When preadolescent children in these 
three groups were asked, “If a magic 
man could change you into a mother, a 
father, a brother, or a sister, which 
would you choose to be?” most chil- 
dren chose to be a relative of the same 
sex as themselves. Several Mormon 
girls, however, chose to be a brother, 
and several Zuñi boys chose to be a 
sister. No cross-sex choices were made 
by Texan children. Thus the authority 
structure of the household seems to in- 
fluence sex preference. 


SLEEPING ARRANGEMENTS 


Another important aspect of the 
household is the pattern of sleeping 
arrangements during infancy. We are
not referring here so much to the 
recent interest in methods of swaddling 
that has been held by some to be a 
major determinant of national character 
(Gorer, 1949; Mead, 1954) but rather 
to the question of who shares the bed 
and bedroom with a child from birth 
until he is grown up. A recent cross- 
cultural study (Whiting et al., 1958) 
indicates that in a great majority of 
societies an infant sleeps in the same 
bed with his mother during the time 
that he is nursing. It is only rarely (in 
less than 10 per cent of the cases) 
that he has a crib or cradle of his 
own, and only in Western European 
societies, notably middle-class America, 
does an infant ever have a bedroom of
his own. In approximately one half of 
the societies under consideration the 
father also shares the bed with mother 
and infant. Sleeping arrangements for 
older children also vary. They may 
continue to share a bed or sleeping 
platform with their mother, with both 
parents, or move to one of their own. 
Boys and girls are generally segregated
at some time before adolescence,
but the age at which this occurs varies 
considerably from one society to 
another. This segregation sometimes 
takes the extreme form of a child not 
being permitted to sleep in the same 
house as his mother or sisters after 
he reaches a certain age, and sometimes 
this change of residence may involve 
moving to a separate village, in
matrilineal societies often to the house 
of his mother’s brother. 

Here again the persons with whom a 
child shares the intimacy of a bed seem
to have a profound effect on the de- 
velopment of his personality. It is our 
view that the nature and resolution of 
the oedipus complex is determined by 
social arrangements at night rather 
than during the day. It is then that 
intimate and intensive interpersonal 
feelings within the family are expressed 
rather than during the casual, busy day 
when adults are concerned with mak- 
ing a living. 

The number of people who share a 
household and the nature of the spatial 
arrangements also have a demonstrable 
effect on the timing and techniques of 
socialization and on the values that are 
felt to be most important to transmit 
to the child. For example, weaning and 
independence training are earliest in 
nuclear family households, apparently
because the father and child are compet- 
ing for the nurturance of a single 
woman, who, unlike the woman in the 
extended or polygynous family, has no 
other woman to help her. Similarly, 
when there are many hands to do the 
work, children’s help will not be re- 
quired so early. The economic role 
of both men and women is a variable, 
which, in conjunction with household 
arrangements, seems to predict not only 
the age of independence training but 
the age of responsibility training. 
Among the Rajput mothers of northern 
India (Triandis and Hitchcock, ms.), 
where a married woman often lives with 
her mother-in-law and sisters-in-law in 
an enclosed courtyard and seldom leaves 
it and where there are lower caste 
servants, independence and responsi- 
bility training are late; young girls do 
little housework, their mothers dis- 
couraging rather than encouraging 
their participation in food preparation, 
cooking, and other tasks. The situation 
is similar in other societies with large 
extended households when there are 
many hands to do the work. In Okinawa 
(Maretzki, ms.), on the other hand, 
where the women usually live with 
their husbands and perhaps their 
mothers-in-law and do a large part of 
the agriculture and help in the lumber- 
ing, both boys and girls are expected to 
baby sit and do household chores at an 
early age; similarly, in a Texan home- 
stead group, in which the mother in a 
nuclear family household helps with 
the farming, children are expected by 
the age of 6 or 7 to do regular tasks 
(Whiting et al., ms.). 

Similarly, household arrangements influence
the age and severity of aggression
training and the techniques parents
employ in training for control. Chil- 
dren’s aggression is most severely pro- 
hibited in societies with extended 
family households (Murdock and
Whiting, 1951). Apparently, hostilities
must be controlled when so many
people are living under one roof; and
it is disruptive if the mother is expressive
in her efforts to teach the child
control. More indirect methods are
used in the extended household. A Zuñi
mother will ignore an aggressive child
or threaten him with a supernatural
agent, the “scare kachina,” a god
impersonated by a Zuñi man, rather
than yell at him or spank him (Whiting
et al., ms.). In contrast, the use of
physical punishment occurs most frequently
in societies with mother-child
households, in which the mother ap- 
parently can “blow her stack” without 
disturbing anyone else. 

Settlement patterns, or the cluster- 
ing of households, also have an effect on 
child rearing. Once a child can get 
about, the nature and composition of 
play groups will be determined not 
only by the spatial pattern of house- 
holds but by the kinship and social 
structure bonds relating the members 
of the settlement. A child in a village 
with closely packed houses, with many 
children of his own age nearby, will 
lead quite a different life from the child 
raised in a household distant from 
others or a child raised in a small 
nomadic band. The isolated child is 
likely to spend much more time alone 
or with adults than with his age mates. 
It is not surprising that self-reliance 
tends to be more emphasized in his case, 
whereas responsibility to the group is
more highly valued in the village life. 

In closely settled areas, however, the 
social life of the child may be limited 
by other social-structure variables. If 
a society has a caste or class subgroup 
living in close proximity, the rules per- 
taining to adult social participation may 
be reflected in the composition of the 
play groups. Thus among the Rajputs 
(Triandis and Hitchcock, ms.) children
are discouraged from playing with 
members of lower castes and are allowed 
to treat these children in domineering 
and aggressive ways not approved of 
within their own caste group. The play 
groups tend to follow lineage ties; fam- 
ilies are related through the males and 
often live in adjacent houses. 

Similarly, social classes and ethnic 
and religious groups in our own society 
influence play groups. In some cases 
younger children from divergent 
groups intermingle, but conflicts be- 
tween the beliefs, values, and child- 
rearing practices of the groups often 
lead to tension and conflict. These cul- 
tural differences sometimes result in the 
kind of clique structure found in urban 
public schools. 

When subgroups are important in 
the social structure, ostracism from 
one’s group becomes a serious matter, 
and parents often use the threat of 
abandonment by this group to control 
their children’s behavior. Thus a Rajput 
mother will reprimand her child by 
calling him a “leatherworker” or a mem- 
ber of the lowly sweeper caste, im- 
plying ostracism from his own group; 
or an Okinawan will chide her child 
by calling him a “kindergarten child,” 
implying loss of status from the age 
group to which he rightfully belongs
and with whom he spends the majority 
of his waking hours; or an American 
mother will call her child “cheap,” 
implying behavior of a lower social 
class. Among the Zuñi, where the 
pueblo is an isolated subgroup in the 
American Southwest, parents will ad- 
vise their children to behave well so 
that “They,” i.e., the Zuñi, will help
them (Whiting et al., ms.). 

One’s position in the hierarchy of 
castes or classes may have profound 
effects on child rearing and personality, 
as has been suggested by studies of 
Negroes in our society by Kardiner 
(1951). A similar effect is seen in the 
personality of a Mixtecan living in a 
minority subgroup in a barrio of the 
Mexican town of Juxtlahuaca (Rom- 
ney, ms.). 

In summary, the anthropologist’s 
analysis of household arrangements, 
residence patterns, and larger settle- 
ment patterns with associated status 
and kin relationships has isolated vari- 
ables that are important to the study 
of child rearing. One of the authors 
(Whiting, 1959a) recently discovered
a convincing example of the impor- 
tance of these variables in an attempt 
to predict the occurrence of the prac- 
tice of a patient accepting blame for 
his illness, a measure developed by 
Whiting and Child (1953) as an index 
of guilt. It was shown in Whiting and 
Child that early socialization predicted 
high guilt. When this factor is com- 
bined with monogamous family struc- 
ture, which also predicts high guilt, the 
association is much stronger than with 
either factor alone. 

As already suggested, intimately associated
with social structure are the environment
and economy of a society.
Analysis of variables along these di- 
mensions has also proved profitable. In 
environments, for example, in which 
the habitat and economy make nomadic
life a practical adjustment, the women 
usually carry the gear while the men 
march ahead with their weapons in 
hand, both for defense against enemies 
and for hunting. Families traveling in 
this manner with few beasts of burden 
or conveyances cannot afford to have 
two children who cannot walk. Chil- 
dren are, therefore, spaced by means 
of a post-partum sex tabu, which lasts 
from three to four years; and to save 
carrying extra food for the young 
child the nursing period is prolonged. 
It is interesting to note that a similar 
pattern of prolonged dependency also 
occurs in the tropics. Here, however, 
the long post-partum sex tabu and the 
prolonged nursing period seem to be 
associated with infant diseases, which 
are apparently held in check by pro- 
longed nursing and caretaking. 

Another example of the interde-
pendence of environmental and social
structure variables is the relationship 
between temperature, household type, 
and child-training practices. Since ex- 
tended family households are, economi- 
cally speaking, the cheapest per capita 
way to provide heat and shelter for 
the adults of a society (there is one 
house for five or more adults as against 
one house for two adults in the nuclear 
family household), they are found more 
often in cold climates, especially where 
there is a lack of building materials. 
We have discussed the child-training 
practices associated with extended 
families. In temperate zones, where the
people live a settled life in nuclear 
family households, weaning and inde- 
pendence training are early. It is in- 
teresting to note McClelland’s findings 
(McClelland and Friedman, 1952) that
early independence training is associ- 
ated with high need achievement. 

The activities of children are also 
influenced by the natural environment. 
Climate, of course, will determine how 
much time a child will spend out of 
doors. The presence of dangerous fauna 
or natural phenomena may limit the 
extent of freedom. If the climate is 
favorable and there are few dangers in 
the environment and if wild foods may 
be gathered and eaten, the range of 
territory over which children are al- 
lowed to wander may be exceedingly 
great—as reported by Margaret Mead 
(1927) in her study of Samoa. In New 
England man-made dangers, such as 
automobiles and thoroughfares, and 
concepts, such as private property, 
limit the freedom of children. The 
danger of the environment may be su- 
pernatural as well as real. Thus among 
the Ilocano-speaking Tarongans of
Northern Luzon (Nydegger, ms.) it is 
believed that the environment, though 
realistically not particularly threaten- 
ing, is peopled by spirits who must be 
handled with caution. Children are 
afraid to wander away from the se- 
curity of their hamlets, whose mem- 
bers are usually related families. The 
Tarongans also fear the supernatural 
powers of people from other hamlets, 
and the children learn to distrust and 
fear all strangers. Such projected fears 
may be related to the type of socializa- 
tion of sex and aggression and social 
structure variables (Whiting, 1959b).

Whether the determinants are the 
result of the settlement pattern, eco- 
nomics, or environment, the play set- 
ting available to children will have ef- 
fects on their socialization and behav- 
ior. Of particular importance is the 
length of time spent in activities that 
are not directly supervised by adults. 
The Mixtecan boy and girl (Romney, 
ms.), who are encouraged, persuaded, 
and forced to stay in the family patio 
until the age of 7 or 8 and play with 
their brothers, sisters, or cousins within 
earshot of the mother or some other 
relative (lest they get into fights with 
other children), are in striking contrast 
to their Okinawan counterparts (Ma-
retzki, ms.) and Gusii (LeVine, ms.)
age mates. Okinawan children, who 
have been members of large play 
groups since the age of 3, play without 
adult supervision, and the Gusii have 
been out herding the cattle and play- 
ing with infrequent supervision. 

Recent studies of the influence of 
setting on children’s behavior (Barker 
and Wright, 1954) would indicate that 
analysis of the consequences of the 
amount of time spent in different set- 
tings might prove valuable to students 
of child development. It may be, for 
example, that the kinds of self-control 
that children learn are related to these 
settings. Children who are constantly 
under adult supervision may have less 
practice in self-control and may be 
more responsive to direct threats of 
injury and abandonment. Children who 
spend the majority of their time with 
peers may also be more responsive to 
socialization by age mates. 

In addition to and related to the so-
cial-structure, economic, and environ-
mental variables, which have been sug-
gested by comparative studies of child- 
rearing practices in nonwestern socie- 
ties, there are variables relevant to the 
timing of socialization, to variations in 
the abruptness of the changes in the 
demands made of a child, to variations 
in the behavior expected of different 
sex and age groups,* and to variations
in the techniques employed by social- 
izing agents. Many of these variables 
have been suggested in the foregoing 
discussion. Again, the range of varia- 
tion suggests new parameters for ana- 
lyzing child rearing. Time and space 
prohibit the discussion of all these 
variables. We would like in conclusion
to touch briefly on two aspects of so-
cialization techniques that are not char-
acteristic of our society.

*See in particular the cross-cultural study
by Barry, Bacon, and Child (1957), who rated
110 societies on the age and sex differences in
the demands made in regard to nurturance,
obedience, responsibility, self-reliance, and
achievement. They found that although boys
and girls were generally treated the same
in infancy there was a general tendency
toward higher demands for nurturance, obedi-
ence, and responsibility in girls and toward
self-reliance and achievement in boys. The
greatest sex difference occurred in those
societies whose economy required superior
strength and in those societies in which there
were large family groups with high coopera-
tive interaction.

One of the techniques of training of
particular interest frequently used in
nonwestern societies is the change of
setting and/or caretakers at a time
when there is a change in the demands
made upon a child. For example, a
technique of weaning a child from the
breast is to send him to the house of a
relative to stay until he is accustomed
to doing without his mother’s milk. 
Usually the child returns home when 
he has given up fussing and is moved 
to sleep with his father, brother, or 
sister or into a bed of his own. New 
caretakers may enter the life of the 
child at this time—siblings take over 
this responsibility for periods of time 
during the day. 

One of the most dramatic cases of 
change of setting and agent is that of 
the Ojibwa child who at weaning 
moves permanently to the home of her 
grandmother, who becomes the mother 
surrogate.

Changes of residence also occur in 
late childhood and early adolescence. 
This is more frequent among boys than 
girls. In some societies with matrilineal 
descent boys are sent to live with their 
maternal uncles, who take over the 
role of disciplinarian and teacher (Mal- 
inowski, 1927). In other societies the 
move is associated with the separation 
of the sexes. In Africa the boys may 
move into a house of their own (Le- 
Vine, ms.), or, more drastically, they 
may set up a village, returning to eat 
in rotation at the houses of their par- 
ents and eventually marrying and set- 
ting up housekeeping in their own set- 
tlement (Krige and Krige, 1943). A 
similar function is performed in Eng- 
land in the boarding school. 

A second interesting phenomenon
not found in European societies is the
change of status with maturation and
its formal celebration. Initiation
ceremonies are the most dramatic ex- 
ample of such “rites de passage” and 
generally include isolation from the 
opposite sex, hazing, instruction in new 
role behavior, and sometimes genital or
other body mutilation. It is interesting 
to note that those societies in which 
the child is closely associated with the 
mother during a prolonged infancy are 
the most likely to have such ceremonies
(Whiting et al., 1958). In many socie- 
ties marriage and the marriage cere- 
mony mark dramatic changes of status 
and role. 


Summary and Conclusions 


In conclusion, we suggest that stu- 
dents may find the concept of the cul- 
ture complex, with its analysis of 
values, beliefs, techniques, justifica- 
tions, and rationalizations, and the 
methods devised by anthropologists to 
get at these, helpful in the study of 
child development. We also suggest 
that the comparative study of child 
life in non-European societies indicates 
the presence of important variables 
that have been overlooked and that 
may prove very useful in the predic- 
tion of human behavior. 


The Measurement of Family Life


There is a variety of approaches to the
study of the family. Sociologists and 
anthropologists have often been con- 
cerned with the family as a social sys- 
tem or as an institution within a society. 
Recent years have seen an increased 
interest in the family as one type of 
small group with its own patterns of 
functional and structural characteristics. 
Other approaches, which are more psy- 
chological, treat the family as an inter- 
personal system of dyadic relations or 
focus on the personality and behavior 
of individual family members. 

Aside from the way in which the 
family is conceptualized, approaches 
differ along other dimensions. For ex- 
ample, some investigators have a pre- 
dominantly taxonomic and descriptive 
interest in the family, whereas others 
have primarily an analytic interest in
the dynamics of family process and
socialization. Still others are largely
interested in a diagnostic approach to 
the understanding of family pathology 
and health. 

In this chapter we shall restrict our 
focus to the study of various aspects 
of family life as they are related to the 
child’s behavior and development. Spe- 
cific techniques of measurement are dis- 
cussed in previous chapters. Our inter- 
est here is to illustrate the application 
of these methods in the study of link- 
ages between family-life and child-be- 
havior variables. The goals are two- 
fold: 


1. To present a schema for classify- 
ing concepts in family research and to 
illustrate how the concepts at each level
have been measured and used for link- 
ing family variables and child-behavior 
variables. 

2. To review some of the methodo- 
logical problems that seem to be unique 
or more focal to research on family life. 


A CAUSAL SEQUENCE SCHEMA 


As a schema for classifying research 
designs and procedures, we have chosen 
a sequence of causality that goes from 
the most general and causally distant 
variables and moves by steps through 
variables that are more specific and 
causally closer to the child’s personality 
and behavior. The schema we are using 
is as follows: 

Parental Background. This category 
includes such variables as the parent’s 
national origin, educational history, and 
early relations with own parents. 

Current Family Setting. Included 
here are such variables as culture, social 
class, occupation, temporal setting, ru-
ral-urban-suburban location, and physi-
cal characteristics of home and neigh- 
borhood. 

Family Composition. This includes 
family size, ordinal position of child, 
age and sex distribution of siblings, 
presence or absence of certain family 
members, and special status of family 
membership (e.g., adopted). 

Relationships between Parents. This 
includes such variables as power rela- 
tions, decision-making patterns, division 
of labor, the communication pattern, 
degree of consensus, and the affective 
relationship. 

Personal Characteristics of Parents. 
This cluster includes those psychologi- 
cal characteristics of parents not specifi- 
cally oriented toward the parental func- 
tion, such as general personality struc- 
ture, religious values, social attitudes, 
and philosophy of life. 

Child-Oriented Parental Attitudes. 
Included here are child-rearing goals, 
concepts about the parent role and child 
role, attitudes toward parenthood, atti- 
tudes about discipline techniques, and 
acceptance or rejection of the child. 

Overt Parental Behavior Patterns. 
Here we include child-rearing and 
training practices, influence and disci- 
pline techniques, responses to child be- 
haviors, changes or inconsistencies in 
behavior patterns, and types of conflicts 
with the child. 

Child’s Orientation toward Parents 
and Siblings. This includes a large set 
of variables, such as perceptions and 
evaluations of parent behavior patterns, 
attitudes toward parents and sibs, per- 
ception of parent feelings and attitudes 
toward the child himself, and percep-
tions of the relations existing between
parents and among all family members.
This level in the sequence already in-
volves the child’s response to the family
situation.

Overt Child Behavior toward Other 
Family Members. We include here 
categories of child response to others,
such as reaction to discipline, and spon-
taneously initiated actions, such as atten-
tion-demanding behavior, hostility, and
affection.

Personal Characteristics of the Child.
Here are included a variety of variables,
such as intelligence, physical character-
istics, ego structure, superego forma-
tion, conception and evaluation of the
self, emotionality, and basic need tend-
encies.

The Child Away from the Family. 
This focuses on the child’s attitudes and 
behaviors in areas other than the family, 
such as interaction with peers and the 
child’s behavior at school. 


THE USE OF THE 
CAUSAL-SEQUENCE SCHEMA 


In looking for studies to illustrate 
this schema, we have been interested in 
those that make some linkage between 
one or more levels of family variables 
and one or more levels of child vari- 
ables—especially when the attempt is 
made to clarify the processes interven- 
ing between these two levels. We have 
felt free, however, to use studies that 
stop short of making the bridge to child 
variables, when this seemed necessary to 
illustrate the measurement of certain 
important concepts. 


INTERVENING PROCESSES 
OR LINKING MECHANISMS 


Almost any study that attempts em-
pirically to test a relationship between
family variables and child variables is
potentially interesting. It seems to us,
however, that the major need is for
studies that use some conceptual model 
for the intervening processes and at- 
tempt to clarify and test the model by 
appropriate measurement operations. 

A wide variety of theoretical models
has been used, either explicitly or im- 
plicitly, to clarify the intervening proc- 
esses between the family-life situation 
and the development of the child. There 
are cognitive learning models that stress 
the importance of the stimulus patterns 
available for the child to learn; rein- 
forcement learning models that empha-
size the effects of patterns and intensi-
ties of rewards and punishments; inter- 
personal relationship models that em- 
phasize one or another aspect of the 
emotional dynamics of the relationship 
as the channel of influence. For exam- 
ple, a particular parent-child relation- 
ship may be seen as leading the child to 
act out with others the emotional frus- 
trations experienced in the relationship, 
or it may be seen as creating in him the 
need to replicate the same relationship 
with others. Another linkage process 
model often used is that of the child’s 
identification with one or both parents, 
along with notions about the nature of 
the internalization processes involved. 

Some investigators have focused on 
pathologies or discontinuities in the so- 
cialization process. They point, for 
example, to the critical phenomenon of 
early mother-child separation or to af-
fective deprivation resulting from par-
ent rejection. Others have focused on
variables that are more sociological, 
such as environmental deprivation with 
respect to learning experiences or sub- 
cultural stimulus patterns that promote 
directions of socialization counter to 
those promoted and approved by the 
larger society. 

There is no paucity of theories to ex-
plain the effects of the family on the
child, but there is a paucity of empiri- 
cal research connected with these theo- 
ries. Perhaps one of the most frequent 
problems with existing research is the 
size of the “causal jump” in the sequen- 
tial framework. Too broad a jump, 
without operationalizing the interven- 
ing steps, makes it difficult to infer 
whether any particular finding may be 
regarded as evidence for one or another 
linking process. The size and nature of 
the causal jump, along with the prob- 
lems in the measurement of variables 
necessary for all conceptual approaches, 
is an important focus of this chapter. 
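
The notion of a "causal jump" can be made concrete with a minimal sketch. It assumes, purely for illustration, that the levels of the schema are treated as a strictly ordered list and that the size of a jump is the number of steps between two levels. The names `SCHEMA_LEVELS` and `causal_jump` are hypothetical, and the counting rule is an assumption of this sketch, not a measure proposed in the chapter.

```python
# The eleven levels of the causal-sequence schema, ordered from the most
# causally distant variables to the child's behavior outside the family.
SCHEMA_LEVELS = [
    "Parental Background",
    "Current Family Setting",
    "Family Composition",
    "Relationships between Parents",
    "Personal Characteristics of Parents",
    "Child-Oriented Parental Attitudes",
    "Overt Parental Behavior Patterns",
    "Child's Orientation toward Parents and Siblings",
    "Overt Child Behavior toward Other Family Members",
    "Personal Characteristics of the Child",
    "The Child Away from the Family",
]


def causal_jump(predictor, outcome, measured=()):
    """Return (jump_size, unmeasured_levels) for a study design.

    jump_size is the number of steps from predictor to outcome in the
    ordered schema (an illustrative assumption, not a published metric).
    unmeasured_levels lists the intervening levels the study did not
    operationalize; `measured` names the intervening levels it did.
    """
    start = SCHEMA_LEVELS.index(predictor)
    end = SCHEMA_LEVELS.index(outcome)
    if start >= end:
        raise ValueError("predictor must precede outcome in the schema")
    measured_set = set(measured)
    between = SCHEMA_LEVELS[start + 1:end]
    unmeasured = [level for level in between if level not in measured_set]
    return end - start, unmeasured


# A study linking family setting directly to behavior outside the family
# leaves every intervening level open as an alternative explanation.
size, gaps = causal_jump("Current Family Setting",
                         "The Child Away from the Family")
print(size, len(gaps))
```

Under these assumptions, a design that also measures an intervening level (for example, passing `measured=["Overt Parental Behavior Patterns"]`) shortens the list of unmeasured steps, which is the sense in which intervening measurement narrows the range of linking processes a finding could support.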


Family Variables: Measurement 
and Linkage 


PARENTAL BACKGROUND 


Studies that relate parental back- 
ground to child variables need to stop 
on their way at some other point in 
the causal sequence. There is almost 
always an intervening variable or hy- 
pothetical construct that reconceptual- 
izes the parental background to a more 
genotypic and less “distant” level in 
relation to the child. For example, 
analytic writers (Erikson, 1946; Hell-
man, 1954) may discuss the relationship
between early experiences of the parent 
and the child’s psychological state, but 
these background factors are seen as 
operating through their effects on the 
parent’s personality, which in turn 
affects the parent’s interaction with the 
child. Or the parent’s early experience
may be seen as affecting his marital 
relationship, which in turn plays a role 
in the child’s personality development. 
Sears (1950) discusses the effects of a 
discrepancy between the parent's social- 
class background and current class 
status on the child’s dependency be- 
havior, the intervening variables in this 
case being sensitivity to approved meth- 
ods of child rearing and whether or 
not the child was fed on demand. 
Spiegel (1957) and Kluckhohn (1958) 
have studied the effects of the parent’s 
cultural background on the child, but 
again through other variables, such as 
current value conflict and family dis- 
equilibrium. Similarly, Baruch (1937) 
related the parents’ early home environ- 
ment to the child’s adjustment through 
the intervening variable of the marriage 
relationship. Several investigators have 
studied the influence of education on 
the parents’ child-rearing attitudes and 
behavior (Bouck, 1936; Elder, 1949; 
Staples and Smith, 1954; Schaefer and 
Bell, 1958; Burchinal, 1958a), which
in turn are presumed to affect the 
child’s development. 
Glueck and Glueck (1957) found a
relationship between the juvenile delin- 
quency of the mother and the juvenile 
delinquency of the son and suggest that 
the relationship takes place through the 
mother’s current instability, which is 
assumed to have carried through from 
her past. Miller and Swanson (1958)
use rural-urban background of parents 
and whether they were foreign or 
American-born as measures of current 
“entrepreneurial” or “bureaucratic” set- 
tings, which they believe have signifi- 
cance for child-rearing patterns. 

Thus the studies relating parental 
background variables to the child’s de- 
velopment have found it necessary to 
pass through a more current aspect of 
the family situation. Starting from the 
more remote level of parental back-
ground, however, does lend a certain
richness to family studies. More im-
portantly, it ties them to a wider con- 
text and suggests avenues by which 
historical change and idiosyncratic 
variations in family structure occur. 
Some researchers (e.g., Ackerman and
Sobel, 1950; Spiegel, 1957) feel that to 
understand the child as a functional part 
of the family group a study must take 
account of the history of that group, 
including the parent’s background, so 
as to gain the necessary understanding 
of the child’s current experiences in the 
family. 

The measurement of certain parent- 
background variables, such as education 
and place of birth, is relatively simple. 
Respondents can report these data with 
only minor precautions being taken by 
the researcher to assure their validity. 
However, personal report is also the 
method one must usually use to obtain 
data about the early interactions of the 
parents with their own parents. Radke 
(1946) asked mothers and fathers
to complete questionnaires about their
childhood home lives. Subjects were
given short declarative sentences such
as “As a child I had a voice in making
family plans and decisions.” Their
responses were made in terms of
frequency (“usually,” “sometimes,”
“rarely”) and, for some items, intensity
(“very much,” “somewhat,” “very
[...]”). The items in general dealt with
[...] and discipline patterns.
Few other measures have been de-
veloped specifically to obtain parents’
perceptions of their childhoods, but
there are some, such as the P.O.P.
measure (Schutz, 1958), designed to
elicit childhood data from adults, which
could obviously be used for this pur-
pose. Clinical interviews have also been
used, especially with mothers of chil- 
dren in treatment, but these are usually 
intensive studies of only a few cases. 
A notable exception is Baruch (1937, 
1944), who clinically interviewed a 
number of parents of nursery-school
children to obtain parental background 
and marital adjustment data. She found 
that the parents’, especially the
mothers’, childhood family relations
were related to both marital tension and the
adjustment of the child. 

Using recall data is usually undesir- 
able, particularly when the events oc- 
curred many years earlier and may be 
emotionally loaded. Simple forgetting, 
conscious distortion, and repression are 
likely to affect the reports. Unfortu- 
nately, about the only alternative to 
the method of recall is the longitudinal 
study that continues through at least 
two generations. The need for longi- 
tudinal studies is great, but they pose 
obvious practical difficulties. There are 
the difficulties of maintaining contact 
with the families for a long time, the 
continued improvement of measures 
with those used in the past generation 
often seeming obsolete in the present,
the unwillingness of many researchers
to commit themselves to a study of the 
necessary duration, and the inevitable 
organizational and personnel changes 
that also lessen the chances of research 
continuity. These difficulties have at 
times been surmounted. Certain organ-
izations have maintained contact with
families over one or more generations, 
e.g., the Fels Institute and the Merrill- 
Palmer School. When the research in- 
stitute takes responsibility for the longi- 
tudinal study, the researcher involved in 
the earlier phases can often use the data 
as part of an intact research project and 
the data can remain with the organiza- 
tion for later use. The values of longi- 
tudinal studies have become increasingly 
apparent, and the storehouses of data 
that they make available have many uses 
in addition to that of facilitating studies 
involving parental background. 


CURRENT FAMILY SETTING 


The family-setting variable that has 
probably been most frequently related 
to the child, directly and indirectly, is 
social class. Social class has been related 
to the child’s perceptions, attitudes, per- 
sonality, motivation, future expecta-
tions, and overt behavior (e.g., Lynd
and Lynd, 1929; 1936; Bruner and 
Goodman, 1947; Centers, 1949; Rabban, 
1950; Schneider and Lysgaard, 1953; 
Himmelweit, 1955; McKee and Leader, 
1955; Hoffman and Albizu-Miranda, 
1955; Douvan, 1956; Rosen, 1956; 
Sewell and Haller, 1956; Rainwater, 
1956; Angelino, Dollins, and Mech, 
1956; Remmers and Radler, 1957; Hoff-
man, Mitsos, and Protz, 1958). These
are all family studies in the sense that
the child’s social-class position is a func- 
tion of his family membership, and the 
family is usually seen as the medium 
through which the meaning of the class 
setting is transmitted to the child. 

In addition, however, a number of 
studies have related social class to the 
child-rearing attitudes and practices of 
the parents. (These include Davis and 
Havighurst, 1946; Duvall, 1946; Eric- 
son, 1947; Hollingshead, 1949; Havig-
hurst and Taba, 1949; McGuire, 1952;
Klatskin, 1952; Oeser and Emery, 1954; 
Oeser and Hammond, 1954; Maccoby 
and Gibbs, 1954; Alper, Blane, and 
Abrams, 1955; Havighurst and Davis, 
1955; Sears, Maccoby, and Levin, 1957; 
Littman, Moore, and Pierce-Jones, 1957; 
White, 1957; Boek et al., 1957; Miller 
and Swanson, 1958; Kohn, 1959b.) 
Some of these studies have examined 
the relationship between class and child- 
rearing variables and between class and 
child behavior. Some have related class 
to child-rearing variables and the lat- 
ter, in turn, to child behavior. Others 
have stopped at the point of relating 
class to child-rearing variables, often 
inferring probable child resultants or 
leaving that linkage to other re- 
searchers.

The studies that directly link class 
to child resultants often involve the 
theoretical weakness that the breadth 
of the jump leaves open many possible 
alternative explanations for any em-
pirical relationships obtained. Why
does social class differentiate child atti- 
tudes and behavior? The studies of class 
and intrafamily variables help bridge 
this gap, but even these studies often 
involve static analyses which treat class 
simply as a fixed value setting. Thus
it was surprising when two studies, ten 
years apart, revealed different relation- 
ships between social class and child-
rearing practices (Havighurst and
Davis, 1955). The original study was 
not based on a dynamic theory capable 
of incorporating changes over time. 
Since then, several theories have been 
advanced, which would account for 
such changes. White (1957) suggests 
that if the discrepancies in the two 
studies reflect true change, rather than 
mere difference in samples or methods, 
one explanation might lie in the dif- 
ferent reference groups used by the 
classes. She found that middle-class 
mothers more often mention experts, 
other mothers, and friends as their
sources of ideas about child rearing. 
If they mention their own parents, it is 
usually as a negative reference. Lower- 
class mothers, on the other hand, more 
often rely on their own inclinations 
and their own upbringing—using their 
parents as a positive reference group. 
Thus the lower class is less susceptible 
to change in child-rearing patterns, 
whereas the middle class responds more 
to expert advice and the prevailing 
climate of opinion. 

Miller and Swanson offer another ex- 
planation, which views class differences 
within the context of larger societal 
settings they call “entrepreneurial” 
and “bureaucratic.” They contend that 
the growing complexity of economic 
organization, the continued increase in 
urbanism, and the decreased rate of im- 
migration have changed the meaning of 
social class. Child-rearing patterns have 
correspondingly changed in such a 
way that the child’s resulting person-
ality will enable him to pursue more
effectively an occupation in the family’s 
current setting. For example, risk tak- 
ing may have been appropriate behavior 
for the middle class in an entrepreneurial
society, but in a bureaucratic society a 
security orientation is more appro- 
priate. This changed requirement is 
presumed to be reflected in child-rear- 
ing practices, thus fitting the child 
psychologically for his probable adult 
occupation. A similar theory of the 
process by which class affects child- 
rearing practices (Aberle and Naegele, 
1952) views the parent as socializing 
the child for the adult role he is ex- 
pected to occupy, the parent’s model 
for this role being the one he himself 
occupies. 

The measurement of social class has 
been discussed and studied for many 
years, but still there is no generally 
agreed-upon way of measuring it or 
even conceptualizing it. To some re- 
searchers, class represents a way of life; 
to some, power over resources and per- 
sons; to some, reputation or esteem. 
Some investigators regard class as an
objective phenomenon; to others it is
entirely subjective, and even those who
agree on its subjectivity disagree 
whether it is a perception of the self 
or of others. The number of classes in 
our society has usually been placed at 
between two and six, although some 
researchers think of it as a continuous 
ranking system with almost as many 
positions as there are families to be 
classified. Perhaps the prototypic defi- 
nitions of social class are those of Marx 
(1906-1909), who defines class in terms 
of man’s relationship to the means of 
production, i.e., occupation; Veblen
(1918), who considers consumption
patterns as crucial; Warner and Lunt 
(1941), who define it in terms of other
people’s judgments of a family’s prestige 
and esteem; and Centers (1949), who 
defines it primarily as the subject's self- 
judgment. The occupational indices 
most commonly used have been either 
the classification of the family bread- 
winner’s occupation according to the 
categories used by the U. S. census 
(U. S. Employment Service, 1949) 
or a simple white collar-blue collar 
dichotomy. In the Veblen tradition 
one might count the Chapin scale 
(1930), which includes a rating of liv- 
ing-room furniture. Warner’s original 
rating system used community inform- 
ants, but because it was too cumber- 
some for extensive use, a scale called the 
Index of Status Characteristics was de- 
veloped (Warner, Meeker, and Eells, 
1949). This index, designed to estimate 
community prestige ratings, is based 
on a sum of weights assigned to occupa- 
tion, dwelling area, house type, and 
source of income. Hollingshead’s Index 
of Social Position (Hollingshead and 
Redlich, 1958) uses occupation, dwell- 
ing area, and education. Hatt (1950) 
has suggested a compromise to the use 
of occupation and community judg- 
ments in which occupations are rated 
and classified not according to the re- 
searcher’s judgments but according to 
popular judgments. Sewell and Haller 
(1956), in a small community study, 
used teacher ratings of family status 
in combination with the father’s occu- 
pation. Income, ethnic background, and 
race have also been considered as aspects 
of class. 
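
The weighted-sum construction behind an index such as the Index of Status
Characteristics can be sketched briefly. This is only an illustration: the
1-to-7 rating range, the particular weights, and the function name
`status_index` are assumptions made for the example, not values taken from
the text above, and they should be replaced by those of the published scale.

```python
# Hypothetical weights for illustration only; consult the published
# scale for the weights actually used.
ASSUMED_WEIGHTS = {
    "occupation": 4,
    "source_of_income": 3,
    "house_type": 3,
    "dwelling_area": 2,
}


def status_index(ratings, weights=ASSUMED_WEIGHTS):
    """Return the sum of each component rating multiplied by its weight."""
    missing = set(weights) - set(ratings)
    if missing:
        raise ValueError(f"missing ratings: {sorted(missing)}")
    return sum(weights[name] * ratings[name] for name in weights)


# One family's component ratings (assumed to run from 1 to 7).
family = {
    "occupation": 2,
    "source_of_income": 3,
    "house_type": 4,
    "dwelling_area": 3,
}
score = status_index(family)
```

Keeping the weights in a separate table makes it easy to substitute another
index built from a different set of components.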

Each operational definition of class,
and these include many combinations
of the measures mentioned above as 
well as others, has its protagonist. Al- 
though the ultimate decision depends 
on the needs of the particular research, 
some help in deciding which measure 
to use is available in the empirical com- 
parisons that have been made of the 
various indices. Nineteen class indices 
were interrelated by Kahl and Davis 
(1955). They conclude that the various 
indices are highly correlated, and, al- 
though two common factors could be 
isolated from these indices, all can be 
viewed roughly as representing a single 
dimension. This would mean that for 
many purposes the simplest measure, 
the breadwinner’s occupation, would 
probably be sufficient. On the other 
hand, Haer (1957) compares several 
indices on their ability to predict 
various attitudes and behavior and con- 
cludes that Warner’s Index of Status 
Characteristics has the greatest predic- 
tive power, with the Centers type of 
self-perception measure having the 
least. Boek et al. (1957) also report
that the Warner I.S.C. was the social- 
class index most highly correlated with 
six competing indices. It should also be 
pointed out that, other things being 
equal, there is an advantage to using 
measures of class that will make possible 
comparisons and syntheses with other 
related studies. 

Father’s occupation can be con- 
ceptualized as a family-setting variable 
in other ways besides merely as an 
index of social class. Occupation could 
affect family life and the child by its 
effect on the father’s personality. Thus, 
for example, occupations might be 
classified in terms of their frustration
potential and their opportunities for
hostility release. Such a classification, 
cutting across class lines, might prove 
to be significant for family interaction 
and child behavior. Using the same 
general theory as Miller and Swanson 
and Aberle and Naegele, the child’s 
socialization might be different, de- 
pending on whether the father’s occu- 
pational position depends on formal 
training or personal characteristics. 
Dyer (1956) has shown that job satis- 
faction of the father is communicated 
to the child and also affects the parents’ 
job aspirations for the child. Bronfen-
brenner (1958a) has categorized occu-
pations into those that are quality
oriented and those that are not. The 
first group includes occupations that 
involve evaluation of a concrete prod-
uct, the evaluation being based on the
worker’s own knowledge and skill. This 
classification of occupations, which cuts 
across class lines, has been found to re- 
late to parent-child relations, as re- 
ported by the adolescent child. Aberle 
and Naegele (1952), who interviewed 
fathers about effects of their jobs on 
the family, found that their respondents 
did not see a connection between their 
jobs and their homes. Their conclusion 
that this is perceptual blindness on the 
part of the subjects would suggest that 
this line of research might better be 
explored by using some theoretically 
based classification of occupations as the 
independent variable and relating it to 
measures of family life and child vari- 
ables rather than relying on introspec- 
tive reports of job effects. 

Time is another family-setting vari- 
able and one which is a challenge to 
measure. There are many theoretical
[...] about the relationship between
a given time period, family patterns,
and child behavior, but empirical studies
are few because standardized studies
over the years have not been done.
Ramsey and Nelson (1956), in one of
the few replication studies that have
been reported, compared adolescent at-
titudes toward the family in 1939 and
1952. They point out that even with
strict replication there are difficulties
in that the cultural meaning of words
and the interpretations of questions
may change over time. This would
apply to longitudinal studies also. But
replication and longitudinal studies are
still the most promising methods for
studying changes over time.


Wolfenstein (1953) and Stendler 
(1950) have operationalized time as a 
variable by studying literature about 
child care over several generations. 
Such content analyses of expert opinion 
are not studies of practices, of course, 
but Bronfenbrenner (1958a) has shown 
in a reanalysis of some of the studies 
of social class and child-rearing prac- 
tices that there is a remarkable corres- 
pondence between expert opinion and 
reported behavior, particularly for the 
middle class. Bronfenbrenner has man- 
aged, by organizing existing empirical 
studies according to the time period 
the data represent and by reanalysis of 
the data to make the various studies 
more comparable, to operationalize 
time as a family-setting variable. He 
suggests that expert opinion may be
the intervening variable between the 
time setting and the child-rearing 
practices. 

Other methods of comparing time 
settings include asking adults to recall
their own childhood (Radke, 1946)
and comparing attitudes of grand- 
mothers with attitudes of mothers of 
the current generation of children 
(Staples and Smith, 1954). The recall 
method has already been criticized; and 
the data gathered from grandmothers 
suffer in that they do not necessarily 
reflect the opinions of these women 
when they actually were mothers of 
young children. 

Still another way of considering time 
as a variable is by specifying certain 
crucial aspects of the time period and 
studying these aspects as they occur 
currently. This is, in a sense, what 
Miller and Swanson have done. They 
have studied contemporary families, 
but by classifying them according to 
whether they are entrepreneurial or 
bureaucratic they are doing a time 
study in that the entrepreneurial repre- 
sent an earlier generation in certain 
genotypic respects. 

Studying war or depression as a cur- 
rent family-setting variable involves 
problems similar to those in a study of 
any time period. If there is no current 
war or depression, the problem is the 
same as studying any time period that 
is past. The paradox of the researcher’s 
position was clear in the laughter of 
the classroom when Robert Angell said 
in mock solemnity that he had always 
hoped to do a replication of his study, 
The Family Encounters the Depression 
(1936). It is sometimes true, however, 
that a depression or other disaster will 
hit one community, even though the 
country as a whole is relatively unaf- 
fected. A confined disaster is a prefer- 
able state of affairs for the researcher, 
for a second community can be used
as a control group. In war, and in a
general depression, this is not possible. 
Usually, the events that preceded have 
to be reconstructed by subject reports, 
as in the case of Angell’s study. The 
possibility that the perception of the 
preceding state of affairs is colored by 
later events is always a danger. 

Other family-setting variables include 
culture (see Chapter 21), subculture 
membership, rural-urban-suburban loca- 
tion, neighborhood, and even the house 
and its physical surroundings. All the 
family-setting variables, like parental 
background, are most effectively linked 
to the child through intervening steps. 
One outstanding example is a study of 
achievement by Strodtbeck (1958) in 
which he links subculture membership 
to parent values and family structure, 
examines the relationship between each 
of these and child values, and links 
child values to the child’s personality 
and behavior. 

Most family-setting variables are not 
particularly difficult to measure. Meas- 
uring neighborhood can be somewhat 
challenging, depending on what aspect 
of the neighborhood is important for 
the theory. One might want to consider 
the socioeconomic status of the neigh- 
borhood, the ethnic composition, the 
physical facilities, the transiency, the 
delinquency rate, the community's per- 
ception of the neighborhood, etc. 
Census-tract data, interviewer ratings, 
use of community informants, inter- 
views on neighborhood perceptions 
with a random sample of community 
members, and the ratio of police records 
of juvenile arrests (coded by neighbor- 
hood) to the number of children in a 
neighborhood (obtained from the
school census) have all been used as
measures of the neighborhood setting.
Bossard (1951) has suggested a unique
way of considering the house itself as
a variable. He has developed a Spatial
Index for Family Interaction, based on 
the number of persons in the household, 
the resultant number of interrelation- 
ships, and the number of square feet of 
floor space in the living quarters. He
suggests that this index is related to the
stresses, strains, and frustrations of
family living. 
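
Two of the measures just described reduce to simple arithmetic, sketched
below. The text does not give Bossard's formula, so the sketch assumes that
"interrelationships" means the number of distinct pairs among household
members, n(n - 1)/2, and expresses the index as floor space per pair; both
choices, and all function names, are assumptions for illustration.

```python
def pairwise_relationships(persons):
    """Number of distinct two-person relationships among `persons` people."""
    if persons < 2:
        raise ValueError("at least two persons are needed for a relationship")
    return persons * (persons - 1) // 2


def floor_space_per_relationship(persons, square_feet):
    """Assumed form of a spatial index: square feet per pairwise relationship."""
    return square_feet / pairwise_relationships(persons)


def arrest_ratio(juvenile_arrests, children_in_neighborhood):
    """Juvenile arrests (police records) divided by children (school census)."""
    if children_in_neighborhood <= 0:
        raise ValueError("the neighborhood must contain at least one child")
    return juvenile_arrests / children_in_neighborhood


household_index = floor_space_per_relationship(persons=5, square_feet=1000)
neighborhood_rate = arrest_ratio(juvenile_arrests=12, children_in_neighborhood=400)
```

The pair count grows much faster than household size (5 persons give 10
pairs, 8 persons give 28), which is why a fixed amount of floor space is
spread over many more relationships in a large family.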


FAMILY COMPOSITION 


Most of the anthropological studies
of cultural groups consider family com-
position variables and their significance
for child development. Malinowski 
(1927), for example, has attempted to 
demonstrate the cultural basis of the 
Oedipus complex by showing that in 
the Trobriand Islands, where the child's
important family unit is his mother’s 
kin group, it is the uncle, the family 
authority, who is the target of the male 
child’s hostility, whereas the father is 
an affectionate companion. In cross- 
cultural studies it is difficult to demon- 
strate empirically the particular effects 
of family composition because cultures 
usually vary in many additional re- 
spects. Nevertheless, many valuable in- 
sights have come from cross-cultural 
comparisons, and the possibility of 
comparing many cultures, alike in some 
respects but different in others, is a 
promising area of research. Because the 
anthropological studies are discussed in 
Chapter 21, we confine ourselves here 
to discussing mainly variations in family 
composition within the conjugal family.

The nuclear family in America typi-
cally consists of a mother, father, and
one or more children. However, one
parent may be absent from the home
permanently or temporarily, and both
parents may be absent, with the con-
ventional parental roles being taken by
other adults. Sometimes the family unit
includes adult members in addition to
the parents and their offspring. Finally,
the number of children, their sex and
[...], and the ordinal position of any
child may vary considerably. Each
of these variations has been given
attention as an independent variable in
relation to some aspect of the child’s
development.

Absence of a Parent. Authors of 
studies involving the father’s absence 
from the family often begin their re- 
ports with the comment that everyone 
studies mother absence but few have 
studied father absence. Strictly speak- 
ing, this is not so. We know of very 
few studies of mother absence, per se, 
although there are studies of children 
reared without families in which the 
researcher is relatively unconcerned 
with the missing father and siblings. 
The reason for this paucity is partly 
that families without mothers are un- 
common and cannot be found in con- 
centrated groups. Maternal absence has 
therefore usually been studied by com- 
paring children in institutional settings 
with children in intact families—natural 
or adopted. These studies typically re- 
late “maternal absence” directly to 
data about the child and show physical, 
mental, and emotional retardation to 
be associated with separation from the 
family (e.g., Goldfarb, 1945; Bender,
1948-1949; Feinberg, 1954). Bowlby
et al. (1956) studied maternal separa-
tion by locating children, 7 to 13 years
old, who had been separated from their
families before the age of 4 and had
spent a few months to two years in a
tuberculosis sanitarium. Comparison be-
tween these children and a control
group did not reveal differences as strik-
ing as those in previous studies, pre-
sumably because the separation in this
case was not tied in with parental re- 
jection or other emotional conflicts in 
the family. Heinicke (1956) has studied 
the degree of separation from family by 
comparing children in a day nursery to 
children in a residential nursery. Fami- 
lies were contacted before the separa- 
tion, and some attempt was made to 
standardize the two groups so that the 
amount of separation was the major 
variable influencing differences. A very 
interesting study designed to obtain 
data on the importance of the mother 
is that of Spitz (1945), in which
two groups of institutionalized children 
were compared as to retardation. In 
one group, composed of children in a 
nursery attached to a women’s prison, 
the mothers were present, and in the 
other they were not. This study gives 
evidence that the crucial variable is the 
absence of a mother or mother-surro- 
gate. An additional piece of the puzzle 
is provided by an experiment by Rhein- 
gold (1956). In the experimental situa- 
tion one person cared for eight babies, 
giving them as much individualized at- 
tention as possible. The control group 
included babies cared for by regular 
institutional routines, the caretaking 
acts being performed by many different 
persons. The dependent variable in this 
study was social responsiveness, and
the experimental babies showed more
social responsiveness, even to strangers, 
than did the control babies. 

In reviewing these studies of mother 
absence or separation, it can be noted 
that the challenge of research is not 
so much in the construction of measures 
as in the ability to find a natural setting 
that itself measures the independent 
variables. Bowlby, concerned with iso- 
lating mother separation from the emo- 
tional factors that often precede such 
an event, used children who had been 
hospitalized for disease. Spitz, interested 
in distinguishing between the absence 
of a mother figure and institutional 
care, sought out an unusual situation in
which the latter condition existed but
not the former. The “trick” is to find 
natural settings, alike in some ways, 
but different with respect to the crucial 
variable under investigation. This is not 
an easy task. Even the Spitz study was 
not completely successful, for the two 
nurseries also differed in that the 
nursery attached to the penal institution 
was superior to the other in certain 
physical respects. The Rheingold study 
is one of the few that involved experi- 
mentation. Manipulation of family- 
structure variables by the researcher is 
usually limited to the institutional set- 
ting, and, because of ethical considera- 
tions, it is further limited to improving 
conditions. The most important prob- 
lem in measuring structural variables 
remains that of finding experimental 
conditions already in existence. 

The prevalent use of institutionalized 
children is probably one of the reasons 
why the actual processes that link
mother absence to child personality
have not been examined. Institutional-
ized children are different in so many
ways that specifying empirically the 
particular intervening variables between 
mother absence and the child’s mal-
adjustment is usually not feasible, de-
spite the noteworthy attempts by such
investigators as Spitz and Rheingold.
It would be worthwhile to seek out as 
subjects families in which the mother 
is absent but the family has continued 
to function; for example, when the
mother is hospitalized or dead. In most 
cases a mother substitute will have been 
provided, but one might still classify the 
mother substitutes along a number of 
theoretically derived criteria. Variables 
such as maternal employment could also 
be conceptualized in terms of relative 
degrees of mother absence. 

In some ways the studies of father 
absence have been more fortunate. 
There are many natural settings that 
enable one to study father absence. 
Freud and Burlingham (1944), Bach 
(1946), Sears, Pintler, and Sears (1946), 
Hill (1949), Sears (1951), and Stolz et 
al. (1954) studied families in which the
father was absent during the war. Tiller 
(1958) studied families of sailors in 
Norway in which the father’s occupa- 
tion kept him away from home for long 
periods at a time. Additional studies 
deal with situations in which the mar- 
riage has been disrupted by separation, 
divorce, or death of father (Sutherland, 
1930; Sears, 1951). The father-separa- 
tion studies, for the most part, deal with 
families that continued to exist and, as 
such, allow the researcher more readily 
to study father separation as an isolated 
variable, i.e., by comparing families that 
are alike in most other respects. Further-
more, the fact that families are other-
wise intact has made it possible to study
the effects of father separation on
family interactions.

The intervening processes have in-
cluded consideration of such variables
as resolution of the Oedipus complex
and identification, absence of a mas-
culine model, communication of the
father figure through the mother’s de-
scription, the mother-child relationship
that results from father absence, the
presence of a grandparent, employment
of the mother, and the mother’s emo-
tional state. Thus the father-absence
studies, compared to the mother-absence
studies, have left much less of a mystery
concerning the processes by means of
which parental absence is linked to the
child’s personality and behavior. The
Stolz study is particularly exciting be- 
cause she also deals with the father’s 
return and the problems of readjust- 
ment, and because her design enables 
her to compare the child born while 
the father was absent with a second 
child born after his return and with 
a comparable first-born child whose 
father was not absent. 

Before going on to sibling composi- 
tion, it might be mentioned that a few 
studies have dealt with the presence of 
an extra parent figure. For example, 
Staples and Smith (1954) have studied 
the relationship between the presence 
or absence of a grandmother in the 
home and the permissiveness of the 
mother. 

Siblings. Sibling composition vari-
ables of concern include family size
(within the conventional nuclear family 
the number of children is the main 
source of variability), ordinal position, 
spacing, sex, and certain specific pat-
terns, such as the “only” child, the
adopted child, and twins. These vari- 
ables have been linked to the child’s 
personality and behavior directly and 
by the intervening variable of family 
interaction—either parent-child or 
child-sibling. In most studies that 
empirically link sibling composition di- 
rectly to child variables certain assump- 
tions are made about family interaction. 
However, Bossard (Bossard, 1948; 
Bossard and Sanger, 1952; Bossard and 
Boll, 1954; Bossard and Boll, 1955b) ap- 
proaches the problem with what might 
be called a true ecological theory. 
Bossard’s theoretical constructs in- 
clude such variables as the density of 
interaction, the availability of supplies, 
and the availability of personality roles. 
As an example of the last, he takes up 
the question of personality roles in the 
large family. One of the reasons he 
gives for the personality roles of the 
younger children is that the other 
roles, such as the “social butterfly,” 
will already have been chosen by one 
of the children who arrived on the 
scene earlier. 

Koch (1954, 1955a,b, 1956a,b,c,d), 
on the other hand, empirically relates 
composition factors directly to the 
child data, but she organizes her rela- 
tionships by means of a theory that 
involves assumptions about the gratifi- 
cations and stresses of the child-sibling 
relationship and the parent-child rela- 
tionship. Brim (1958) has reanalyzed 
some of Koch’s data linking sex of sibs 
and ordinal position to sex-linked 
traits in the child. The intervening 
process in his theory is that interaction 
between two persons leads to assimila-
tion of roles, especially the assimilation
by the younger (less powerful) sibling
of the role of the older. Still other 
studies (e.g., Lasko, 1954) have con- 
sidered the relationship between ordinal 
position and parent-child relations with- 
out studying the traits of the child. 
Sears (1950), Sears, Maccoby, and 
Levin (1957), and Henry (1957) have 
empirically studied all three steps— 
ordinal position, child rearing, and the 
child. Each of these studies considers 
relationships between the first step and 
the third, the first and the second, and 
the second and the third, and each is 
concerned with establishing linkages 
between ordinal position and child re- 
sultants. It is interesting to note that 
both the study by Sears, Maccoby, and 
Levin and the one by Henry address 
themselves to the relationship between 
being the oldest child and being ag- 
gressive toward authority, but each sug- 
gests a different theory to explain the 
relationship. For Henry, the interven- 
ing step is the fact—reported by both 
studies—that fathers are more likely 
to discipline the oldest child; for Sears, 
Maccoby, and Levin, the intervening 
step involves consideration of inter- 
sibling relationships. Both hypotheses 
need further testing, but the fact that 
two competing hypotheses have em- 
erged from studies showing consistent 
patterns of findings suggests the diffi- 
culties of demonstrating linkages. A 
more adequate test of the Henry 
hypothesis might be to divide oldest 
children into two groups, those who 
are the recipients of greater father dis- 
cipline and those who are not, and to 
divide nonoldest children into the same 
two groupings. If the Henry hypoth-
esis is correct, both oldest and nonold-
est children whose fathers are disci-
plinarians should show more aggression
than the others, and they should not 
be different from one another. 
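
The proposed test amounts to a two-by-two breakdown of aggression scores by
ordinal position and father discipline. The sketch below shows one way the
cell means and the relevant contrasts might be computed; the scores are
invented solely to exercise the code and are not data from any of the
studies cited, and the function name `cell_means` is hypothetical.

```python
from collections import defaultdict
from statistics import mean


def cell_means(records):
    """Average aggression score for each (is_oldest, father_disciplines) cell."""
    cells = defaultdict(list)
    for is_oldest, father_disciplines, aggression in records:
        cells[(is_oldest, father_disciplines)].append(aggression)
    return {cell: mean(scores) for cell, scores in cells.items()}


# Invented records: (is_oldest, father_disciplines, aggression score).
toy_records = [
    (True, True, 7), (True, True, 9),
    (True, False, 4), (True, False, 6),
    (False, True, 8), (False, True, 8),
    (False, False, 5), (False, False, 5),
]
means = cell_means(toy_records)

# Effect of father discipline within each ordinal-position group.
discipline_effect_oldest = means[(True, True)] - means[(True, False)]
discipline_effect_others = means[(False, True)] - means[(False, False)]

# Difference between oldest and nonoldest children once discipline is held constant.
ordinal_gap_when_disciplined = means[(True, True)] - means[(False, True)]
```

Under the hypothesis as stated, both discipline effects would be positive
and the ordinal gap among disciplined children would be near zero; in real
data each contrast would of course need a significance test.
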

Methodological problems in studies
of sibling composition do not lie in the 
measurement. Both adults and children 
can report family composition in re-
sponse to direct questions without dis-
tortion. However, there is often a
problem in locating appropriate sub- 
jects. This problem is most obvious in 
a study like that of Bossard and Boll 
(1954, 1955b), in which their sample 
needed to include only families with 
six or more living children, or in a study 
like that of Mowrer (1954), which 
analyzed relationships of 612 twins to 
each other, to other members of their 
families, and to friends. However, even 
in a study of more commonly occurring 
sibling status positions, there are diffi- 
culties when one tries to hold some 
aspects of sibling structure constant. 
A remarkably well-controlled study 
from this standpoint is that of Koch 
(1955a,b, 1956a,b,c), in which she se- 
lected a sample consisting of 384 5- and 
6-year-old children from native-born, 
intact, urban, two-child families. These 
subjects had been selected so that it 
was possible to subdivide the sample 
into equal-sized groups based on sex, 
ordinal position, sex of sibling, and 
space between siblings. Since the first 
three variables were dichotomies and 
the fourth was operationalized as a tri-
chotomy, she had 24 groups of 16
children each. By considering only two- 
child families, Koch immensely reduced 
the number of possibilities to be con-
sidered. Even so, there were 24 possi-
bilities, and had the 384 subjects not
been selected to fall equally into each
classification she would not have had 
enough subjects in some of her groups. 
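
The arithmetic of such a fully crossed design is easy to verify by
enumerating its cells. In the sketch below the factor names follow the
description above, but the category labels (for instance the three spacing
levels) are assumed for illustration.

```python
from itertools import product

# Three dichotomies and one trichotomy; the level labels are assumptions.
factors = {
    "sex": ["boy", "girl"],
    "ordinal_position": ["first-born", "second-born"],
    "sex_of_sibling": ["brother", "sister"],
    "spacing": ["close", "intermediate", "wide"],
}

# Every combination of one level from each factor is one cell of the design.
cells = list(product(*factors.values()))

children_per_cell = 16
total_children = len(cells) * children_per_cell
```

Here `len(cells)` is 2 x 2 x 2 x 3 = 24 and `total_children` is 384. Adding
a single further dichotomy would double the number of cells, which is why
restricting the sample to two-child families matters so much.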

The difficulties involved in such a 
study are obvious. Where does one ob- 
tain a large group of subjects with 
these particular characteristics? If a re- 
searcher is interested in obtaining a 
sample of 6-year-old children, he need 
only contact the schools to obtain con-
centrated groups of children who fit
this classification, but there is no
natural setting in which one can readily
find concentrated groups of two-child
families. This kind of problem occurs 
so often in family research that it will 
be taken up in the section dealing with 
general methodological problems. Suf-
fice it to say that the researcher is in-
deed fortunate who is on cordial terms
with the superintendent of a large 
school system and who is able to obtain 
the names and addresses of the children 
who have the specific characteristics he 
seeks. 

Another very promising method of 
studying sibling position while control- 
ling on other variables is that employed 
by Lasko (1954). She was interested in 
comparing first and second children in 
the same family. To understand the dif- 
ferences in personality between first 
and second children, it is important to 
know whether they are treated dif- 
ferently by their parents. Dean (1947) 
had already done a study that com-
pared personality traits of siblings,
using the mother’s report of the dif- 
ferences between them as her measure. 
Lasko, however, had behavioral ratings 
of parent behavior (the Fels Parent Be- 
havior Rating Scales). Furthermore, 
these data were part of a longitudinal
study, home observations being made
at intervals of six months. With such 
data, Lasko was able to compare the 
mother’s behavior toward her first and 
second child when the two siblings 
were the same age and also to compare 
the consistency of the mother’s be- 
havior toward each child over the years. 
She found, for example, that parent be- 
havior toward the first child is, on the 
average, less warm and more restrictive 
and coercive. She further found that 
first children are treated less consis- 
tently by their parents over the years. 
They start out from a more favorable 
position than the second child ever ex- 
periences, but by the time they are 3 
or 4 they are treated less warmly than 
the second child is treated at a similar 
age. This study represents a real at- 
tempt to understand the environmental 
factors through which ordinal position 
is linked to child personality and be- 
havior. It is interesting to note that 
Sears, Maccoby, and Levin report no dif-
ferences in maternal warmth expressed
toward first and second children. Their 
study involves interfamily comparisons 
based on maternal interview data. 
Lasko’s use of intrafamily comparisons, 
in combination with the use of longi- 
tudinal behavior observations, makes 
the Lasko study much more sensitive 
and capable of discerning differences. 
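
Why an intrafamily comparison is more sensitive can be shown with a small
numerical sketch. The warmth ratings below are invented for illustration
only: families differ widely in overall warmth, but within every family the
second child is rated one point higher than the first.

```python
from statistics import mean, stdev

# Invented ratings: (warmth toward first child, warmth toward second child),
# one pair per family, both taken when the child was the same age.
families = [(3, 4), (6, 7), (8, 9), (5, 6)]

first_child = [first for first, _ in families]
second_child = [second for _, second in families]

# Interfamily view: children are compared across families, so the large
# spread between families is part of the noise.
between_family_spread = stdev(first_child)

# Intrafamily view: each family serves as its own control.
within_family_diffs = [second - first for first, second in families]
mean_difference = mean(within_family_diffs)
difference_spread = stdev(within_family_diffs)
```

In this contrived case the mean difference is one point in both views, but
the spread of the within-family differences is zero, whereas the spread
between families exceeds two points and would easily mask a one-point
difference between separate groups of first and second children.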


RELATIONSHIPS
BETWEEN PARENTS


As we move down the process se- 
quence, fewer steps are required to link 
the independent variable with the 
child. Mother-father relationship can be 
linked to the child almost directly by
processes that are internal to the child.
For example, the power relationship be- 
tween the husband and wife is perceived 
by the child and may influence his iden- 
tification choice; the division of labor 
between husband and wife determines 
in part the child’s cognition of what 
male and female roles are; and the de-
gree of marital tension may affect the
child’s sense of security. These variables 
might also be tied to the child less 
directly. Hoffman (1960), for example, 
suggests that when the father is domi- 
nant toward the mother she is dominant 
toward the child, and the child, con- 
tinuing the pecking order, is then 
dominant toward other children in 
nursery school. This theory involves a 
more concrete step, linking the parental 
relationship first to the parent-child 
relationship and only then to the child. 
It is apparent from these illustrations 
that the husband-wife relationship is an 
important variable in the child’s devel- 
opment. 

To elaborate, using the theoretical 
model implied in the division of labor 
example, the family represents a model 
of society for the young child. Perhaps 
the first status in society that the child 
becomes aware of is sex. He learns his 
own sex and that of each parent. The 
relationship between the parents com- 
municates to him what males and fe- 
males are like and how they interact. 
He may learn that males are strong and 
dominant and females are weak and 
submissive, if this is the pattern com- 
municated to him by his parents’ inter- 
action. This tells the child something 
about what to expect from other males 
and females, and, since his sex is the 
same as one of the parents, it also tells
him something about what should be ex-
pected of him. Furthermore, if that ex-
pectation is undesirable to him, e.g., if
he is a male but perceives that males
(his father) are weak, he may choose to
be the opposite sex. There are two em-
pirical studies that suggest that boys
prefer the dominant parent and girls
prefer the nondominant parent. Kohn
and Clausen (1956) found this with
schizophrenic adults recalling their
childhood, and Lippitt and Hoffman
(1959) found it with normal children
between 8 and 12. Kohn and Clausen’s
study corroborated subject reports
about the dominance of their parents
with reports of other family members.
Lippitt and Hoffman used both chil- 
dren’s perceptions of father-mother 
power and mothers’ reports of which 
parent had the major role in certain 
important family decisions. These find- 
ings suggest that the child’s identifica- 
tion choice and probably also the child's 
adjustment to society may be influenced 
by whether or not the father-mother 
relationship in any particular family 
corresponds to the male-female relation- 
ships of the larger society. The Kohn 
and Clausen study reports that mothers 
of schizophrenic patients controlled 
more major and minor decisions in the 
families than the mothers of normals, 
and they suggest that schizophrenia is 
not the only maladjustment so charac- 
terized. Whether or not a cross-sex 
identification is the process involved in 
linking mother dominance to maladjust- 
ment needs further investigation. 
When one considers the importance 
of the relationship between the mother 
and father for the child’s development,
the paucity of studies in this area seems
startling. There is a number of studies
predicting to husband-wife power (e.g.,
Strodtbeck, 1951; Sharp and Mott,
1956; Heer, 1958; Blood, 1958; Wolfe,
1959; Hoffman, 1960). Some studies es-
tablish norms about or predict to 
the division of functions (e.g., Herbst, 
1952; Oeser and Hammond, 1954; Oeser 
and Emery, 1954; Johannis, 1955; Ken-
kel and Hoffman, 1956; Blood, 1958;
Hoffman, 1960). However, omitting the
studies that have dealt only with the di-
vision of certain child-rearing functions
between husband and wife (e.g., King
and Henry, 1955; Henry, 1956, 1957;
Sears, Maccoby, and Levin, 1957),
which will be considered in later sec- 
tions, we find very few studies relating 
either husband-wife power or division 
of functions to the child. Aronfreed 
(1955) and Altucher (1956) tried to re- 
late a child perception measure of these 
variables to sex identification in males, 
but their results are inconclusive. 

Strodtbeck (1958) has studied the 
effects of power in the family on the 
child. However, the independent vari- 
able is not, strictly speaking, husband- 
wife power but husband-wife-son 
power, and the theory Strodtbeck uses 
to link power to child attitudes is one 
that involves the son’s being in a re- 
ciprocal relationship to the high-pow- 
ered father; that is, when the father is 
high powered, the son feels controlled 
and generalizes this to the nature of the 
world, believing, for example, that man 
is controlled by destiny. 

Although there are few attempts to 
relate husband-wife power and division 
of functions to the child, there are many 
methods of measuring these variables.
Power is the more challenging to meas-
ure. For one thing, since power is not 
specific to the home setting but capable 
of being revealed in discrete acts, it is 
not limited to home observation or the 
questionnaire and can be measured by 
means of an experimentally contrived 
situation. An example would be Strodt- 
beck’s revealed difference technique, 
which he has used in studying husband- 
wife power as well as husband-wife-son 
power. The husband and wife sepa- 
rately answer questionnaire items. The 
experimenter then presents them with 
one of the items about which there was 
disagreement and asks the couple to 
reconcile their differences. Their dis- 
cussion is tape-recorded while the ex- 
perimenter is absent from the room. 
The process is repeated for a given 
number of items, and a score is later 
computed for power based on which 
person won the other over to his view. 
A similar technique was used by Kenkel 
(1956). Married couples were asked to 
pretend that they had received a gift 
of 300 dollars and to go through the 
process of deciding how it would be 
spent. Power was measured in terms of 
the adopted items suggested by each 
spouse. Vidich (1956), without much 
success, tried a similar technique in 
which couples were asked to discuss 
several salient problems and to decide 
how they would spend an imaginary 
amount of money. 
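
The scoring step of the revealed difference technique can be sketched as follows. This is only a minimal illustration, assuming that each discussed item has already been coded by the investigator as won by the husband, won by the wife, or not clearly won by either; the function name, the outcome labels, and the example data are hypothetical and are not taken from Strodtbeck's reports.

```python
from collections import Counter

def revealed_difference_scores(outcomes):
    """Return each spouse's share of the items that had a clear winner.

    `outcomes` holds one code per discussed item: "husband", "wife",
    or "none" when neither spouse clearly won the other over.
    These codes are an assumed convention, not Strodtbeck's own.
    """
    wins = Counter(o for o in outcomes if o in ("husband", "wife"))
    decided = wins["husband"] + wins["wife"]
    if decided == 0:
        # No item had a clear winner, so no power share can be assigned.
        return {"husband": 0.0, "wife": 0.0}
    return {spouse: wins[spouse] / decided for spouse in ("husband", "wife")}

# Hypothetical coding of five discussed items for one couple.
example_outcomes = ["husband", "wife", "husband", "none", "husband"]
scores = revealed_difference_scores(example_outcomes)
```

Expressing the result as a share of decided items, rather than a raw count, is a design choice made here so that couples who discussed different numbers of items remain comparable.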

Interviews have also been employed 
to determine husband-wife power. 
Wolfe (1959) used a measure based on 
wives’ responses to questions about 
whether they or their husbands decided 
each of eight important family matters, 
such as when to buy a car, where to
live, and how much money should be
saved. Answers were placed on a five- 
point scale from “husband only” to 
“wife only,” and scores were added 
across the eight items. Heer (1958) 
asked husbands and wives jointly which 
parent won out in a case of disagree- 
ment. The Herbst Day-at-Home meas- 
ure (1952) consists of a series of ques- 
tions about relatively trivial household 
activities in which the child is asked to 
report who does the activity and who 
decides about it. It has been adapted by 
several researchers (e.g., Aronfreed, 
1955; Altucher, 1956; Hoffman, 1960) 
to measure husband-wife power. Hoff- 
man, for example, developed a scoring 
system whereby each pair of doing and 
deciding items was considered as a unit. 
The number of activities the mother 
decided but the father did was sub- 
tracted from the number of activities 
that the father decided but the mother 
did. 
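
The two scoring rules just described, Wolfe's summed five-point scale and Hoffman's difference score, might be sketched as follows. The text gives only the end points of Wolfe's scale, so the three intermediate labels, the numeric direction of the coding, and all example data below are assumptions made for illustration.

```python
# Five-point coding for the Wolfe items. Only the two end points appear
# in the text; the middle labels and the 1-5 direction are assumed.
WOLFE_SCALE = {
    "husband only": 1,
    "husband more than wife": 2,
    "husband and wife equally": 3,
    "wife more than husband": 4,
    "wife only": 5,
}

def wolfe_score(answers):
    """Add the scale values across the eight family-decision items."""
    if len(answers) != 8:
        raise ValueError("the Wolfe measure uses eight items")
    return sum(WOLFE_SCALE[a] for a in answers)

def hoffman_score(pairs):
    """Score paired deciding/doing items from the Day-at-Home measure.

    Each pair is (who decides, who does). The score is the number of
    activities the father decided but the mother did, minus the number
    the mother decided but the father did.
    """
    father_decides = sum(1 for d, doer in pairs
                         if d == "father" and doer == "mother")
    mother_decides = sum(1 for d, doer in pairs
                         if d == "mother" and doer == "father")
    return father_decides - mother_decides

# Hypothetical responses.
wolfe_answers = (["husband only"] * 2
                 + ["husband and wife equally"] * 4
                 + ["wife only"] * 2)
activity_pairs = [("father", "mother"), ("father", "mother"),
                  ("mother", "father"), ("mother", "mother"),
                  ("father", "father")]
wolfe_total = wolfe_score(wolfe_answers)
hoffman_total = hoffman_score(activity_pairs)
```

Under this assumed coding a low Wolfe total indicates husband dominance, and a positive Hoffman score indicates that the father more often decides what the mother carries out.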

It is interesting and reassuring to note 
that the different measures of husband- 
wife power have been found to relate 
in the same way to many other vari- 
ables. For example, father dominance 
has been associated with high socio- 
economic status, using the Strodtbeck
measure, the Wolfe measure, the Heer
measure, and the Hoffman measure. The 
last three have also shown the same 
relationships between husband-wife 
power, on the one hand, and size of 
family and maternal employment, on 
the other. 

Strodtbeck and Kenkel also analyzed 
the interaction in terms of the Bales 
(1950) categories. Thus these measures 
can be used to obtain data about the 
division of certain goal-directed and
emotional functions between the hus-
band and wife, e.g., which parent per- 
forms the integrative function of re- 
establishing cohesion after situations of 
stress. 

Measuring the division of household 
tasks between the mother and father can 
be done by a relatively direct question- 
naire, so long as the items are specific 
and are not threatening and the answers 
can be checked from a preconstructed 
list. Although there have been many 
variations of this technique, the proto- 
type has been put forth by Herbst 
(1952). The Herbst Day-at-Home 
measure consists of 33 questions about 
different household tasks (e.g., Who 
mows the lawn?). Each item is followed 
by a list of family members, and the 
respondent is asked to circle the per-
sons who always do it or whose job
it is to do it, to underline those who
do it sometimes or help, and to cross 
out all those who would never do it. 
(Each participation item has a counter- 
part that asks who makes the corre- 
sponding decision.) Herbst selected the 
particular items used by considering the 
distribution of the activities with re- 
spect to four regions—household duties, 
child care and control, social activities, 
and economic activities. He also con- 
sidered a time sequence for an ordinary 
work day. With these two considera- 
tions in mind, he attempted to get an 
“unbiased, random distribution” of ac- 
tivities, including, however, only those 
that could be engaged in by either par- 
ent. This measure was actually devel- 
oped for use with child respondents 
and, as such, might more properly be- 
long in the later section on child per- 
ceptions. We have included it here for
two reasons. First of all, with the pos-
sible exception of the child-care items, 
the questions concern activities that are 
rather objective and free of emotion 
for the child. The child may therefore 
be thought of simply as the informant 
about what takes place in the house- 
hold. In fact, it is Herbst’s opinion that 
the child is a better informant about 
such things than the parent, for parents 
tend to give a picture of greater mutu-
ality and cooperation than actually ex-
ists. Second, we include it because the
same measure has also been used with 
parents. 

Several variations have been added to 
the original measure, e.g., a differen- 
tiation between responses when the 
mother and father participate in the 
activity jointly and when they partici- 
pate separately and a check list that 
asks the respondent to report the fre- 
quency of each activity for each parent. 
The answers have been scored to give 
a measure of husband-wife interaction, 
sex-role reversal, amount of participa- 
tion, areas of participation, etc. Despite 
several criticisms (Yamamura and Zald, 
1956), this measure has received much 
validational support. For example, Hoff- 
man (1960) found a relationship be- 
tween a measure of sex-role reversal 
constructed from children’s responses 
and the parents’ responses to an attitude 
scale measuring sex-role ideology. 

The affectional relationship between 
mother and father has probably been 
considered in child-development studies 
more than the power relationship or the 
division of functions. Certainly the ana-
lytic literature is filled with case his-
tories that illustrate the relationship be-
tween marital tension and maladjust-
ment of the child, and a number of
theories link these two variables. Mari- 
tal tension has been linked to child mal- 
adjustment somewhat directly in the 
theory that the child perceives the ten- 
sion and therefore feels insecure and 
less directly in theories that state marital 
tension leads to parental rejection be- 
cause (1) the child symbolizes the par- 
ents’ inability to dissolve the unhappy 
marriage, (2) the child resembles the 
disliked spouse, (3) the child becomes 
the scapegoat for uncommunicated areas 
of marital tension, and (4) the parents 
have a generally low morale. It has also 
been suggested that disappointment in 
her mate leads the mother to invest too 
much emotion in the child. Marital ad- 
justment might also be linked to the 
child by the intervening variables of 
insufficient or inconsistent parental au- 
thority. Finally, it is possible that mari-
tal tension and child maladjustment are
spuriously associated, in that certain 
personality traits of the parent lead to 
a rejection of both spouse and child. 

There are, however, very few ade- 
quate empirical studies in this area. 
No one really has studied the affec- 
tional relationship between the mother 
and father as it affects the child and also 
operationalized the linkages between the 
two. Furthermore, the studies that have 
tried to relate marital adjustment to 
parental attitudes or to child variables 
have produced a pattern of findings 
difficult to integrate. 

Porter (1955) and Hawkes, Burch- 
inal, and Gardner (1956) have examined 
the relationship between marital adjust- 
ment and parental acceptance of the 
child as measured by the Porter Parental 
Acceptance Scale. Porter measured mar-
ital adjustment by a modified question-
naire based on previous scales developed 
by Terman et al. (1938), Burgess and 
Cottrell (1939), and Locke (1951). 
Hawkes, Burchinal, and Gardner devel- 
oped their measure from the marriage- 
success indices of Burgess and Wallin 
(1953). Porter reports a relationship be-
tween marital adjustment and parental
acceptance, using combined mother-
father data. Hawkes, Burchinal, and
Gardner, doing separate analyses for 
mothers and fathers, found no such re- 
lationship for mothers and a low one 
for fathers. 

Turning to the studies that have ex- 
amined the relationship between marital 
adjustment and child personality di- 
rectly, we find at least two (Stroup, 
1956; and Burchinal, Hawkes, and 
Gardner, 1957a) that suggest that no 
such relationship exists. Stroup tried to 
relate mother’s marital adjustment, 
measured by the Kirkpatrick Interests
Scale, to the child’s adjustment, meas-
ured by the California Personality
Scale. Burchinal, Hawkes, and Gardner 
tried to relate marital adjustment of 
both parents, measured by items from 
scales developed by Burgess and Wallin, 
to the child’s adjustment, measured by 
the Rogers Test of Personality Adjust- 
ment. 

The Baruch studies (1937, 1944), on 
the other hand, present a different pic- 
ture. Using clinical-style interviews 
with both parents to obtain data about 
marital adjustment, Baruch tried to iso- 
late the areas of marital tension that 
were important for predicting to the 
child’s adjustment. She found tension 
over sex, lack of consideration, insuffi- 
cient expression of affection, the as-
cendance-submission relationship, and
the inability to talk things through all
related to maladjustment in the child,
measured by behavioral observations
in a nursery-school setting. The first
three tension areas she sees as similar
and inter-related, and she refers to them
as “security-giving factors.” Ascend-
ance is seen as an “adequacy-giving fac-
tor.” The inability to talk things over
is seen as a factor that intensifies the 
other two. She suggests that these fac- 
tors are the essential aspects of the 
marriage relationship. In the second 
study (1944) Baruch examines the ef- 
fects of marital tension on boys and 
girls separately and notes that marital 
tension has a greater influence on girls 
than on boys. This is like a replication 
of another finding of the study—that 
the mothers in her study were more 
influenced than the fathers by their 
own parents’ marital adjustment. Fur-
thermore, it ties in with studies by
Toby (1957) and Nye (1958) that deal
with the relationship between family
disorganization and juvenile delin- 
quency. Both investigators note that 
this relationship is stronger among ado- 
lescent girls than among adolescent 
boys. Although their explanations differ 
somewhat, both Toby and Nye sug- 
gest that since adolescent boys are gen- 
erally unsupervised, as compared to 
girls, any factor that leads to a lack 
of supervision will disproportionately 
affect girls. In this sense, both investi- 
gators conceptualize marital discord in 
terms of less parental supervision, and 
explain the sex differences in sensi- 
tivity to marital discord accordingly. 
If the Baruch finding with preschool 
children is part of the same pattern as
that manifested in the juvenile-delin-
quency findings, it may be that the lack
of supervision is not the genotypic vari-
able. In any case, it would be worth-
while to study further the differential
responses of boys and girls to marital
tension.

Another sex difference reported by 
Baruch is that the one area of tension 
that influenced the adjustment of boys
more than girls is ascendance-submis-
sion. This finding is especially inter-
esting in line with our earlier discussion
of mother-father power relations and
identification. Is it possible that mother-
dominance is a pattern more likely to
lead to tension in the area of ascend-
ance-submission? Is it possible that
mother-dominance and cross-sex iden-
tification are more dysfunctional to boys’
development than to girls’? Or is as-
cendance-submission a more salient area 
in general for boys? It is an unfortunate 
indication of the discontinuity of fam- 
ily research that the Baruch study has 
not been followed up with attempts to 
determine why certain areas of tension 
were especially crucial, to distinguish 
further the areas in which the chil- 
dren’s maladjustments were manifested, 
and to understand the process by means 
of which these tensions affected the 
child. 

It is also significant, and should be
investigated further, that the Baruch
study, using clinical-style interviews,
found a relationship between marital 
tension and child adjustment, whereas 
the Stroup study and that by Burchinal,
Hawkes, and Gardner, using marital ad-
justment scales, did not. This differ-
ence, in combination with the abun-
dance of psychoanalytic case histories
illustrating the linkage of parents’ mari-
tal relations to children’s personality dis- 
orders, suggests that the clinical inter- 
view is better than the marital adjust- 
ment scales for detecting those aspects 
of marital tension most crucial to the 
child’s development. 

Nye (1957, 1958), using adolescent
reports of parental adjustment, also finds
mother-father tension linked to the 
child’s maladjustment, specifically to 
psychosomatic illness, delinquency, and 
strained parent-child relations. In fact, 
both Nye and Landis (1955) have 
helped to pinpoint certain aspects of 
the marital tension situation by compar- 
ing child adjustment in unbroken but 
unhappy homes with child adjustment 
in broken homes and in happy un- 
broken homes. They find, in general, 
that the unhappy unbroken homes pro-
duce the most disturbance in the child.
These findings may mean that on-going 
and continual marital tension is dis- 
turbing to the child but that once the 
tension is over and the parents separate 
he can, to a certain extent, recover, 
even though he has lost one of his par- 
ents. On the other hand, there is an- 
other interpretation of these findings 
that has significance for the validity of 
using the standard marital adjustment 
scales for child development studies: 
these findings might mean that marital 
tension and the predivorce situation are 
not the same thing. That is, it is possible 
that there is a kind of marital tension 
disturbing to the child that does not 
culminate in divorce. Spiegel (1957) 
has suggested that when areas of ten- 
sion between parents are not communi- 
cated the child becomes a scapegoat. 
This situation is functional to the mar-
riage but dysfunctional to the child.
If there is a form of marital tension 
that is dysfunctional to the child but 
does not lead to divorce, then the in- 
adequacy of the standard marital ad- 
justment scales in predicting to child 
adjustment is explained. Most of the 
existing marital adjustment scales have 
been developed as predictors of divorce 
and have often been standardized and 
validated by comparisons between mar- 
ried and divorced couples. They may 
not be sensitive to the detection of other 
kinds of marital tension that do not 
typically culminate in divorce but are 
highly disturbing to the child. This 
raises a methodological problem and a 
related theoretical one. The first in- 
volves a need to develop scales to de- 
tect unhappy marriages that do not 
end in divorce, and the second involves 
a need to identify more thoroughly 
the different areas of marital tension 
and their significance. 

Several of the standard marital ad- 
justment scales have already been men- 
tioned. Such scales include those devel- 
oped by Burgess and Cottrell (1939), 
Terman et al. (1938), Locke (1951), 
Popenoe (1937), Adams (1946), and 
Burgess and Wallin (1953). These 
scales typically include items such as
“How often do you kiss your mate?”
and are presented with a list such as
“every day, now and then, almost
never.” Respondents check their an-
swers, and these are weighted and
scored. The Kirkpatrick Family Inter-
ests Scale (1937) is considered more
indirect. It consists of a list of activities 
on which the respondent is asked to 
check the ones he enjoys. On the oppo- 
site side of the sheet the respondent
checks the ones he enjoys with his
spouse. The more enjoyed activities the 
respondent likes to do with his spouse, 
the better is his marital adjustment. 
Taves (1948) has compared the Kirk- 
patrick measure with the more direct 
Terman scale and found the Kirk- 
patrick superior in several respects. 
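
The counting rule behind the Kirkpatrick measure can be sketched as follows. This is a simplified illustration that assumes the score is just the number of activities checked on both sides of the sheet; the published scale may weight or normalize differently, and the function name and activity lists are hypothetical.

```python
def family_interests_score(enjoyed, enjoyed_with_spouse):
    """Count the enjoyed activities that are also enjoyed with the spouse.

    An activity checked only on the "with spouse" side, but not on the
    "enjoys" side, does not add to the score.
    """
    return len(set(enjoyed) & set(enjoyed_with_spouse))

# Hypothetical check marks from one respondent.
enjoyed = {"gardening", "movies", "cards", "hiking"}
with_spouse = {"movies", "hiking", "dancing"}
shared_count = family_interests_score(enjoyed, with_spouse)
```

Because respondents differ in how many activities they check at all, an investigator might prefer the shared count as a proportion of all enjoyed activities; that variant is likewise an assumption rather than a documented feature of the scale.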

Other measures of marital adjust-
ment use adolescent reports. Nye (1957,
1958) and Harter (1950) have used this 
method. Harter compared these scores 
to marital adjustment scores obtained 
from the parents. He found correlations 
between 0.48 and 0.58 and concludes 
that adolescents (high-school seniors) 
can give fairly accurate pictures of the 
degree of marital adjustment of their 
parents. 
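
A comparison of this kind rests on a correlation between two sets of scores. The sketch below computes a product-moment coefficient, assuming one adolescent-report score and one parent-derived score per family; the text does not say which coefficient Harter used, and the data are invented solely to show the arithmetic.

```python
import math

def pearson_r(xs, ys):
    """Product-moment correlation between two equal-length score lists."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two lists of equal length, at least 2 long")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of cross-products and of squared deviations from the means.
    cross = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    ss_x = sum((x - mean_x) ** 2 for x in xs)
    ss_y = sum((y - mean_y) ** 2 for y in ys)
    if ss_x == 0 or ss_y == 0:
        raise ValueError("correlation is undefined for constant scores")
    return cross / math.sqrt(ss_x * ss_y)

# Hypothetical scores for five families (not Harter's data).
adolescent_reports = [1, 2, 3, 4, 5]
parent_scores = [2, 1, 4, 3, 5]
r = pearson_r(adolescent_reports, parent_scores)
```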

Bossard and Boll (1955a) have used 
ratings made by siblings of the married 
couple as a measure of marital happi- 
ness. Strodtbeck (1952) has also inter- 
viewed persons about the marital ad- 
justment of friends. This may not be a 
very practical technique for general 
use, but it does offer possibilities for 
validating other measures. 


PERSONAL CHARACTERISTICS 
OF PARENTS 3 


In this section we consider studies 
that attempt to link the parent’s per- 
sonality or general social attitudes (1) 
directly to child variables, (2) to par- 
ent child-rearing practices or attitudes 
without going as far in the causal se- 
quence schema as child variables, (3) 
to both child-rearing orientations and 
child variables.

Linking Parent Characteristics Di-
rectly to Child Variables. The paper by
Rosenzweig and Isham (1947) illustrates 
the attempt to jump the wide “sociali- 
zation process gap” between parent per- 
sonality and child personality. These 
writers present an analysis of the com- 
plementary personality trends in the 
TAT protocols of a mother and her 
16-year-old son. Most of the analysis 
is focused on understanding the current
interpersonal situation between the
mother and son rather than exploring
the mother’s personality structure as
a source of trends in the son’s person-
ality pattern, although they mention
the latter as a desirable and feasible type
of analysis. They feel the TAT is more
suited to this kind of complementary
personality analysis than the Rorschach 
(Buhler, 1943; Beck, 1945), “since it is 
more sensitive to such problems of 
adjustment as would involve the vari- 
ous members of the patient’s personal 
world.” Such analyses of “complemen- 
tary patterns” do not demonstrate a 
definite sequence of causality nor shed 
light on the process involved. This re- 
quires research designs that are still 
largely absent from the research liter- 
ature. 

Frenkel-Brunswik and Havel (1953) 
compared the ethnocentric attitudes of 
parents and children as determined in 
interviews. The correlations between 
parent and child ethnocentrism were all 
positive and ranged from 0.20 to 0.60, 
but the investigators note that such fig- 
ures give no insight whether ethnocen- 
tric attitudes are transmitted directly or 
by indirect processes. In their investi- 
gation of the same variables with a 
larger population Bird, Monachesi, and
Burdick (1952) found correlations in
the high 0.20’s between the attitudes of 
parents and their third-, fourth-, and 
fifth-grade children. These investiga- 
tors, assuming a process of direct com- 
munication, believe the correlations are 
low because parents refrain from ex- 
pressing their own attitudes to their chil- 
dren because of guilt or ambivalence. 
The question of the processes interven- 
ing in the communication of prejudice 
was considered with more methodologi- 
cal care by Radke Yarrow, Trager, and 
Miller (1952), as we shall see in the next 
section. 

Several studies, for example, those by 
Gray and Klaus (1956) and Payne and 
Mussen (1956), have used the compari- 
son between parent characteristics and 
child characteristics as a measure of 
identification. Gray and Klaus com- 
pared three different measures of iden- 
tification: similarities between parent 
and child responses to the Allport-Ver- 
non-Lindzey Study of Values, similari- 
ties between the child’s actual responses 
and his perception of how each parent 
would respond, and the child’s warmth 
toward his parents as expressed in sen- 
tence-completion items. They showed 
that for each of these measures of iden- 
tification there was more like-sex 
parent identification. Furthermore, they 
showed that the three identification 
concepts are related but not identical. 
Payne and Mussen secured California 
Personality Inventories from adolescent 
boys and their parents. Identification 
with father was measured by subtract- 
ing the number of mother-son agree- 
ments from the number of father-son 
agreements. Story completions were 
used for obtaining the boys’ imagery of
parent-child relationships. They ex-
amined the relationship between the
warmth of perceived parent-child re-
lations and identification and between
parent personality and identification. In
addition, they obtained teacher ratings
of the boys on social-emotional adjust-
ment and related these to identifica-
tion. Unfortunately, the parent per-
sonality and identification measures
were not independent, since both were
obtained from the responses to the
California Personality Inventory. How-
ever, the idea of predicting from parent
personality to identification rather than
directly to child personality or behavior
offers a promising line for further re-
search.
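
The Payne and Mussen identification score described above can be sketched as a simple difference of agreement counts. The sketch assumes that each inventory item is answered true or false and that an "agreement" means an identical answer on the same item; the function names and the six-item example are hypothetical.

```python
def count_agreements(a, b):
    """Number of inventory items on which two respondents gave the same answer."""
    if len(a) != len(b):
        raise ValueError("both respondents must answer the same items")
    return sum(1 for x, y in zip(a, b) if x == y)

def father_identification(son, father, mother):
    """Father-son agreements minus mother-son agreements."""
    return count_agreements(son, father) - count_agreements(son, mother)

# Hypothetical true/false answers to six inventory items.
son_answers = [True, False, True, True, False, True]
father_answers = [True, False, True, False, False, True]
mother_answers = [False, False, True, True, True, False]
identification = father_identification(son_answers, father_answers,
                                       mother_answers)
```

A positive score means the boy's answers resemble his father's more than his mother's. Since the parents' own answers enter the score, it cannot serve as an independent check on a parent-personality measure drawn from the same inventory, which is the limitation noted in the text.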

Goodstein and Dahlstrom (1956) il- 
lustrate a different methodological ap- 
proach. They selected a group of chil- 
dren with the same personality disturb- 
ances (stuttering) and a control group 
of normals and administered personality 
tests (MMPI) to the mothers. They 
found no significant differences in the 
personality patterns of the two sets of 
mothers. With this type of compara- 
tive design, it is difficult to discover the 
linkage between parent variables and 
child variables because of the complex 
global nature of the dependent child 
variable as well as the size of the jump 
in the socialization process. 

The direct linking of parent and 
child characteristics seems to be a less 
rewarding focus for research than start- 
ing with parent characteristics and 
moving through intervening variables 
toward child personality and behavior. 

Linking Parent Characteristics to 
Child-Rearing Variables. There is a 
variety of studies that do not attempt
to relate parent characteristics to child
variables but make their contribution 
to an understanding of the socialization 
sequence by studying the connection 
between the parent's personality or val- 
ues and his child-rearing orientation or 
practice. 

In a very interesting exploratory
study Sigel and Hoffman (1956) ad-
ministered 13 TAT-type cards to par-
ents. These cards depicted all possible
combinations of parents and children:
mother-son, father-daughter, family
scenes of parents and children, cards
of children alone, etc. Projective stories
were elicited, using the standard instruc-
tions. These parent protocols were re-
lated to extensive data on the influence-
technique patterns used by parents with 
their children. The influence-technique 
data were secured by an interview 
that will be taken up later. In the 
pilot report the authors make predic- 
tions from the projective stories of three 
fathers to their interaction patterns with 
their preschool-age children. The analy- 
ses and predictions from the personality 
data were made by one judge, and a 
second judge independently analyzed 
the materials on the influence techniques 
used by the parent. The two sets of data 
were then compared. This pilot study 
indicates a high degree of predictive 
success. The investigators believe this is 
the result of using test stimuli corre- 
sponding in content to the area of real 
life under study, predicting from a co- 
herent personality picture of the re- 
spondent, using the entire test record, 
and conceptualizing in advance the be- 
havioral situation toward which the pre- 
dictions are made. 

Zemlick and Watson (1953) report
an interesting design for studying the
mother’s “psychic acceptance-rejec- 
tion” as a determinant of attitudes and 
adjustment during and after pregnancy. 
TAT stories, sentence completions, and 
judges’ ratings were used to assess the 
degree of rejection of femininity, sexu- 
ality, marital status, and motherhood. A 
separate assessment was made of psy- 
chosomatic and physiological symptoms 
of anxiety and disturbance. These per-
sonality variables were related to ratings
of adjustment to labor and delivery
and time-sampling observations of
mother-behavior patterns and verbaliza-
tions with the baby after birth. The
personality measures of rejection and 
disturbances were positively related to 
problems of adjustment in the prenatal 
behavior patterns and attitudes of the 
mother but negatively related to the 
postnatal observations. This led the in-
vestigators to suggest that basic rejec-
tive attitudes are often expressed in 
overprotective maternal behavior. 
There are several points in this study 
where one might wish for more ade- 
quate procedures, but the over-all de- 
sign is suggestive of the fruitfulness of 
studying adjacent phases of the proc-
ess sequence.

Several studies have used the authori- 
tarian-egalitarian dimension in seeking a 
connection between parent characteris- 
tics and child-rearing attitudes and prac- 
tices. Levinson and Huffman (1955) 
made a methodological contribution in 
developing a test of Traditional Family
Ideology in which the respondents indi- 
cate three degrees of agreement and 
disagreement with a series of items 
about parent-child relationships, hus- 
band-wife relationships, male and fe-
male roles, and general aims of the fam-
ily. The TFI scale was found to be 
highly related to the Ethnocentrism (E) 
and Authoritarianism (F) scales. It also 
differentiated between persons with dif- 
ferent religious preferences and prac- 
tices. Hart (1957) carried the analysis 
further with a population of mothers 
of preschool children. These mothers 
responded to the TFI scale and the 
F scale and also reported what they 
would do in response to 38 different 
child-behavior situations. The findings 
indicate a relationship between the 
mothers’ reports of how they would be- 
have toward their children and the 
authoritarian-egalitarian values meas-
ured by the tests. The investigator re-
lates this study to those that show simi-
larities between the authoritarianism of
mothers and that of children, but notes 
that further data are needed to com- 
plete the causal chain. 

Kates and Diab (1955) explore the 
relationship between the F scale and 
Shoben’s Parent Attitude Survey, which 
scores parent attitudes of dominance, 
possessiveness, and ignoring. A signifi-
cant relationship was found, but the
pattern for male and female respondents
was markedly different. The findings 
fit in with the view expressed by Mc- 
Candless (1958) that the link between 
personality needs and child-rearing 
philosophies may be considerably higher 
for mothers than for fathers, for whom 
there is less psychological involvement 
in child-rearing practices. However, 
whether the fact that the relationship is 
stronger for females is due to differences 
in mother and father roles or a reflection 
of wider sex differences is an important 
question that requires further research.

Shapiro (1952) used Eysenck’s social
and political attitude inventory to ob- 
tain scores on radicalism-conservatism 
and tendermindedness-toughminded- 
ness. These scores were found to relate 
significantly to those obtained on an 
inventory of 40 child-rearing items 
scored on the dimensions of restrictive- 
ness-permissiveness and objective-emo- 
tional. 

Brodbeck, Nogee, and DiMascio 
(1956) worked with a much clearer 
conceptual direction, starting with
Riesman’s ideas of other-directed and
inner-directed character structure. They
studied 30 mothers with young chil- 
dren in homogeneous lower middle- 
class socioeconomic family situations. 
The mother checked multiple-choice 
responses to 39 different mother-child 
interaction situations in three ways: first, 
indicating what she actually did in these 
situations, second, what she thought she 
should do, and third, what her neigh- 
bors usually did. The authors inter- 
pret their findings as indicating that 
the mother’s behavior patterns were 
more influenced by internal than exter- 
nal norms, although they recognize the 
methodological difficulty of using only 
mothers’ reports. 

Patterson (1943) found that the “neu- 
roticism” and “dominance” scores ob- 
tained by a group of mothers on the 
Bernreuter Personality Inventory re- 
lated to their scores on the Fels battery 
of Parent Behavior Ratings. The fact 
that high dominance scores seemed to 
go with positive and democratic parent- 
behavior patterns led the investigator to 
reconceptualize the dominance variable 
to stress the dimensions of confidence 
and extraversion as being implicit in
the measure. 

Parent Characteristics, Intervening 
Linkages, and Child Variables. A 
very interesting research sequence is 
represented by the study of McClelland, 
Rindlisbacher, and deCharms (1955) 
complemented by that of Winterbot- 
tom (1958). In the first study relation- 
ships were found between the parents’ 
value orientations, as inferred from 
group membership (Protestant, Jewish,
Irish-Catholic, and Italian-Catholic), 
and their expectations regarding the 
child’s attaining independence, as meas- 
ured by a questionnaire developed by 
Winterbottom. The next step in the 
sequence is Winterbottom’s study in 
which middle-class mothers of
8-year-old boys reported their goals
concerning 20 kinds of independence and
mastery behavior and their expectations
regarding the age at which their children
would learn the particular behavior. 
Winterbottom also secured parent re- 
ports of reactions to the child when he 
did and did not fulfill parent expecta- 
tions. Need achievement in the boys 
was coded from a set of stories told in 
response to verbal cue situations such 
as “A mother and her son. They look 
worried.” Significant relationships were 
found between the need-achievement 
scores of the boys and the child-rearing 
orientations of the mothers. In addition, 
need achievement in the boys was re- 
lated to teachers’ ratings regarding 
achievement. Furthermore, Winterbot- 
tom observed each of the boys in a 
standardized problem-solving behavior 
situation and found significant relation- 
ships between the need-achievement 
scores and such performance variables
as not requesting help, refusing offers
of help, and turning down opportunities
to stop for a rest. Although the samples 
are small and restricted in range, the 
two studies together present a signifi- 
cant model of theoretical linkage, sup- 
ported by careful measurement proce- 
dures, between parental values and 
practices and child personality and
behavior.

Strodtbeck (1958) builds creatively
upon this research sequence. Instead of
starting from the fact of group
affiliation as a parental value index,
Strodtbeck also utilized a set of value
orientation scales, which differentiated
between the Jewish and Italian-Catholic
families in the study. A factor analysis 
of the value questionnaire indicated that 
differences in cultural and religious affil- 
iation were reflected in values concern- 
ing mastery or rejection of fate and 
independence or dependence of the 
family. Unfortunately, this study did 
not include data about child-rearing 
practices similar to the Winterbottom 
questionnaire, but a significant meth- 
odological contribution was made by 
setting up standardized behavior situa- 
tions in which the interaction between 
mother, father, and son could be ob- 
served in a series of decision-making 
situations. Strodtbeck moves from par- 
ental value orientations, to mother-fa- 
ther-child interaction, to values, person- 
ality, and behavior of the son. An im- 
portant finding is that the son’s 
value orientation is a function of 
parent-child interaction (mother-fa- 
ther-child power) and not a function 
of the parents’ ideology. As Strodtbeck 
remarks, “a clear case of the children
believing what the parents do and not 
what they say.” The sample is too small 
to permit a careful analysis of some of 
the relationships, but the findings and 
methodology suggest important vistas 
for future investigation. 

Some of the intervening child-rearing 
processes between parent prejudice and 
child prejudice have been explored by 
Radke Yarrow, Trager, and Miller 
(1952). One hundred mothers and their 
first- or second-grade children were 
studied. Parent prejudice and child-rear- 
ing practices were measured by a struc- 
tured interview schedule, which in- 
cluded the presentation of a number of 
interracial and interreligious problem 
situations for analysis. Parent prejudice 
was linked to such attitudes and prac- 
tices as restriction of the child’s play- 
mates and school associations, feelings of 
responsibility for the development of 
the child’s beliefs and attitudes, and dif- 
ferences in parental teaching about own 
group membership. The investigators 
find that parents’ attitudes toward a par- 
ticular racial or religious group are 
correlated with their children’s attitudes 
but that there are many individual cases 
in which large discrepancies exist. The 
authors feel that a “more intensive study 
of the attitudes, of the individual per- 
sonalities and of the relationships within 
the family” is needed to explain such 
discrepancies. The study does suggest 
that the transmission of attitudes and 
values in this area is to a high degree 
not direct or planned. 

Behrens (1954) has attempted to con- 
nect mother’s character structure to 
three child-rearing practices and then 
to child adjustment. The children
ranged in age from 2 to 6. The measure- 
ments were ratings derived from exten- 
sive intake interviews with mothers at 
a mental-health clinic. The character 
structure scales focused on such vari- 
ables as the mother’s self-image, self- 
mastery, range, quality, and control of 
affect, and a rigidity-flexibility dimen- 
sion. The same rater made ratings of 
whether or not parental practices in 
the areas of toilet training, feeding, and 
weaning were adapted to the child’s 
needs. These scales ranged from neg- 
lectful to permissive behavior patterns. 
A series of variables designating “ade- 
quacy of adjustment” were coded from 
child interviews. These included secur- 
ity with mother, reaction to separation 
from mother, self-assertion, response to 
discipline, quality of social relationship. 
The general personality or character 
scores of the mother did not relate 
closely to the reported child-rearing 
practices, nor did the various
child-rearing practices relate closely to
each other. Furthermore, there was only
a slight relationship between the
child-rearing practices
and child adjustment scores. There is, 
however, a high relationship between 
the general mother personality scores 
and the child adjustment scores. Behrens 
believes this indicates that the child is 
highly sensitive and responsive to un- 
conscious attitudes of the mother, 
which are not revealed in child-rearing 
practices. It is also possible that the par- 
ticular child-rearing practices selected 
for study were not the crucial ones or 
that measuring them by means of 
mother reports was not valid. Thus it is 
difficult to appraise whether in fact the 
mother’s personality is more important 
than child-rearing practices and whether
it is directly transmitted to the child. 

Two other studies, Stolz (1954) and
Sears, Maccoby, and Levin (1957), use
parent personality as assessed from
parent interviews as an independent
variable and link this to child variables
through parent attitudes and practices.
Stolz and collaborators scored their
extensive father interviews in terms of
the personality schema developed by
Freedman, Leary, Ossorio, and Coffey
(1951) to examine the relationship
between the father’s self-perception and
his perception of his first-born child. In
addition to the father personality
materials, both parents reported attitudes
and behaviors in such child-rearing areas 
as eating, sleeping, elimination, and re- 
lations with siblings. Furthermore, the 
attitudes and behavior patterns of the 
children were assessed by a variety of 
techniques, including behavior observa- 
tion and projective situations described 
later in this chapter. The study consists 
of several discrete analyses, each
focused on the comparison between
war-separated and nonwar-separated groups.
Each level of analysis is related to the 
major independent variable rather than 
being related to one another to estab- 
lish a chain of causality. However, in 
the case study presentations, and in the 
theoretical sections, the chain of linkage 
is brought out, tying father personality 
(and wartime experience) to the child’s 
personality and behavior through the 
father’s attitude, behavior, and relation- 
ship with the child. 

Perhaps the most comprehensive 
study of parental patterns is that of 
Sears, Maccoby, and Levin (1957).
From an extensive interview schedule
with mothers, they coded data
concerning parental attitudes, practices, and
child behavior. They derived several
“underlying traits” by means of factor 
analysis. These include such variables as 
permissiveness and strictness,
self-acceptance and self-confidence, emotional
warmth, and aggressiveness-punitive- 
ness. Since all the data are derived from 
mother interviews, the measures are not 
independent, and the study must be 
viewed as an exploratory one. Noninde- 
pendence of measures is of particular 
concern with respect to the relationship
between parent personality and
child-rearing practices because the personality
dimensions have been derived from the 
data on parent practices. Nevertheless, 
future studies should be able to profit 
from the suggestions of these writers
about personality dimensions of
importance in the study of the
socialization process. They do link their basic
dimensions of parent orientation to 
child effects, though the latter are de- 
rived from parental reports of child be- 
havior. In previous studies the linkage 
to independent measures of child be- 
havior in projective doll-play behavior 
situations has been made (Sears, Pintler, 
and Sears, 1946; Sears, 1951). 


CHILD-ORIENTED PARENTAL 
ATTITUDES 


Several extensive efforts have been 
made to develop scales of attitudes to- 
ward the parent role and toward child- 
rearing. Porter’s (1954) emphasis was 
on obtaining an attitudinal index of 
parent acceptance of the child. He felt 
that such an instrument should make it 
possible to bypass the more complex 
and uneconomical behavior
observations and situational measurement
procedures. The measure consists of 20
descriptions of child behavior. Respond- 
ents indicate their feelings and probable 
behavior in each situation by selecting 
from the alternative responses provided. 
Porter found that individual responses 
showed a high internal consistency and 
that scores differentiated among par- 
ents. Burchinal (1958a) extended the 
methodological work with this instru- 
ment and found meaningful patterns, 
i.e., that mothers appear more accepting 
than fathers, older mothers, more ac- 
cepting than younger mothers, and 
more educated fathers, more accepting 
than less educated fathers. But difficul- 
ties appeared in the use of the scale 
when Burchinal, Hawkes, and Gardner 
(1957b) attempted to use it to link 
parent attitudes to child adjustment 
variables. They found no significant 
connections between parents’ responses 
to the Porter measure and children’s
responses to the Rogers Test of
Personality Adjustment. In another more
extensive analysis Burchinal (1958b)
reports using both the Parental
Acceptance Scale and Shoben’s Parent-Attitude
Survey with a group of parents and the
Rogers Test of Personality and the
California Test of Personality with 
children. Almost none of the correla- 
tions between parent-attitude scores 
and child-personality scores were sig- 
nificant. The investigators conclude 
that perhaps attitudes as reported by 
parents and as perceived by their chil- 
dren are different phenomena. It is also 
possible that parent attitudes and
practices as reported by parents are
significantly discrepant from actual
parent-behavior patterns, or that children’s
personality trends as revealed by these 
self-report instruments may not be tap- 
ping the genotypic dimensions that are 
most related to parental behaviors. 

Shoben’s Parent-Attitude Survey
(1949) taps a wider variety of
attitudinal dimensions than the Porter scale.
One hundred and forty-eight items
were administered to 50 mothers of
problem children and 50 mothers of
nonproblem children. The respondents 
checked a four-point scale from 
strongly disagree to strongly agree. 
Eighty-five of the items were found to 
differentiate between the two groups 
of parents. These items were further 
subdivided into three clusters—domi- 
nance, possessiveness, and ignoring. 

Gordon (1957) had mothers of
preschool children fill out the Shoben
questionnaire before a twelve-day camp 
program in which both mothers and 
children participated. The camp staff 
then observed the mothers and rated 
them on dimensions comparable to 
those tapped on the questionnaire. No 
clear-cut relationships between atti- 
tudes and behavior were found. 

Trapp and Kausler (1958) had more
success in their work with 16
preschool children and their mothers and
fathers. The parents answered the 
Parent-Attitude Survey. The children 
were observed at school for ten activity 
samples of five minutes each. Predic- 
tions were made from the parent-domi- 
nance scores and parent-difference 
scores to the child-behavior variable of 
“avoidance of adult contacts.” The pre- 
dictions were successful at a modest 
level of significance, and the combined 
parent scores were a more powerful 
predictor for children of both sexes
than individual parent scores. 

The study by Gordon and the one 
by Trapp and Kausler, considered in 
combination, tend to support Behrens’ 
notion that the link between parent
attitudes and the child is not through
parent behavior. However, it is possible
that the crucial parent-behavior areas 
have not been examined. Thus it may 
be that attitudes do affect parent
behavior but that Gordon, Behrens, and
others have not selected the appropriate 
behaviors for study. 

Duvall (1946) has developed a simple 
method of obtaining conceptions of 
parenthood by getting open-ended re- 
sponses to two simple questions, “What 
are five good things a child does?” and 
“What are five good things a mother 
does?” Responses are scored in terms of 
a traditional versus a developmental ori- 
entation to the parental role. Elder 
(1949) reports a study of 32 fathers, 
using this technique in addition to more 
extensive interviewing. He found that 
developmentally oriented fathers re- 
ported more interaction with their 
children both at home and away from 
home and that traditionally oriented 
fathers disciplined for a larger number 
of reasons and used a smaller variety of 
punishments. The methods used, how- 
ever, do not permit any clear conclu- 
sions to be made, since the fathers re- 
port both the attitude and the practice 
data. 

Schaefer and Bell (1958) have re- 
viewed the methodological problems 
and conceptual ambiguities of previous 
parent-attitude scales and attempted to 
profit from these errors in developing 
their Parental-Attitude Research
Instrument (PARI). A variety of concepts
was selected from previous studies;
and attitude scales of five to ten items 
were developed, tested for reliability, 
and used for comparative measurement 
of populations of mothers of differing 
education. The final scale consists of 
23 subscales of five items each. A factor 
analysis is also reported (Schaefer and
Bell, 1957); this yields five relatively
independent factors, which overlap in 
part with the factor analysis of the Fels 
Parent Behavior Scales (Roff, 1949). 

An interesting methodological issue is 
revealed by the contrasting findings of 
Helper (1958) and Jourard and Remy 
(1955). Helper obtained parent evalu- 
ations of junior-high-school children on 
dimensions of favorability and accept- 
ance. The children made self-evalua- 
tions on comparable scales. The rela- 
tions between the parents’ evaluations 
of the children and the children’s self- 
evaluations were quite low in contrast 
to the findings of Jourard and Remy. 
In Jourard and Remy’s study the child 
reported both the self and the parent 
evaluation, and this may account for 
the high relationships found. As Helper 
points out, however, the children’s
reports may actually be more accurate
measures. That is, the parent’s behavior
may reveal attitudes that the children
perceive and report but which the
parents do not acknowledge. 

Cass (1952a) reports a study which
attempts to link parent attitudes to 
parent control techniques, parent-child 
conflict, and the child’s identification 
with the parent. Mothers and their teen- 
age children answered similar attitude 
questionnaires, and the mother also an- 
swered a second questionnaire about 
her child’s attitudes. One of the parent
variables, mother’s “awareness” of the 
child, was taken as the extent to which 
her estimates of the child’s responses to 
the questionnaire agreed with the child’s 
actual responses. Identification was 
measured in terms of the degree of 
agreement between the mother’s own 
responses and the child’s responses. 
Parent control scores were computed 
from questionnaires in which the chil- 
dren rated parental practices in a num- 
ber of areas. An incomplete sentence 
test was used to measure parent-child 
conflict. The predictions were that 
there would be a positive relationship 
between the mother’s awareness of the 
child’s attitudes and the child’s identi- 
fication with the mother, that the rela- 
tionship between awareness and identifi- 
cation would be higher for mothers and 
girls than for mothers and boys, and 
that low awareness and high control by 
the mothers would be associated with 
high parent-child conflict. On the 
whole, the predictions were supported 
by the data. 
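
The two agreement indices described above can be illustrated with a minimal sketch. The actual scoring procedure used by Cass is not given here, so the sketch assumes the simplest case: each index is the proportion of questionnaire items on which two sets of responses coincide. The function name `agreement` and the five-item response lists are hypothetical and serve only as an illustration.

```python
def agreement(responses_a, responses_b):
    """Proportion of items on which two respondents gave the same answer.

    Assumed scoring rule (not taken from the study): simple item-by-item
    matching, with every item weighted equally.
    """
    if len(responses_a) != len(responses_b) or not responses_a:
        raise ValueError("response lists must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(responses_a, responses_b))
    return matches / len(responses_a)


# Hypothetical answers to a five-item attitude questionnaire.
mother_own = [1, 2, 2, 4, 3]         # the mother's own attitudes
mother_estimate = [1, 3, 2, 4, 1]    # her estimate of the child's answers
child_own = [1, 3, 2, 2, 1]          # the child's actual answers

# "Awareness": the mother's estimates compared with the child's actual answers.
awareness = agreement(mother_estimate, child_own)

# "Identification": the mother's own answers compared with the child's answers.
identification = agreement(mother_own, child_own)

print(awareness, identification)
```

Under this assumed rule both indices range from 0 to 1, so a prediction such as "awareness and identification are positively related" could be checked by correlating the two scores across mother-child pairs.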

Some of the problems in making the 
jump from parent attitudes to child 
personality are illustrated in a study 
reported by Harris, Gough, and Martin 
(1950) in which data were obtained on 
child-rearing opinions and practices 
from mothers of children who scored 
high and low on an ethnic prejudice 
test. Parent responses were subdivided 
into five a priori scales: authoritarian- 
ism, permissiveness, parent-child affec- 
tive closeness, parental rigidity, and 
“general good judgment.” Very few 
relationships were found between these
parental indices and ethnic prejudice
in the children. 

An interesting attempt to measure
parents’ achievement expectations for
children is the study by Little and
Cohen (1951). Thirty asthmatic and 30
nonasthmatic children, matched for age,
intelligence and school placement, were 
present with their mothers in a target- 
shooting task. Before each perform- 
ance, the mother wrote down her ex- 
pectations of the child’s performance; 
then the child stated his goal. The 
mothers of the asthmatic children had 
higher achievement expectations, as did 
their children. This method of obtain- 
ing parent and child expectations in a 
specific and immediate laboratory situa- 
tion offers many advantages over the 
usual techniques of obtaining reports 
about general values. Distortions should 
be minimized both by the specificity 
of the verbal report and by the im- 
minence of the child’s actual perform- 
ance. 


OVERT PARENTAL BEHAVIOR 
PATTERNS 


Increasing attention is being paid to 
the importance and complexity of 
measuring parent behavior and parent- 
child interaction. In part, this growing 
interest probably stems from the rather
low level of success in making direct
linkages between parent personality, 
values, and attitudes and child personal- 
ity and adjustment; and, in part, it 
probably reflects the theoretical and 
methodological interest in moving from 
correlational studies to more process- 
oriented studies of socialization. Direct 
observation of parent-child interaction 
is still rare in the research literature, as 
compared to the number of studies in 
which categories of parental behavior
are derived from parent reports or from 
child reports. In this section we will 
take up studies that use parent reports 
of behavior and those that use direct ob- 
servations. Studies using children’s re- 
ports are taken up in the following 
section. 

Parent Reports on Parent Behavior. 
One of the fullest questionnaires on 
parent behavior is that used by Radke 
(1946) in her attempt to link parent 
authority to the behavior and attitudes 
of young children. Each parent re- 
sponded to a questionnaire of more than 
100 items relating to parent practices
regarding authority and discipline. 
Some of the items included parent per- 
ceptions of the child’s responses. An 
oral questionnaire was used with the 
children, in which many of the items 
paralleled those on the parent question- 
naire. In addition, the children were 
given two projective tests—one in 
which responses were made to pictures
of pleasant and unpleasant parent-rela- 
tionship situations and the other, a doll 
play procedure. There were also three 
experimental situations in which adults 
used different degrees of authority to 
get the child to comply with orders. In- 
dependent behavioral observations of 
the children were also obtained from 
teachers. Several trends appear, such as a 
relationship between restrictive and
autocratic home discipline and the child’s
being passive and unpopular. However, 
the various steps of potential linkage— 
between parent-behavior report, parent 
perceptions of children, children’s
perceptions of parents, child projective-test
performance, and child-behavior
patterns—are explored without any
central hypotheses about the processes
involved.

Watson (1957) constructed a “How 
I am bringing up my child”
questionnaire with which he obtained
multiple-choice responses from parents about
their ways of handling 35 common
parent-child interaction situations. The
responses were scored along a
continuum from highly permissive to
enforcing strict obedience. Fathers
showed consistently higher strictness,
but there was also a great deal of
agreement between husbands and wives.
Using only the children at the extreme
ends of the strictness-permissiveness
scale, the parent practice scores were
related to personality and behavior di- 
mensions of the children, as obtained 
from projective tests, observations of a 
free-play period, questionnaires on 
child perceptions of home discipline, 
and school behavior ratings by teachers. 
Six of the investigator’s nine hypotheses 
relating parent behavior to child atti- 
tudes and behavior were confirmed. No 
doubt part of this success was achieved 
because extreme groups were com- 
pared, but credit should also be given 
to the care with which comprehensive 
reports on a single carefully chosen di- 
mension were obtained from the par- 
ents. 

Since the pioneering study of Levy 
(1928), a number of investigators have 
interviewed parents for reports on feed- 
ing, weaning, and early training prac- 
tices and attempted to link these data 
to child behavior and personality.
Thurstone and Mussen (1951) faced 
the double methodological problem of 
mail questionnaires and retrospective 
reporting in their study of child-rearing 
practices used by mothers of college-age
students. The attempt was made to link 
mother reports on feeding and weaning 
to measures of oral personality traits 
derived from TAT responses, but the 
findings are inconclusive. Maslow and 
Szilagyi-Kessler (1946) also used retro- 
spective reports in seeking information 
about breast-feeding practices. Al- 
though they used interviews rather than 
self-administered questionnaires, their 
interviewers were college students, and 
it may be that for data of this sort 
questionnaires would have been prefer- 
able to using such youthful interview- 
ers. 

The studies of Wittenborn (1956), 
Miller and Swanson (1958), Sears, Mac- 
coby, and Levin (1957) and Sewell, 
Mussen, and Harris (1955) are all illus- 
trations of the use of carefully de- 
veloped interview schedules to secure 
data on early training practices from 
parental reports. The quantitative rating 
scales used by Sears, Maccoby, and 
Levin and the intercorrelation cluster 
technique used by Wittenborn are good 
examples of the sensitive quantification 
of parental-behavior reports. The Sears, 
Maccoby, and Levin study is an
impressive effort to quantify theory-derived
dimensions of parent child-rearing
practices and interpret the meaning of these
dimensions for the process of socializa- 
tion. However, they face the difficult 
methodological problem of not having 
an independent source of data concern- 
ing the child response variables. Sears 
et al. (1953), in an earlier study, related 
mothers’ reports about child-rearing
practices to independent measures of
the child’s behavior in classrooms and in
standardized doll-play situations.
Wittenborn, working with parents and older
children (5-to-9-year-olds), used a 
promising multiple choice “Social Re- 
action Interview” with the children. 
The instructions were, “Let’s play a 
game of pretending. I'll tell you what 
happens and you tell me all the things 
you would do.” Then followed a series 
of situations to assess the dimensions of 
dependency, aggression, submission, 
sympathy, perceived strictness of train- 
ing, fears, likes, and dislikes. Because all 
of Wittenborn’s subjects were adopted 
children, he had the unusual opportun- 
ity to explore “heredity-free” relation- 
ships, although there are problems of 
selective placement. His findings are 
consistent with the majority of other 
investigators in finding moderate and 
scattered relations between child-rear- 
ing practices and independent measures 
of child variables. 

One fact of great methodological, as 
well as theoretical, importance seems to 
be documented by the studies of Mil- 
ton (1958), Sewell, Mussen, and Harris 
(1955) and Miller and Swanson (1958): 
intercorrelations among child-rearing 
practices are low. It seems probable that 
more genotypic psychological variables, 
which capture the more general and 
underlying meanings of child-rearing 
practices, need to be conceptualized and 
measured to make fruitful progress in 
linking early child-rearing experiences 
to later child personality. 

Among the studies that start from be- 
havior reports by parents and probe 
through shorter “jumps” in the sociali- 
zation process sequence is a study by 
Dreyer (1959). Dreyer scored mother 
interviews for degree of “power as- 
sertion” and other parent behaviors used 
in handling the child’s “dependency
behavior.” These parent-behavior
measures were linked to the child’s
perception of authority as obtained from a
picture-story test, level of aspiration as 
shown on a task devised by Sears and 
Levin, and behavior ratings made by 
nursery-school teachers. Significant re- 
lations were obtained between parental 
power assertion and the child’s percep- 
tion of adult authority figures as con- 
trolling rather than nurturant and as not 
being a source of help and support. 
Adult-oriented perceptions of the child 
were in turn related to the child’s per- 
ception of himself as having difficulty 
in handling problem situations. These 
self perceptions of the child were found 
to relate significantly to low levels of 
aspiration in the behavior situation.
Finally, low level of aspiration related to
teacher ratings of low level of aspira- 
tion, fearfulness, lack of flexibility, and 
lack of popularity with other children. 

An interesting attempt to obtain ob- 
jective data from parent interviews is 
the method, first suggested by Nowlis 
(1952), of using detailed accounts of all 
parent-child interactions occurring in 
a single day. This technique has been 
used in a study of parent-influence tech- 
niques reported by Sigel et al. (1957), 
and by Hoffman (1957, 1960). The 
main part of the interview consists of 
the parent’s describing in great detail 
everything that took place between him 
and the child throughout the day pre- 
ceding the interview. For mothers, a 
randomly selected weekday was used; 
for fathers, both a weekday and a Sun- 
day. The interviewer probed to fill in 
the continuity of all action sequences 
involving the parent and child. This
kind of interview seemed to make con- 
siderable progress in coping with the 
three problems of forgetting, withhold- 
ing of information, and unconsciously 
motivated distortions. The interviews 
were taped, transcribed, and coded for 
specific behavior categories. In addition, 
child observations were made in the 
nursery school. Great care was taken to
have the child data collected and coded
by individuals who had no contact with
the parent material. In this way it was 
possible to confirm, in a meaningful 
way, hypotheses relating the reported 
behavior of the parents to the observed 
behavior of the children. 

Another contribution to the study 
of parent-behavior patterns is the analy- 
sis by Brim (1954) of the process of 
innovation of a new parent-behavior 
pattern in child rearing. During a first 
interview with mothers who were hav- 
ing feeding problems with their chil- 
dren, data were collected on the meth- 
ods they used. These were coded into 
aggressive, intermediate, and persuasive 
modes of feeding behavior. A common 
stimulus situation was introduced, 
which involved advice by an expert, 
to induce all mothers toward a more 
permissive behavior pattern during the 
eating period. Some mothers tried the 
new behavior and maintained it long 
enough so that the behavior of the child 
changed; other mothers tried the new 
behavior but gave up; and others never 
tried the new behavior. The differences 
in change of behavior pattern were suc- 
cessfully linked to perceived support by 
the husband, concern about the child’s 
response, motivation for the current be- 
havior pattern, influences from other 
sources, and prestige of the expert who
had advised the behavior change. In
this study some of the basic interper- 
sonal and motivational aspects of par- 
ent-behavior patterns are brought into 
clear relief, and the interdependence of 
parent behavior and child response is 
highlighted in a situation in which cau- 
sation can be studied without ambiguity 
about what is antecedent and what is 
consequent. 

In eliciting information from parents 
about emotionally laden behavior situa- 
tions, several investigators have ex- 
plored the possibility of using less direct 
projective procedures. But there are 
serious difficulties for the investigator 
in knowing whether the respondent is 
using the projective situation to report 
actual behavior patterns or to express 
attitudes that underlie behavior patterns. 
Morgan and Gaier (1957) used a series 
of ten picture cartoons of situations 
leading to punishment of the child by 
his parents. For each picture, the re- 
spondent was asked whether or not this 
ever happened in his own experience, 
how the parent and child behaved when 
it did happen, and how they felt about 
it. Jackson (1956) used a series of 11 
hypothetical situations involving par- 
ents with children (e.g., “glancing into 
your bedroom, you notice your 12- 
year-old daughter taking money from 
your purse”). The parents were asked 
to indicate what they would do in each 
situation, and the responses were coded 
along a dimension from “little pressure 
toward conformity” to “great pressure 
toward conformity.” Differences were 
noted between the responses of the 
fathers and mothers, but no attempt 
was made to link parental behavior de-
rived in this fashion to child variables.

Direct Observation of Parent Be-
havior. Probably the most extensive
work on the assessment of parent be- 
havior has been done in the develop- 
ment and use of the Fels Parent Be- 
havior Rating Scales. This battery of 
rating scales was developed by Champ- 
ney (1941) and modified for research 
purposes by Baldwin and his colleagues 
(1945, 1948, 1949). The procedure is a 
mixture of direct observation and in- 
ferences made by a home visitor in a 
series of visits. The observations of the 
mother-child interactions are informal 
rather than standardized, and the rat- 
ings of behavior dimensions are based 
partly on supplementary material ob- 
tained verbally from the mother. On 
the basis of the intercorrelation matrices 
that were obtained and from their theo- 
retical interests, the investigators se- 
lected several parent-behavior syn- 
dromes, e.g., democracy in the home, 
acceptance of the child, indulgence-pro- 
tectiveness, control, and activity level. 
These were linked with varying degrees
of success to child behavior, as rated in 
a nursery-school situation. Baldwin 
(1948) makes an important contribution 
by showing that a difference in the total 
pattern of child behavior results from 
a difference in one of the major dimen- 
sions of parent behavior. A statistical 
derivation of the underlying parental- 
behavior dimensions in the Fels scales, 
using factor analytic procedures, has 
been reported by Roff (1949). A repli- 
cation of the design used by Baldwin 
and his associates was done in Australia 
by Pentony (1956), who reports that 
similar relations between parent be- 
havior and child behavior were found. 
This kind of research helps map out
some of the dimensions needed for fur-
ther investigation, but it is limited be-
cause of the lack of theory about inter-
vening processes.

The studies of Merrill Bishop (1946,
1951) and Moustakas, Sigel, and Scha-
lock (1956) are good illustrations of the
few systematic efforts to make direct
observation studies of parent-child in-
teraction. Bishop observed mothers in-
teracting with their preschool children
in a standardized activity setting out-
side of the home. Her observation cate-
gories were developed around the
theory that mother-behavior influences 
child-learning through the offering of 
rewards, incentives, and punishments,
through the facilitation or inhibition of 
certain activities, and through direct 
and indirect influence attempts. A series 
of direct links were made between 
mother-behavior patterns and child- 
initiation and response behavior. The 
high degree of individuality of the 
interactive behavior patterns of the vari- 
ous parent-child pairs suggests that 
there may be considerable unconscious 
collusion between parent and child in 
developing reciprocal response patterns. 

Moustakas, Sigel, and Schalock use 
an observation system with eight pre- 
coded dimensions of parent-child in- 
teraction, which were tested in the 
nursery-school playroom and in the 
home. There are at least two problems 
in the use of such systematic quantita- 
tive procedures of which these writers 
show full awareness. One is that the 
schedule leaves the investigator with- 
out any analysis of the content of com- 
munication between parent and child. 
The other is the problem of utilizing the 
postural cues and facial expressions that
have become part of a communication
pattern between parent and child.

Crandall et al. (1958) report a study
linking parent-child interaction to the 
behavior of the child with peers and 
adults in other situations. For children 
of two different age levels, they related 
ratings of the mothers’ rewarding of
compliance and punishing of noncom-
pliance in their children to observations
of social interactions with adults and
peers in a nursery-school and a day-
camp situation. Furthermore, they
showed different connections between
mother behavior and child behavior at
the different age levels. With such data, 
it is possible to begin to think with 
some degree of confidence about the 
meaning and impact of the socialization 
sequence through time in the develop- 
ment of child-behavior patterns. A great 
deal could be added to this kind of de- 
sign by securing data from the children 
concerning their perceptions of and 
their feelings about parental expecta- 
tions and behaviors. 

Although most of the work on syste- 
matic observation of parent-child inter- 
action has been with the preschool ages, 
the reports of Brody (1956) and Esca- 
lona and her collaborators (1953) illus- 
trate the growing sophistication about 
securing psychologically meaningful 
“parent-practice” data by observation 
of the interaction between mothers and 
infants. Also Strodtbeck’s “revealed dif- 
ference” observation procedure (1954, 
1958) is an important step toward im- 
proving direct observations of parent- 
child interaction with older children. 

Brody used behavioral observations 
and was able to classify mothering be-
havior toward infants into four patterns
based on the dimensions of sensitivity,
consistency, and attentiveness. As she 
explores the degree to which the 
mother’s behavior pattern and the in- 
fant’s behavior pattern complement one 
another, Brody faces thoughtfully the 
difficulties of interpreting whether the 
mother’s behavior pattern is a stable 
source of influence to which the infant 
is adapting or whether infant patterns 
of need and activity stimulate the de- 
velopment of adaptive mothering pat- 
terns. This is a perennial issue that sug- 
gests the importance of starting with 
mother-behavior patterns toward previ- 
ous children, independent personality 
assessment, and a clear conceptualiza- 
tion of predictions to the pattern of 
interaction with the new infant. 

Strodtbeck has discovered that fami- 
lies accept, indeed even enjoy, the 
standardized discussion situation that he 
“imports” into the home for quantita- 
tive study of family decision making. 
This observation procedure, discussed 
elsewhere in this chapter, yields scores 
for interaction of parents with each 
other and for each of them with their 
child. Methodological work is needed 
on the extent to which the data secured 
in these limited interaction situations is 
representative of a wider range of fam- 
ily interactions. 

Although parent behavior is not the 
focus of their work, Barker and 
Wright (1949, 1951, 1954) have made 
a major contribution to the direct ob- 
servation of family life in their studies 
of the child’s psychological ecology. 
Their continuous observation of chil- 
dren through the day is evidence of the 
readiness of families to collaborate in 
direct observation studies. These data
should provide a point of reference for
future studies that will attempt to use 
more focused observation schedules and 
sampling procedures. 


CHILD’S ORIENTATION TOWARD PARENTS AND SIBLINGS


This is perhaps the broadest level in 
the causal sequence schema, since it in- 
cludes the child’s perception of the fam- 
ily as well as his evaluative reactions. It 
is also a kind of bridge between the fam- 
ily as a stimulus situation for the child 
and the child as a product of the family. 
The only child-response variables that 
we will focus on are those where the re- 
sponse is the child’s perceptions of the 
family, particularly of his parents’ be- 
haviors and attitudes toward him. These 
perceptions might be conceived of as 
objective or subjective material. In cases 
in which the child reports the number 
of children in the family the response 
can usually be taken as objective fact, 
but when the child reports about the 
discipline patterns in his family the ma- 
terial is more likely to be subjective. 
However, even in such areas child re- 
ports may often provide valid objective 
data. We have already pointed out the 
possible advantages of child reports in 
our comparison of the Helper and the 
Jourard and Remy studies, and we take 
this point up again in a later section. 
The child’s perception of the family, 
in any case, is an important level in the 
process sequence, and we have already 
mentioned several studies in which the 
research used, or might well have used, 
such material as the intervening variable
between the family situation and the
child’s behavior.

A number of studies employ the 
child’s perception of parent behavior as 
the independent variable. Sometimes it 
is an older child or an adult trying to 
recall his childhood. In this case, the
problems of retrospective reporting
mentioned earlier apply, although there
is also the possible advantage of the 
decreased involvement, in that history 
is being described rather than current 
happenings. 

Probably the earliest research of this 
kind is reported by MacKinnon (1938) 
in a study comparing two college
student groups of violators and nonvio- 
lators of prohibitions in a test situation. 
Questionnaire reports were obtained 
from each subject concerning early 
parent-child relations, including the 
types of punishment experienced and 
the roles of fathers and mothers as disci- 
plinarians. These responses were related 
to behavior in the experimental situa- 
tion. The studies by Henry (1956, 
1957), already mentioned, deal with 
similar variables and also use the re- 
ports of young adults. 

Kohn and Clausen (1956) inter-
viewed schizophrenic adults and a
matched group of normals to obtain 
data on the patterns of parental author- 
ity and affection experienced in child- 
hood. These investigators were able to 
check their data in some cases by inter- 
viewing siblings and in other cases by 
interviewing the subjects’ parents. The 
resulting corroboration, the consistency 
of the findings, and their agreement 
with previous research suggest that retro-
spective reports about parent be-
havior may have objective as well as
subjective validity.

Kagan (1956) obtained perceptions
of which parent was more “friendly,”
punitive, and dominant. He found
that both boys and girls perceived
their mothers as friendlier and their
fathers as more punitive and dominant,
but the older children were more
likely than the younger to view the 
same-sex parent as more punitive and 
dominant. In discussing whether these 
reports are fact or fantasy, Kagan points 
out, on the one hand, that the chil- 
dren’s comments seemed to indicate that 
they were reporting what they felt
should be the case rather than what
actually was. For example, when stating 
that the father was dominant, they 
might add “because he has to work for 
a living.” On the other hand, Kagan 
points out that the perception of the 
same-sex parent as more threatening is 
consistent with other data based on 
interviews with mothers. 

Other studies using children’s per- 
ceptions of parents’ current behavior 
include those by Radke (1946) and 
Brown, Morrison, and Couch (1947). 
Radke used an extensive 53-item inter- 
view with preschool children and com- 
pared these responses to a question- 
naire administered to mothers. The 
results indicated that “In those ques- 
tions where desirability or adherence 
to standards entered in, almost with- 
out exception the parent represented 
himself in the direction of desirability, 
and the child represented the parents 
in a less favorable or ‘harder’ light.” 
In the same study Radke discovered 
that the parents perceived their own
parents as stricter than they themselves
were.

Brown, Morrison, and Couch de-
veloped an extensive Family Relations
Questionnaire for use with children as 
young as 10. Some of the major parent- 
behavior patterns reported were partici-
pation in work and play, approval and
disapproval, role in decision making, 
trust of the child, interparental relations, 
and symptoms of tension. Children re- 
sponded to questions on a five-point 
scale from “very often” to “almost 
never.” These reports on parental be- 
havior were linked to personality-test 
data on the children and ratings by 
adult leaders and peers. The investiga- 
tors made an effort to check the validity 
of the children’s reports by comparing 
them with independent ratings of some 
of the families. They conclude that the 
children were reporting with a high 
degree of objective validity. 

Hawkes, Burchinal, and Gardner 
(1957) obtained descriptions of parents 
from a sample of preadolescent children 
with a series of questions, such as “Do 
your parents praise you for doing a 
job well?”, with multiple choices from 
“always” to “never” to be checked. 

Lippitt and Hoffman (1959) have de- 
veloped two reporting procedures for 
use with elementary-school children. 
One is a “Who does it?” test, in which 
a series of behaviors (e.g., “praises,” 
“blames,” “scolds,” “laughs”) are pre- 
sented, and the child checks a multiple- 
choice form that includes mother, fa- 
ther, teacher, friend, brother, and sister. 
Responses to certain clusters of words 
are combined to yield scores, such as 
the extent to which the mother is associ-
ated with positive affect. A second
questionnaire procedure, entitled “Some
Kids’ Parents,” presents paired descrip-
tions of parental expectations on a num-
ber of conceptual dimensions. The child
checks the description that is most like 
his parents. The data from these two 
types of parent-behavior reports have 
been linked to independent observations 
of behavior with peers and teachers at 
school. 

Gray (1959) uses Osgood’s format to 
present young children with a series of 
paired adjectives representing either 
end of a seven-point scale (e.g., from 
“tired” to “full of pep”). The child 
checks a description of himself and of 
his parent. Perceived similarity to par- 
ents is used as a measure of parental 
identification and linked to measures of 
adjustment based on peer ratings, a 
manifest anxiety measure, and a mascu- 
linity-femininity scale. 

Lyle and Levitt (1955) use an 
indirect procedure to obtain reports 
from young children on parental puni- 
tiveness. This involves a series of incom- 
plete sentences such as, “To make me 
do what they want, my parents
______.” Attributed parental
punitiveness, measured in this way, was 
successfully linked to attitude measures 
of ethnocentricism-authoritarianism and 
to projective measures of the child’s 
punitiveness. 

Using an even freer projective pro- 
cedure, Kagan (1958) obtained stories 
about stimulus pictures from boys in 
the 6-to-10-year-old range. These were 
scored for hostility of parent figures 
and dependency of child figures. The 
perceptions of parents measured in this
way were linked to aggressive and non-
aggressive behavior patterns of the
children, as rated by teachers.

Alexander (1952) has standardized a
series of parent-child pictures, the
Adult-Child Interaction Test, for use
as a TAT-type test with children to
obtain data concerning perceptions of
parents and parent-child relations. Sears
(1948) and Stolz et al. (1954) have used
projective procedures to secure reports
of parent behavior from very young
children. Sears used observation of a
doll play situation. Stolz and her col-
laborators presented stories for comple-
tion and also dramatic play situations
with dolls calling for completion of a
series of eleven structured situations.

Ausubel et al. (1954) measured chil-
dren’s perceptions of their parents’ eval-
uations of them along the dimensions
of acceptance-rejection and intrinsic-ex- 
trinsic valuation. A projective test was 
constructed from several existing ones, 
such as the Children’s Apperception 
Test and the Blacky Test. Alternative 
interpretations of the pictures dealing 
with the dimensions under investigation 
were presented to the child. Story-com- 
pletion items were similarly adminis- 
tered, giving the child a choice between 
two alternative endings. In addition, a 
Parent Attitude Rating Scale, consisting 
of parent-attitude and behavior items 
reflective of acceptance-rejection and 
intrinsic-extrinsic evaluation, was an- 
swered by the children. Of the three 
measures, only the last was considered 
sufficiently reliable for use in the in- 
vestigation. Using this measure, the 
child’s perception of the parents as 
valuing him extrinsically was related to
his sense of omnipotence, goal-frustra-
tion tolerance, ideational dependence,
and emotional immaturity.

Lippitt et al. (1959) have used a
procedure in which children from the
third grade through high school filled
in the segments of circles with plus or
minus signs to represent the distribution
of positive or negative characteristics
in themselves as they feel each parent
sees them and as they see themselves.
This measure of perceived parental eval-
uation has been linked to independent
measures of parent behavior and to
child attitudes and behavior outside the
family.

In addition to the studies of the 
child’s perception of the family, we 
might have taken up studies of child 
evaluations such as that by Cooper and 
Blair (1959), which relates college stu-
dents’ responses on a parent-evaluation
scale to the perceived similarity of 
parent and child values, and that by 
Mussen (1950), which relates negative 
feeling toward parents as assessed by 
a TAT to prejudice and resistance to 
change in prejudice. We might have 
gone into some of the family reference 
group research, such as the work of 
Rosen (1955). However, although these 
are important family studies, the con-
ceptualizations involved carry us over
into studies in which the child’s per- 
sonality and behavior are the independ- 
ent variables and beyond the point 
where the family provides the special 
focus. The problems of dealing with 
child reports of this sort, however, are 
taken up at a more general level in the 
following section.


Methodological Problems


We have discussed the conceptualiza- 
tion of those aspects of the family 
that are seen as important for the child 
and have taken up several examples of 
concept measurement. In this section 
we will take up some general methodo- 
logical problems that seem particularly 
acute in such studies. 


ACCESSIBILITY OF DATA 


An early problem the researcher must 
face is that of deciding how to find the 
families he wishes to study. Research 
projects often call for special groups 
of families, such as two-child families, 
families with adolescent children, work- 
ing-mother families, and families of 
children born while their fathers were 
away in military service. How does 
one go about locating such families? 

First, there is often an agency or or- 
ganization that has contact with the 
particular group desired. For example, 
schools are a good starting point for 
getting families that include children 
of a particular age. The contact here 
is first with the child and then with 
his family. A nursery school for work- 
ing mothers, an adoption agency, a 
juvenile detention home, and a mater- 
nity clinic are examples of sources for 
locating families with particular charac- 
teristics. 

Another important source is in the 
use of existing records. Births, mar- 
riages, divorces, and crimes are recorded 
for each county. Christensen and Rub- 
instein (1956) used county records in 
Tippecanoe County, Indiana, to locate
families in which the mother was preg-
nant at the time of marriage. By com-
paring marriage records to birth
records, they were able to select those 
cases in which a child was born less 
than seven months after the marriage. 
They then located the couples through 
directories and interviewed them. Al- 
though their particular interest was in 
marriage adjustment, this same tech- 
nique might be used to study the ef- 
fects of premarital pregnancy on 
attitudes and behavior of parents. 
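
The comparison of marriage records with birth records can be sketched in a few lines of Python. This is only an illustration of the general idea, not the procedure Christensen and Rubinstein followed. The record layout, the `couple_id` key that ties a birth to a marriage, the sample dates, and the function names are all assumptions made for the example; real county records would first have to be matched on names and similar identifying details.

```python
from datetime import date

# Hypothetical records; field names and values are illustrative only.
marriages = [
    {"couple_id": 1, "married_on": date(1950, 6, 10)},
    {"couple_id": 2, "married_on": date(1950, 1, 15)},
    {"couple_id": 3, "married_on": date(1951, 3, 1)},
]
births = [
    {"couple_id": 1, "born_on": date(1950, 11, 2)},
    {"couple_id": 2, "born_on": date(1951, 2, 20)},
    {"couple_id": 3, "born_on": date(1952, 5, 5)},
]


def months_between(start, end):
    """Whole calendar months elapsed from start to end."""
    months = (end.year - start.year) * 12 + (end.month - start.month)
    if end.day < start.day:
        months -= 1  # the final month is not yet complete
    return months


def early_birth_couples(marriages, births, cutoff_months=7):
    """Ids of couples whose first recorded birth came fewer than
    cutoff_months after the marriage (births before marriage are skipped)."""
    first_birth = {}
    for b in births:
        cid = b["couple_id"]
        if cid not in first_birth or b["born_on"] < first_birth[cid]:
            first_birth[cid] = b["born_on"]

    selected = []
    for m in marriages:
        born = first_birth.get(m["couple_id"])
        if born is None or born < m["married_on"]:
            continue
        if months_between(m["married_on"], born) < cutoff_months:
            selected.append(m["couple_id"])
    return selected


selected = early_birth_couples(marriages, births)
print(selected)
```

The seven-month cutoff is taken from the text; it is exposed as a parameter so that a different interval can be tried without changing the matching logic.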

Hospitals keep incomplete but often 
helpful records. “Twin” studies some- 
times locate subjects through hospital 
records of twin births for any given 
year. Kaplan (1958) used hospitals to 
study effects of a crisis situation on fam- 
ilies, the crisis situation being a prema- 
ture birth. 

Locating families through agencies 
having more or less recent contact with 
them has an advantage in that current 
addresses may be available. This is less 
likely to be true when records are used, 
since they often refer to events in the 
past. For example, locating school-age 
twins through their teachers and prin- 
cipals is much more efficient than using 
records. Furthermore, the agency’s rap- 
port with the subjects may facilitate 
cooperation (although sometimes it may 
hinder the openness of responses). The 
agency may also have on-going records 
from which the researcher can select 
particular subjects from its total group 
of clients. 

A third method of securing subjects 
is to select a geographical area in which 
the possibility of locating the desired 
subjects is maximized. For example, if 
the researcher were interested in lo-
cating families representing a particular
ethnic group, and there were no ex-
isting groups or organizations through
which they could be reached, he might
use informants or census data to find
areas in which these families were con-
centrated. This method is, of course,
less reliable and more time consuming,
since many ineligible families would
have to be interviewed in the process
of locating the appropriate ones.

Finally, there is an important method
for locating subjects that can be used
alone or in combination with the fore-
going methods. This involves gathering
data on more subjects than will actually
be included in the study. The data col- 
lected can be minimal, designed only 
to determine whether or not the re- 
spondent has the qualifications required 
for him to be selected for further study. 
If the final data are relatively simple 
to collect, it might be more efficient in 
the long run to gather the complete 
data at the first contact, even though 
only the responses from the selected 
subjects are to be analyzed. Despite the 
disadvantage of requiring the collection 
of more data than is needed, this method 
has two important advantages. First, it 
may well be the only way of locating 
clusters of families that cannot be 
reached through agencies, records, or 
areas of residence. Second, it does not 
impose limitations with respect to the 
representativeness of samples as extreme 
as the other methods. Statistical gen- 
eralization is limited to that population 
from which persons had a chance of 
being included in the sample. If the 
original contact were Public School No. 
2, only families with a child attending 
that school had the possibility of being
included. In Christensen and Rubin-
stein’s study, only those who were mar-
ried in Tippecanoe County, whose first
child was born there, and who could 
be located for study had the possibility 
of being included. These limitations are 
not always serious, as we shall see later, 
but they are usually undesirable. The 
gathering of data from many subjects
can follow standard population-sam-
pling techniques and as such permit
generalizing findings to a broader popu- 
lation and drawing conclusions less sub- 
ject to error from sample bias. 

Once the subjects have been located, 
the researcher faces another obstacle,
one which stems from the aura of
privacy that surrounds family life. The 
high value on family privacy may pro- 
vide a barrier at three points: 


1. Agencies may be reluctant to co-
operate in the interest of protecting
the privacy of the families with which 
they are dealing. 

2. The family members themselves 
may not wish to reveal what goes on 
in the home and thus refuse to 
participate. 

3. Even when cooperation has been 
agreed upon, the family members may 
hold certain aspects of their home life 
as inviolate. 


The first of these problems some- 
times appears to be the greatest, for 
organizations may be more protective 
of their clients’ privacy than the indi- 
viduals themselves. Organizational per- 
sonnel sometimes assume more naiveté 
and even hostility toward research on 
the part of the clients than actually 
exists. School systems that are quite 
willing to allow researchers to adminis-
ter personality tests, sociometrics, and
even experiments—when the child is 
the only one being studied—will often 
be unwilling to cooperate with a study 
of the child’s family. Even if the child 
is the only one in the family to be in- 
terviewed, the school may insist that if 
the interview touches on family life the 
parents must give permission before- 
hand. Sometimes school policy forbids 
giving out parents’ addresses, and the 
original contact with the parent may 
have to be made in a note delivered 
by the child. In one school system, in 
which the parents’ names and addresses 
could not be given out, the authors 
were allowed to give the children let- 
ters in stamped envelopes, which they 
addressed to their parents as an exercise 
in penmanship class and the school 
mailed. 

The protectiveness of organizations 
is even more of a problem when social 
work agencies having standard obliga- 
tions to protect the privacy of their 
clients are involved. In general, when 
the head of an agency is not convinced 
that there is something to be gained, for 
himself, his organization, or society, he 
will not risk the possibility of censure 
by his client. If the proposed study 
might be seen—even by just a few—as 
a violation of privacy, he is undertaking 
a risk. Therefore, he must be “sold” on 
the study before he will agree to co- 
operate. 

One of the difficulties with limited 
cooperation is that it intensifies the
problem of sample bias mentioned
earlier. For example, in the case in
which parents have to return permis-
sion blanks beforehand, bias is intro-
duced, for some parents will not give
permission or will forget to return their
forms, and these persons as a group may
be different from those from whom per-
mission is received. Furthermore, it may
not be possible to identify these dif-
ferences and thus to appraise the nature
of the bias. If the school permits the
researcher to ask the children whose 
parents did not return permission blanks 
a few simple questions, such as father’s 
occupation, mother’s working status, 
and whether both parents live at home, 
these nonreturns can at least be soci- 
ologically identified. 
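
One way to use such background questions is to compare the children whose parents returned the permission blank with those whose parents did not. The Python sketch below is a minimal illustration under assumed data: the field names (`returned`, `mother_works`), the sample answers, and the function name are hypothetical, and a real comparison would cover every background item and a much larger group.

```python
# Hypothetical answers from six children; values are illustrative only.
children = [
    {"returned": True, "mother_works": False},
    {"returned": True, "mother_works": False},
    {"returned": True, "mother_works": True},
    {"returned": True, "mother_works": False},
    {"returned": False, "mother_works": True},
    {"returned": False, "mother_works": True},
]


def proportion_by_return(children, field):
    """Proportion of children answering yes on `field`, computed separately
    for families that returned the permission blank and those that did not."""
    counts = {True: [0, 0], False: [0, 0]}  # [yes answers, total children]
    for child in children:
        group = counts[child["returned"]]
        group[1] += 1
        if child[field]:
            group[0] += 1
    labels = {True: "returned", False: "not_returned"}
    return {
        labels[key]: (yes / total if total else None)
        for key, (yes, total) in counts.items()
    }


result = proportion_by_return(children, "mother_works")
print(result)
```

A marked difference between the two proportions would suggest the direction of the bias, although it cannot show how the nonreturning families differ on the variables the study was designed to measure.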

This brings us to the second problem 
—that family members themselves may 
be reluctant to discuss family relation- 
ships. Resistance to research cooperation 
has often been greatly exaggerated, and 
the willingness of most subjects to dis- 
cuss personal matters with a stranger is 
often a source of amazement even to 
the seasoned researcher. Kinsey et al. 
(1948) have shown that it is possible to 
get people to discuss their sexual be- 
havior. Several studies have shown the 
willingness of respondents to take the 
researcher into their homes—actually as 
well as descriptively. (See the earlier 
section on direct observations of parent 
behavior.) One extreme example of the 
willingness to cooperate is shown in 
the work of Soskin (1959). Soskin ob- 
tained continuous recordings of the 
conversations of young couples during 
a week’s vacation at a summer resort. 
The resort vacation was financed by 
the research project in exchange for the 
respondents’ agreement to wear small 
tape recorders. The couples were al- 
lowed to disconnect the recorder at any 
time if they felt the invasion of privacy 
was too great, but this was done rarely,
and the respondents seemed to forget
about the apparatus.

There are always some respondents 
who are reluctant to cooperate—those
who hold such a high value on privacy 
that no intrusion is allowed. If the repre- 
sentativeness of the sample is at all 
important, these nonrespondents be- 
come a concern, and special efforts 
must often be made to obtain data from 
them or at least to identify them so 
that the limitations of generalizing from 
the research can be known. Although 
a high value on privacy may at times 
be the actual reason for refusal, the 
prevalence of a value on the sanctity of 
the home makes it likely that some 
people will use this value to clothe 
less acceptable reasons. For example, the 
actual conscious or unconscious reasons 
for refusing to participate in a study 
of family life might be anxiety about 
how good a job one is doing as wife, 
mother, husband, or father. This makes 
the problem of the lack of cooperation 
by all subjects more crucial because a 
bias may thereby be introduced into the 
sample. 

There is another way in which the 
privacy of family life affects the indi- 
vidual’s cooperation. A man’s home is 
not only a castle defended against re- 
searchers but it is also defended against 
his neighbors. As such, there is a great 
deal of pluralistic ignorance about 
family life. A mother may know that 
she herself sometimes loses her temper 
and screams at her child, but she does 
not know that the lovely young mother 
across the street does the same thing. 
Since each keeps the information within 
the family, each considers herself in-
adequate. When there is communica-
tion, most people will find that others
also fall short of the ideal, and they will
be less ashamed of their own shortcom-
ings. However, because there is a
romanticized ideal about family life, 
because most families fall short of this 
ideal, and because of family privacy, 
many areas of shame exist and cause po- 
tential respondents to be inappropriately 
reluctant to expose themselves to the 
researcher. 

Similar points can be made about the 
inaccessibility of data once the subjects 
have been obtained. For example, the 
researcher may be allowed to make ob- 
servations in the home, yet be excluded 
from many kinds of interaction. This 
means that he may not be able to ob- 
tain any data on certain activities. An 
even more serious problem concerns the 
bias that results when the researcher 
does not take account of the possibility 
that he is being excluded from certain 
kinds of interactions and that this ex- 
clusion does not take place equally in 
all families. 

Although these problems may be 
paramount in studying the family, their 
solutions are not essentially different 
from those employed in studying any 
area in which organizational and sub- 
ject cooperation are necessary. When 
approaching an organization, it is im- 
portant to call upon the right level 
first. Sometimes there will be a person 
specifically in charge of research. The 
organization will need to be assured of 
the researcher's professional compe- 
tence, experience, and willingness to 
consider the needs of the organization. 
Often the researcher can provide some 
service to the organization, such as in- 
cluding questions that may be of use to 
it or making available a report of some 
of the findings (with the anonymity of 
the individual respondent being pre- 
served). Most important, the signifi- 
cance of the study should be stressed. 
Usually the organizations approached 
will be directly concerned with chil- 
dren or family life, and the study should 
be of special interest to them. 

It is usually beneficial to work with 
persons in the organizational structure 
when planning the study. Not only will 
such involvement help rapport, but 
there may be many things the researcher 
can gain from the planning sessions, 
since the persons in the organization 
are likely to have valuable information 
about the population to be studied and 
about many relevant aspects of the 
situation. 

In gaining subject cooperation, a per- 
sonal contact is better than one made 
by mail or telephone. In addition, the 
anticipation of being a research subject 
is ordinarily more negative than the 
actual experience. Therefore, to ap- 
proach the respondent at the time one 
is ready to begin is sometimes more 
effective than asking for cooperation in 
advance. 

Regarding the motivation of the sub- 
jects to cooperate, the pluralistic igno- 
rance and the general anxiety around 
the child-rearing area operate in the re- 
searcher’s favor in some respects, just 
as they operate against him in others. 
Because of parents’ lack of familiarity 
with the manner in which other 
families interact and because of their 
extreme interest, subjects will quickly 
see the importance of a study in this 
area and will therefore be motivated to 
cooperate. Promising the subject a re- 
port on some of the findings will add 
incentive, enhance involvement, and 
help make him feel a part of something 
important. Even though the reluctance 
to participate may be motivated by a 
need for privacy, the subject’s feeling 
of making a contribution may be as 
helpful in gaining cooperation as the 
assurances of anonymity. 

The problem of adequate motivation 
is especially important when the study 
requires the cooperation of all the mem- 
bers of a family, for example, when they 
are to be interviewed or observed simul- 
taneously. This requirement is often 
not in accord with the normal routines 
of family life. It is an inconvenience to 
the subjects and thus requires higher 
motivation than a study in which the 
researcher is able to adapt himself to 
the normal routines and time demands 
of the subject. 


THE FAMILY 
AS A SMALL GROUP 


In many respects, studying the family 
is like studying other small groups, but 
it is distinct in a number of ways. First 
of all, although many small groups are 
relatively unstructured, each member 
in the family holds a different status. 
Status in the family is assigned primarily 
on the basis of sex and age, and only 
in the case of twins of the same sex 
does one find two members who are 
identical with respect to these charac- 
teristics. Because of this individuation 
of status and because of the small size 
of the group itself, the significance of 
each member is greater. Therefore, only 
families alike in composition are truly 
comparable. 
How serious this lack of compara- 
bility is depends on the variables being 
studied. When the independent variable 
is itself some aspect of family structure, 
such as the ordinal position of the child, 
controlling on other aspects of family 
structure and considering more than 
one aspect at a time might be essential 
for highlighting effects. For example, 
the number of children in a family and 
the sex and age of each might need to 
be considered in a study dealing with 
the effects of sibling composition. An 
older sister is a very different influence 
on a child’s development than an older 
brother or a younger sister. Being the 
second of two children is a very dif- 
ferent experience from being the second 
of three or the youngest of more than 
two. The studies of Koch, mentioned 
earlier, are outstanding examples of 
structural controls. 

Another characteristic of the family 
that distinguishes it from other small 
groups is that it has a history. The 
small group in the laboratory is usually 
newly created. Small informal groups 
within organizational settings, social 
groups, and committees may have some 
history, but it is of short duration as 
compared to families. Having a history 
means that many variables important 
for understanding a given unit of in- 
teraction are not visible to the outside 
observer or even available to partici- 
pants. In this sense, the observer of 
family life is like the observer of a total 
society. Lack of familiarity with the 
group’s history may cause the observer 
to misinterpret the significance of a 
given unit of behavior, just as would 
be so if an anthropologist studied a 
society in which the history and tradi- 
tions were unfamiliar to him. What 
seems to be hostile behavior may not 
be hostile in the context of the family’s 
interaction over the years and their 
particular language forms, mannerisms, 
and affective states. What seems to be 
an independent, power-assertive deci- 
sion on the part of one member of the 
family may have covertly, perhaps un- 
consciously, taken account of the wishes 
of each of the other family members as 
they are known from the past history 
of the group. 

The importance of family history in 
understanding the present interrelation- 
ships comes out clearly in the study by 
Spiegel and Kluckhohn referred to 
earlier (Spiegel, 1957). Partly because 
of their sensitivity to this problem, 
they have intensively studied the 18 
families in their sample over a four- 
year period. They have utilized 
behavioral observations, participant 
observation, intensive clinical inter- 
views, and questionnaires. Although 
their analysis is an ahistorical structural- 
functional one, they have used their 
knowledge of significant past events to 
grasp fully the current family interre- 
lationships. 

However, the sheer volume of data 
and the prolonged period of field work 
needed may make this wholistic kind 
of study impractical for most research 
projects. As always, the first step is to 
be aware of the problem and the second 
is to evaluate how serious it is for the 
particular study. The fact that the 
history of the group affects its present 
behavior and influences the relationship 
between independent and dependent 
variables is not in itself sufficient reason 
for complicating the research design. 
A study need not take account of all 
the variance. On the other hand, if ig- 
noring the historical context leads to 
mislabeling behavior and to super- 
ficiality, there is a real problem. This 
can often be handled, however, simply 
by supplementing behavioral measures 
with introspective reports by the actors. 
For example, an influence technique 
used by a parent on a child is very much 
affected by previous interactions be- 
tween them, and the psychological 
meaning of the event to both may thus 
not be apparent from the behavior it- 
self, but asking each of them about 
their thoughts and feelings in the sit- 
uation may bring out as much of the 
missed context as the study requires. 


SOURCES OF DISTORTION 
IN VERBAL REPORTS 


It is clear from the earlier section 
that the present authors differentiate 
the subjective from the objective family 
situation. This distinction is an im- 
portant one. For example, several 
studies have found that the child will 
report his father as being a more active 
disciplinarian than will other members 
of the family (Kohn, 1959b; Lippitt 
and Hoffman, 1959). This exaggeration 
of the father’s disciplinary functions 
may stem from several sources. The 
father’s discipline may have been more 
severe if not more frequent; it may have 
been called forth at a more intense 
moment in the parent-child conflict, 
i.e., after the mother’s attempts had 
already failed; it may have carried more 
authority behind it because of the per- 
ception of the father as a more power- 
ful figure; it may have been less threat- 
ening because it did not involve the 
loss of love from the nurturant parent; 
it may have been exaggerated by the 
boy because of the Oedipal conflict. 
Whatever the reason for the perceptual 
distortion (assuming it is a distortion), 
if it is an honest report of a perception 
it will influence the child’s behavior. 
Indeed, empirically it is likely to be 
more related to the child’s personality 
and behavior than the actual discipline 
situation because it is closer in the 
process sequence. However, it is a sub- 
jective impression and not a measure 
of the actual discipline situation. Too 
often a stimulus-response theory about 
the effect of a parent action on the 
child is “tested” by using the child’s 
subjective impression of the parent’s 
action. For example, Henry (1956, 
1957) uses subjective reports of parental 
discipline, and, though he is very care- 
ful to refer to them as children’s per- 
ceptions rather than parent behavior, 
his theory is about parent behavior, 
not about children’s perceptions. Fur- 
thermore, by using the same respondent 
to provide both independent and de- 
pendent variables, he risks the problems 
of variable contamination. The lure of 
the questionnaire, and particularly the 
single questionnaire, is one of the main 
reasons for the prevalence of family 
studies dealing with response syndromes 
rather than testing stimulus-response 
theories. 

On the other hand, the lure of the 
questionnaire is realistically based. No 
other method of data collection is so 
economical. Furthermore, many of the 
problems involved in the use of the 
questionnaire or the interview stem 
from misuse rather than from intrinsic 
faults. Two common errors are made. 
One is the assumption that it is impos- 
sible to obtain objective material from 
verbal reports, and the second is that 
any verbal report is a legitimate sub- 
jective response. Neither of these as- 
sumptions is true, and in considering 
sources of distortion in verbal reports 
it will be clear that many apply equally 
whether the researcher is interested in 
obtaining valid subjective or objective 
material. 

Deliberate Distortion. Perhaps the 
most obvious form of verbal distortion 
occurs when the respondent does not 
accurately communicate what he con- 
sciously feels to be true. Deliberate fal- 
sification of this sort can exist whether 
the respondent is reporting an empirical 
fact, such as his annual income, or an 
emotional state, whether or not he is 
disappointed in his eldest son. The re- 
spondent may know the answer to both 
questions but refuse to communicate 
the true answer to the interviewer. 

The need to present a desirable pic- 
ture to others is especially great in mat- 
ters concerning the family. It is not 
uncommon for married persons who 
have successfully communicated to 
their friends a picture of a happy mar- 
riage to announce suddenly that they 
are getting a divorce. The value on 
The Happy Family is so high that the 
family will carry on a masquerade for 
the outside world long after they have 
come to acknowledge the unpleasant 
facts to themselves. This is particularly 
true when the outsider is believed to 
be an expert. Parents are remarkably 
well informed about what a “good 
parent” is like. They are familiar with 
words like “togetherness” and “accep- 
tance” and “love,” and they are proud 
of their familiarity with these concepts. 
A person who is doing research on 
family life and parent-child relations 
is viewed as an expert on these matters 
and is someone whom one can impress 
with one’s psychological sophistication 
and with the beauty of one’s own family 
interactions. On the other hand, report- 
ing undesirable aspects of one’s family 
involves acknowledging one’s own 
faults and the faults of other family 
members. Acknowledging one’s own 
faults as a family member may involve 
confessing one’s lack of competence, 
morality, and warmth. Acknowledging 
the faults of other family members vio- 
lates the value of family loyalty. 

There are many ways of handling 
the problem of conscious distortion. 
The most basic involve gaining the re- 
spondent’s acceptance of the interview 
situation. The respondent must be con- 
vinced that his sincere response will 
be an important contribution, that the 
researcher will not pass judgment on 
him, that his responses will be anony- 
mous, and that the interviewer is a 
pleasant person. The researcher must 
communicate both that the respondent 
is not the only one being interviewed 
and that his contribution is a very spe- 
cial one. These are the kinds of aims 
communicated by the word “rapport.” 

The wording of a question is also 
important in minimizing conscious dis- 
tortion. There are several techniques 
for wording a question to make an 
otherwise unacceptable answer more 
palatable to the respondent. One com- 
mon method, for example, is to make 
a statement before the actual question 
that indicates to the respondent that 
the answer he is about to give is per- 
fectly acceptable and is shared with 
other persons. The following questions 
have been used by the authors: 


Everyone in America, except for Ameri- 
can Indians, can trace his family back- 
ground to some other country. What 
would you say your nationality back- 
ground is? 

Most parents disagree sometimes about 
how the children should be handled. 
What are some of the things you and 
your husband sometimes don’t agree 
about? 


In both examples, the only purpose 
of the sentence preceding the actual 
question is to make the respondent feel 
freer to answer openly. The same tech- 
nique of indicating that the respondent 
shares his situation with others can be 
used when a precoded list of answers 
is presented. In the “Some Kids’ 
Parents” measure, mentioned earlier 
(Lippitt and Hoffman, 1959), the child 
reports parental expectations by indi- 
cating which of two positions his 
parents are likely to take. The child is 
given pairs of items such as the fol- 
lowing: 


Some kids’ parents tell them they should 
try to do things the other kids want them 
to. 

Some kids’ parents tell them they 
shouldn’t try to do things just because the 
other kids want them to. 


By making each position one that 
others also held, it was felt that the 
child would be more likely to report 
his parents’ position as he actually per- 
ceived it. 

It is often important also in such pre- 
coded responses to word the alterna- 
tives as equal in attractiveness as possi- 
ble without distorting the meaning. The 
position that seems more socially ac- 
ceptable is toned down, and the socially 
unacceptable position is phrased as posi- 
tively as possible. For example, the re- 
sponse, “Some kids’ parents feel that 
children sometimes need to be spanked,” 
is more acceptable than “Some kids’ 
parents believe in spanking their chil- 
dren” (although, for isolating an ex- 
treme group, the latter might be 
preferable). 

The position of items in a question- 
naire is also important in setting the 
stage to encourage free communication. 
For example, an item that asks the 
child to describe something negative 
about his mother should be preceded 
by an item that allows the child to ex- 
press positive attitudes. Often the re- 
searchers will include questions for the 
sole purpose of providing a favorable 
setting for the important questions that 
follow. 

Indirect measures have already been 
mentioned and are often used 
to obtain freer communication about 
the child’s family. In Douvan and 
Kaye's study of adolescent girls (1958), 
a modification of the Rosenzweig tech- 
nique was used. In one item the subject 
was presented with a sketch showing a 
girl going out the door, parents sitting 
on a couch, and the girl saying, “I’m 
going over to Mary’s house—all the 
girls are going to be there.” The re- 
spondent was asked what the parents 
would say. The response presumably 
indicates something about the girl’s 
perception of her own parents, but since 
it is not obvious to the respondent that 
this is the case the response may be less 
inhibited. Similarly, one might ask ques- 
tions phrased in such terms as “What 
do most girls your age disagree with 
their parents about?” or “How do 
parents like you feel about . . . ?” In- 
direct measures do involve a problem 
in that it may be erroneous to infer 
that the response is a reflection of the 
respondent’s own situation rather than 
an intellectual view of others or a wish- 
ful fantasy, but this can often be 
checked against other aspects of the 
total interview, and, in addition, indi- 
rect measures are at least useful for iden- 
tifying salient areas in the family situa- 
tion. 

Hoffman (1957) suggests still another 
method for minimizing deliberate dis- 
tortion. We have already described this 
interview, which has as its primary 
purpose the obtaining of objective data 
about parent-child interactions. By ask- 
ing for extremely detailed descriptions 
of the behavior, deliberate distortion is 
minimized because the “set to recall de- 
tails seems to result in a fragmentation 
of the event which weakens its Gestalt 
properties and thus divests it of much 
of the emotional meaning it would 
otherwise have.” Thus the motivation 
to distort, both deliberately and through 
unconscious mechanisms, is less. In ad- 
dition, the sheer task of recalling be- 
comes so absorbing that the personal 
relevance of the material is lost to the 
respondent, but even if the motivation 
to distort persists it is extremely difficult 
to do so without detection when great 
detail is required. 

The idea that breaking a pattern 
down to its insignificant parts avoids 
distortion is an important principle to 
employ in measurement construction 
and evaluation. This is the principle in- 
volved in the Wolfe measure and the 
Herbst measure already described. By 
asking specific questions and then com- 
bining them into an over-all measure, 
a more valid perception is obtained, 
assuming that these questions are suffi- 
ciently inclusive and representative. 
Respondents might hesitate to report, 
for example, that the father was highly 
active in household tasks conventionally 
thought of as feminine, but they might 
reveal this information if it were par- 
celed out only in very small innocuous 
[...] 

[...] deliberate distortion involves the ques- 
tion of which family member should 
be the respondent. The Herbst measure, 
for example, was designed for children 
because Herbst felt that parents would 
distort responses by exaggerating the 
amount of joint parental action and 
mutual decision making. This measure 
was intended for obtaining objective 
material more than subjective and thus 
interest was simply in minimizing dis- 
tortion. Since the information was ac- 
cessible to all, the questionnaire was 
geared toward the ones least likely to 
distort. The relative naiveté of children 
made them, in this case, the best 
reporters. 

In many studies the question of who 
shall report is predetermined by the 
theory. Thus, if one is predicting from 
a child perception of the family to a 
child behavior, the child is almost al- 
ways the best reporter of his own per- 
ception. However, when the aim is to 
obtain objective data, the researcher 
should consider which family member 
would be least motivated to distort. 
Obviously, the least motivated persons 
are not always the best ones to use, 
for they may not have so great an op- 
portunity to observe the situation, but 
occasionally the advantages of greater 
objectivity may be more important. For 
example, mothers are usually the re- 
porters of their children’s behavior 
rather than fathers. In addition to the 
fact that mothers are more available, 
it is usually felt that they are better 
informed. At least two studies (Sara- 
son, 1958; Levitt, 1959) have shown that 
mothers are rather unreliable inform- 
ants, and Sarason has found that 
fathers’ descriptions of their children 
show a higher relationship to clinical 
appraisals than mothers’ descriptions. 
The parental role is more salient to the 
mother’s self concept than the father’s, 
and it may be more important for her 
to communicate a positive picture to 
the “expert.” 

Sometimes it is possible to use more 
than one informant—interviewing re- 
spondents either separately or together. 
It should be pointed out, however, that 
family members are not independent 
reporters. Agreement represents con- 
sensus but not necessarily one arrived 
at by each independently. This is a 
problem when family members are in- 
terviewed separately and not simul- 
taneously. The first person interviewed 
might discuss the topic with other 
family members before their interviews 
take place. 

Bossard and Strodtbeck have already 
been mentioned as obtaining verbal re- 
ports on husband-wife relations by us- 
ing relatives or friends of the family 
as informants. Although this technique 
is limited by the amount of knowledge 
friends have about intrafamily matters, 
it may lessen problems of deliberate distortion. 

Nondeliberate Distortion. Nonde- 
liberate distortion can be classified into 
three general categories. The first in- 
volves unconscious distortion, in which 
the material is threatening and is there- 
fore repressed. In this case the respon- 
dent’s report is an honest communica- 
tion of what he consciously believes 
to be true, but it is neither what is in 
fact true nor what at a deeper level he 
knows to be true. The second category 
involves distortion stemming from the 
respondent’s having insufficient knowl- 
edge about the situation—either because 
he has had insufficient opportunity to 
observe or experience the relevant ma- 
terial or because the material was rela- 
tively trivial and has been forgotten. 
The third source of nondeliberate dis- 
tortion is involved when the respondent 
has the information to report and is 
willing to report it, but there is a mis- 
communication between researcher and 
respondent. 

It is not always possible to pinpoint 
the source of distortion. Take, for ex- 
ample, the Herbst “Who decides?” 
questions. If parents do report more 
joint deciding than is reported by chil- 
dren, it is quite possible that (1) this 
is deliberate distortion by parents in 
the direction of the more socially ap- 
proved pattern, (2) it is an unconscious 
distortion of the existing asymmetric 
deciding pattern that is threatening to 
both parents, (3) the parents have more 
complete knowledge of the deciding 
process and know that what the child 
sees as a one-sided decision was actually 
discussed beforehand or was mutually 
decided and routinized before the child 
was born, or (4) certain ambiguities in 
the questionnaire lead adults and chil- 
dren to interpret the questions and the 
answer form somewhat differently. The 
greatest difficulties in interpreting 
come when all four distortions are pos- 
sible. It is therefore important that 
the researcher consider each of the four 
kinds beforehand and plan his research 
to minimize all unwanted distortions. 

Distortion through unconscious mo- 
tivation is not always undesirable, and 
some studies may call for only conscious 
perceptions of the family. Obtaining 
conscious material requires relatively 
straight, direct questions, whereas un- 
conscious material involves the use of 
projective measures. The projective 
measure developed by Alexander (1952) 
focuses particularly on family interac- 
tions as does Blum’s Blacky Test (1950), 
the Children’s Apperception Test (Bel- 
lak and Bellak, 1949), and the Michigan 
Picture Test (Hartwell et al., 1951). 
Whether conscious or unconscious 
material is called for, it is important that 
all persons respond at the same level 
and that the researcher is clear about 
what this level is. A combination of 
responses is sometimes used for han- 
dling the latter problem. For example, a 
pattern of overidealization of parents in 
combination with repressed hostility 
can be inferred when more direct ques- 
tions are answered with great praise 
of parents and indirect questions pro- 
duce fantasies about disasters occurring 
to parents. 

When objective data are required, 
unconscious distortion is as undesirable 
as deliberate distortion and some of the 
solutions suggested for the latter apply 
equally here: for example, asking ques- 
tions that break up the Gestalt prop- 
erties of an event, selecting respondents 
who would be less motivated to distort, 
and using more than one informant in 
a family. 

Whereas both deliberate and uncon- 
scious distortion are motivated, cogni- 
tive distortion is not. A family may also 
be described differently by different 
members simply because each member 
experiences somewhat different aspects 
of it. The mother is usually home all 
day and the father is not. The parents 
are usually up after the children are 
asleep. Parents do not treat all their 
children exactly the same, and two 
persons do not perceive the same inter- 
action identically. It is not surprising, 
then, that the correlations between dif- 
ferent members’ perceptions of their 
family are usually better than chance, 
but not high (Lippitt and Hoffman, 
1959; Kohn, 1959a). When the Herbst 
measure was given to fathers, mothers, 
and preadolescent children, it was found 
that each tended to exaggerate his own 
relative participation in tasks (Lippitt 
and Hoffman, 1959), probably because 
each person is always a witness to his 
own acts but is not necessarily a witness 
to the acts of others. If the researcher is 
interested in perceptual material, this 
is not a problem. Even if the interest 
is in objective aspects of the family, 
but only relative scores are required, 
this may not be a serious problem if it 
can be assumed that all reporters are 
distorting more or less equally and in 
the same direction. However, when de- 
ciding which family member should be 
the reporter and what one can ask of 
him, it is important to consider the 
opportunities for observation provided 
by the respondent’s family role. 
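
The point about relative scores can be illustrated numerically. The following is a minimal sketch, not part of any measure described here: the five families, the percentages, and the amounts of exaggeration are invented, and `pearson` is a small helper written for this example. It assumes one case in which every reporter overstates his participation by the same amount and one in which the overstatement varies from family to family.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation between two equal-length lists of numbers."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


# Invented data: the father's actual share of household tasks (percent)
# in five hypothetical families.
actual_share = [10, 20, 30, 40, 50]

# Case 1: every father overstates his share by the same 10 points.
uniform_report = [share + 10 for share in actual_share]

# Case 2: the overstatement differs from family to family.
uneven_extra = [30, 0, 25, 0, 5]
uneven_report = [s + extra for s, extra in zip(actual_share, uneven_extra)]

r_uniform = pearson(actual_share, uniform_report)
r_uneven = pearson(actual_share, uneven_report)

# Absolute level is shifted even when the ordering is untouched.
mean_shift = sum(uniform_report) / 5 - sum(actual_share) / 5

print(f"uniform exaggeration: r = {r_uniform:.2f}, mean shift = {mean_shift}")
print(f"uneven exaggeration:  r = {r_uneven:.2f}")
```

Under uniform exaggeration the families keep their relative positions, so the correlation with the actual shares stays perfect although every absolute score is too high. When the exaggeration is uneven, the relative scores themselves are disturbed.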

A closely related problem is that of 
forgetting. When adults are asked to 
recall their childhood or when parents 
report past interactions and previous 
attitudes toward their children, there is 
always the danger of inaccurate recall. 
Part of this may be psychologically 
motivated, but part may also be simply 
a matter of cognitive distortion. Asking 
about trivial experiences also invites 
forgetting. One way to collect data 
about trivial events is to ask respon- 
dents to keep a record each time they 
occur. 

We have noted that deliberate distor- 
tion is always undesirable but that un- 
conscious and cognitive distortion are 
not. Miscommunication is like the first 
—always undesirable. Whether a ques- 
tion is intended to yield subjective or 
objective data, it must communicate 
accurately. Too often a question is sub- 
ject to different interpretations, and 
responses reflect neither differences in 
the situation nor in the perception but 
are answers to different questions. For 
example, if children were asked, “How 
often do you and your mother do 
things together?” some might interpret 
this question to include tasks and some, 
only social activities; some might in- 
terpret the question to mean “you and 
your mother alone,” and some, to mean 
with and without other persons; some 
might include parallel behavior such as 
television-watching and some, only 
activities involving interaction. The 
answers would not be comparable or 
meaningful, and saying it is a respon- 
dent’s perception, a subjective response, 
does not make it any more meaning- 
ful. Two respondents with precisely the 
same perception and evaluation of their 
interaction with their mothers might 
respond differently because each has 
interpreted the question differently. 
When a vague question is to be asked, 
it should be followed by clarifying 
questions, such as in the above example, 
“What are these things you do to- 
gether?” or “Do you wish you did more 
things with your mother?” The original 
answer may then take on meaning in 
the context of the later answers. 

Even questions that seem simple, pre- 
cise, and objective may be misinter- 
preted. It is therefore important to pre- 
test all questions. “Are you employed?”, 
for example, is not sufficiently clear to 
obtain information about current ma- 
ternal employment. “Do you work out- 
side your home for money at the 
present time?” and “How many hours 
a week?” may be necessary. Working 
mothers usually have a spasmodic work 
history, and thus the question is more 
ambiguous than might seem. 

Prelisted response choices like 
“often,” “sometimes,” and “hardly 
ever” may also pose problems of com- 
munication, since the subject’s reference 
group is unknown to the researcher. 
The purposes of most research are bet- 
ter served by specifying the reference 
group (e.g., “more often than most 
boys my age,” “more often than others 
in my family,” or “more often than I’d 
like”) or, if more objectivity is desired, 
by using a list of specified frequencies 
such as “once a week.” 

Responses to open-ended questions 
are also capable of misinterpretation. 
The role of the interviewer in mini- 
mizing this is taken up by Kahn and 
Cannell (1957). 

One special communication problem 
comes up in family studies when the 
research design calls for identical
interviews with parents and children. In
most studies subjects vary in intelligence
and education, and it is difficult to
construct questions that are comprehensible
to the least educated but not overly
simple to the more sophisticated and to
avoid systematic distortion that stems
from these differences. When the
measure is to be used with adults and
children, these difficulties are increased.
Adults often cannot express their
opinions in the simple terms that children
can, and a questionnaire that communi- 
cates clearly to children might seem 
vague to adults and subject to varying 
interpretations. On the other hand, 
children often interpret questions more 
literally than adults, and this may be 
reflected in their answers. To com- 
pare the perceptions of parents and 
children, it might be more appropriate 
to vary the questionnaire forms in an 
effort to standardize the responses, 
rather than to standardize the questions 
and produce responses that differ be- 
cause of question interpretation. To 
develop such a measure one might do 
pretests, using a modified form of 
Strodtbeck’s “revealed difference” tech- 
nique in which the mother, father, and 
child answer questionnaires separately 
and are then brought together to dis- 
cuss their answers. In this way dif- 
ferences in interpretations and response 
sets can be studied for subsequent re- 
vision of the questionnaire. 


OBSERVATION OF FAMILY INTERACTION

Behavioral observations are one way
to escape the difficulties of the distortion
problem in verbal reports. They are
usually considered a more valid means
of obtaining objective data, although
there are problems even here. We will
discuss some of these difficulties as they
occur in family studies, taking up first
observations of natural interaction in
the home. Then we will discuss a technique
still quite new in family research:
observations in artificially created
situations either in the home or in the
laboratory.

Home Observations. Behavioral observations
in the home have been used
for obtaining data that is objective and 
not filtered through the perceptions of 
family members. Three general criti- 
cisms have been leveled against this 
method: (1) the presence of the ob- 
server influences the interaction; (2) 
observations without introspective re- 
ports are insufficient for knowing what 
is in fact taking place; and (3) it is a 
costly way of obtaining data. We will 
not discuss the method of behavioral 
observation in any detail, since that is 
the special focus of another chapter, 
but only these difficulties as they affect 
the study of the child’s family. 
Certainly the observer is more obtru- 
sive in the home than in other settings. 
Because the household normally in- 
cludes few people and because of the 
private nature of the home, the presence 
of a stranger is apparent. Where there 
are many people, one more is less ob- 
vious than where there are normally 
only one or two other adults. In a
setting in which strangers frequently
pass by or in which a noninteracting per- 
son is an accepted part of the scene, 
as in certain factory settings, the ob- 
server can be absorbed into the normal 
course of events. But in the home the 
observer probably does influence the 
interaction. Even when, with time, the 
family adjusts to his presence, it is dif- 
ficult to know whether they have be- 
come less self-conscious in that they 
have worked out patterns of interaction 
acceptable for the new situation or 
whether they have, in fact, returned to 
their usual patterns. Nevertheless, even 
if one assumes that the observer does 
influence the scene, how serious is this 
problem? How much can a family 
manage to conceal in this situation? 
The children in the family will be less 
affected than their parents, and, to the 
extent that their behavior is somewhat 
natural and the routine demands of the 
home continue, the influence of the 
observer may not be severe enough to 
invalidate the material. 

The second problem, the possible 
need to supplement observations with 
introspective material, has been dis- 
cussed earlier. The extent of this prob- 
lem depends on the theory being 
studied and the inferences that are made 
from the observations. A theory about 
family life might deal only with be- 
havioral interactions and treat the data 
at the behavioral level without inferring 
underlying affect states. In this case, no 
introspective reports are needed. How- 
ever, if the variables deal with warmth, 
rejection, social climate, etc., the data 
would certainly be enriched by intro- 
spective reports. The fact that the 
family does consist, as we have pointed 
out, of persons who have been interacting
together for some time and the
fact that all the persons interacting have 
particular forms of expression and 
certain perceptual sets means that not 
all of what is going on can be observed. 
Introspective reports would not be 
given at the exact time of the interac- 
tion but could be given afterwards. 
They could be either specific to the 
interactions observed or reported more 
generally as a framework for under- 
standing these data. 

The third criticism was that be- 
havioral observations are a costly way 
of gathering data. One of the assump- 
tions behind this is that more data will 
be gathered than will actually be used. 
This problem is not entirely a function 
of the method but of the marriage of 
this method with radical empiricism. 
Scientists who rejected theory were 
unable to say what data were important 
and what were not. Indeed, they did 
not want to say it but to collect with 
equal enthusiasm the meaningful and 
the trivial. This position often stems 
from a special concern with collecting 
data quite free from the scientist’s in- 
fluence. Behavioral observations in the 
natural setting seemed to be the method 
freest of influence, and as a result home 
observation studies are often detailed 
accounts of every event. A single proto- 
col may be quite interesting and sug- 
gestive of areas of study, but the sheer 
quantity of data along with the absence 
of full measures of any single variable 
make the material difficult to use unless 
it is recoded, selecting only certain parts 
for analysis. Thus, in the end, if the 
data are to be used to predict from 
the family situation to the child, much 
of the data are never used. It is in this
sense a costly way of gathering data
because it is a wasteful way. Data have
been gathered and processed that need
not have been if the researcher had been
selective beforehand and coded either 
larger units of behavior or only certain 
specified kinds of behavior. 

If, however, the researcher is interested
only in certain specified variables,
there is another problem, namely,
that he may have to observe a given
family for a long time before significant
events take place. Observation is better
suited for studying variables revealed in
behavior that occurs frequently or with
a predictable regularity.

Even if the variables selected for
study are revealed without an indefinite
period of fruitless observation and even 
if the observational method is geared 
toward efficient codification, home ob- 
servations are still expensive. They re- 
quire more elaborate procedures for 
gaining subject cooperation, more time 
spent in obtaining data from each 
family, and more highly trained per- 
sons to do the actual field work. For 
many research projects, however, the 
greater objectivity of the data and the 
opportunity to capture the material 
instantaneously in the natural setting 
make behavior observations well worth 
this expense. 

Observations in Structured Situa- 
tions. To retain the objectivity of the 
observational method while eliminating 
the inefficiency of having to wait until 
significant events occur and to standard- 
ize the situation in which behavior oc- 
curs, some investigators have devised 
techniques whereby subjects are given 
a standard stimulus situation designed
to elicit codable behavior immediately. 
The main use of this technique has been 
in measuring husband-wife interaction. 
These studies, all of which have been 
mentioned previously, include those of 
Strodtbeck (1951, 1954, 1958), Kenkel 
(1956), Kenkel and Hoffman (1956), 
and Vidich (1956). The fact that all of 
these researchers have been interested
in measuring power relations and
social-emotional roles is very likely related to
the difficulties encountered in obtaining 
such material from verbal reports. In 
fact, Kenkel and Hoffman report im- 
pressive discrepancies between respond- 
ents’ reports and investigators’ observa- 
tions in the structured situation. In addi- 
tion, these data are not readily obtained 
from behavioral observations in the natural
setting, for revealing behavioral
units do not occur frequently or at
predictable times, and they may not 
occur at all in front of an observer. The 
Strodtbeck, Kenkel, and Vidich tech- 
niques have already been described. It 
will be recalled that the Strodtbeck 
technique involves confronting subjects 
with differences in questionnaire re- 
sponses, and subjects are asked to recon- 
cile these differences. In the other 
techniques subjects were asked to make 
certain decisions. Strodtbeck and 
Kenkel scored interactions for the 
ability of each participant to influence 
the other and according to the Bales 
categories. Both of these investigators 
report satisfaction with the techniques, 
and they appear to be measuring mean- 
ingful concepts that relate to other 
variables. Vidich, in a refreshingly 
frank report, indicates extreme dissatis- 
faction with the technique. Vidich 
states that there was a high percentage 
of refusals, that couples seemed to have
difficulty in understanding the task, and 
that the presence of the interviewer was 
a great source of distraction, with 
couples frequently trying to draw him 
into the conversation. The tape recorder
was seen as inhibiting free communication,
and interaction underwent
a qualitative change when it was turned 
off. It seemed to the investigator that 
language was inappropriately formal, 
that extreme deference to the opinion 
of the spouse was used to communicate 
cohesion, and that, in general, the 
couple was giving a public performance. 
Most serious, Vidich reports that these 
difficulties occurred more with some 
subjects than with others, and, thus, 
differences between couples were, in 
large part, based on differences in re- 
sponses to the experimental situation 
rather than on differences in the hus- 
band-wife relationship. The Vidich 
design had certain problems, it is true. 
The directions given to the subjects 
were somewhat unclear, the interviewer 
might better have left the room if the 
interview was to be taped anyway, and 
the technique of waiting for disagree- 
ments might not be so effective as 
Strodtbeck’s technique of making the 
disagreement a fact at the start. Never- 
theless, Vidich’s report highlights many 
difficulties with this technique that are 
not so easily dismissed. 

The major difficulties with such 
measures concern the artificiality of the 
situation. Does the contrived situation 
sufficiently represent the real-life situa- 
tion that the resultant behavior reflects 
on-going patterns? For example, these 
measures require a discussion. If, in fact, 
power is asserted and not based on argument,
the powerful partner in the artificial
situation may not be the powerful
one in real life. In addition, there is the 
question of motivation. The structured
situation often seems like a game to 
the subjects, and one partner may be 
more involved than the other. One 
spouse might have little involvement in 
how imaginary money is spent and thus 
agree to all of the other's suggestions, 
but when the family income is at stake 
the interaction might be very different. 
The structured situation can be taken
as a measure of actual interaction only
when these concerns are carefully considered.
Furthermore, the situation
should be supplemented with reports 
by the subjects afterward as to how 
closely this interaction compares to 
what usually takes place in the family. 

There have been few attempts to use 
observations in structured situations to 
study parent-child interactions. One 
outstanding example, however, is the 
work of Merrill Bishop (1946, 1951) 
referred to earlier. Bishop observed 
mother-child interaction in a playroom 
that had been specifically set up with 
toys and with an adult chair and maga- 
zines. The mother was told to imagine 
that she was in her own home and to 
act as she would there. Mother-child 
interaction was observed from behind 
a one-way screen for two half-hour 
periods. Although the mothers knew 
the observations were taking place, 
they thought the child was the object 
of interest to the observer. In one study 
(1946) a stimulus was presented before 
the second period to half the mothers. 
They were told that their children had 
not acted up to their true capacities. This 
experimental group showed increased 
domination over the child in the second
period, whereas a control group showed
no change. In another study (1951)
mother-child interaction was studied in 
the same way, and this time it was com- 
pared to the child’s interaction with a 
neutral adult. 

These studies are interesting from 
two related standpoints: 


1. The method of obtaining data
eliminates many of the problems of
verbal reports and home observations
while keeping artificiality of the situa- 
tion at a minimum. 

2. These are among the very few 
real experimental studies in this area. 


These two points are related in that 
behavioral observations in structured 
situations are particularly well suited for 
experimental design, and Bishop’s suc- 
cess in developing this method of ob- 
taining data made the experimental de- 
sign possible. To be sure, this method 
of obtaining data is not without flaws. 
Outside the nursery-school population,
subject cooperation might be difficult 
to obtain. In addition, the mother’s be- 
havior might be quite different in this 
situation from what it is at home. A 
study by Schalock (1956) suggests this 
is true. However, it would seem that 
the young child is less likely to behave 
artificially in the situation and, there- 
fore, (1) the child’s response to the 
mother will itself be natural, (2) this 
response might indicate to the observer 
any tendency on the mother’s part to 
behave in a manner unfamiliar to the 
child, and (3) the naturalness of the 
child’s behavior helps to make the situ- 
ation more real for the mother so that 
her normal behavior is more likely to be 
called forth. Furthermore, we have
pointed out several times that the real
danger of bias in the data is that the
bias operates differently for different
subjects. In this study comparisons are
mainly for the same person under different
conditions (e.g., before and after
the stimulus). Thus the differential effect
of bias is much less severe. In the
earlier study in particular, in which
Bishop has before and after measures
for both the experimental group and
a control group, the possibility that unequal
responses to the structured situation
could explain away her findings
seems extremely slim. Bishop’s use of
behavioral observations in structured
situations points this up as a very promising
method for research in the family.
Her demonstration of the compatibility
between this method and the experimental
design makes it particularly exciting.
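
The reasoning about before-and-after measures with a control group can be
made concrete with a small sketch. The scores below are invented for
illustration and are not Bishop's data; the function names and the idea of
representing each mother's reaction to the playroom as a fixed number added
to both of her sessions are assumptions made only for this example. Under
those assumptions, a bias that differs from mother to mother but stays the
same across her two sessions cancels out of the change scores.

```python
from statistics import mean


def change_scores(before, after):
    """Per-subject change (after minus before), paired by position."""
    return [a - b for b, a in zip(before, after)]


def group_effect(exp_before, exp_after, ctl_before, ctl_after):
    """Mean change in the experimental group minus mean change in the control group."""
    return mean(change_scores(exp_before, exp_after)) - mean(
        change_scores(ctl_before, ctl_after)
    )


# Hypothetical domination scores for two observation periods (invented values).
exp_before = [4, 6, 5, 7]
exp_after = [7, 9, 7, 10]
ctl_before = [5, 6, 4, 7]
ctl_after = [5, 7, 4, 6]

effect = group_effect(exp_before, exp_after, ctl_before, ctl_after)

# Assumed bias: each mother responds to the artificial situation in her own
# way, but in the same way in both sessions.
bias = [2, -1, 0, 3]
biased_before = [x + b for x, b in zip(exp_before, bias)]
biased_after = [x + b for x, b in zip(exp_after, bias)]

biased_effect = group_effect(biased_before, biased_after, ctl_before, ctl_after)

print(effect, biased_effect)
```

The two printed values agree because the comparison is made within each
person. A bias that changed between sessions, or that changed more in one
group than in the other, would not cancel in this way, which is the case the
control group is meant to guard against.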


GENERALIZING RESULTS 


It has been said that psychology is 
the study of the attitudes of college 
students in introductory psychology 
courses. It might similarly be said that 
family-life studies deal with the percep- 
tions of mothers of children in nursery 
schools or of children in university-affiliated
schools. This is a problem that
plagues many of us. Essentially our con- 
cern is, to what extent can we generalize 
findings when the group studied is not 
representative of the larger population 
that we want to understand? The state- 
ment at the end of the report of such 
research that the findings cannot be 
generalized beyond the sample included 
in the study is a valid one that is often 
quickly ignored by the investigator as
well as the reader. What is the solution? 
Shall we give up trying to test general 
hypotheses and test only hypotheses 
that are specific to the group under 
study? Or shall we engage in research 
only when we can study representative 
samples? The problem is a complicated 
one and the answer depends on what 
one is trying to study and what methods 
are being used. 

A distinction has sometimes been 
made between what might be called sta- 
tistical generalization and theoretical 
generalization (Jahoda, Deutsch, and 
Cook, 1951; Cartwright, 1953). If one 
is interested in obtaining information 
about populations, such as how many 
parents spank their children or class 
differences in the use of punitive disci- 
pline, statistical generalization is para- 
mount. In such studies, representative- 
ness of sample is crucial, for the data 
have no meaning beyond the population 
the sample represents. However, when 
one is studying relationships between 
variables, e.g., between parental puni- 
tiveness and the child’s aggressiveness, 
the question of theoretical general- 
ization arises. Here the findings are less 
tied to the representativeness of the 
sample, but the sample becomes relevant 
in another way. In such studies the in- 
vestigator must ask: What is there about 
the group I have studied that might have 
affected this relationship? For example, 
the use of physical punishment clearly 
means something quite different in a 
sample of highly educated families than 
in the general population. The educated 
parent who uses this technique probably 
knows that he is doing something that 
is not currently acceptable among his 
group, and the child may very well
know this too. Therefore, the correlates 
of physical punishment in such a sample 
might be generalized to other educated 
families, but they may tell nothing 
about families in different segments of 
society. 

Besides the presence or absence of 
sample characteristics that may affect 
the relationship under study, additional 
factors must be considered. What are 
the limitations imposed by the condi- 
tions under which the observations take 
place? How discrepant are the opera- 
tional definitions from the theoretical 
concepts? What other variables may be 
operating to obscure or enhance the 
empirical relationships? The problem is 
what Jahoda, Deutsch, and Cook have
called “. . . the classic one of induction
from particular facts to the validity
of general scientific laws.” 

Thus representativeness is not always
the primary issue in the study of a prob- 
lem, and the greater accessibility of non- 
representative samples may at times be 
an acceptable reason for using them. 
Further, the best test of the theory 
might require the use of homogeneous 
and nonrepresentative groups. The 
studies of Bowlby and Spitz discussed 
earlier are good examples. Finally, good 
studies in family life often require a 
great deal of cooperation by the sub- 
jects, and the cost of obtaining the data 
on each family may be high. For this 
reason, the use of small samples and 
groups who are willing to cooperate be- 
comes important. Imagine, for example, 
trying to carry out the Bishop studies 
with a national sample. The cost would 
be prohibitive. The question often boils 
down to this: Should we have qualitatively
excellent data on a nonrepresentative
sample or questionnaires and
interviews on a representative sample? 
The answer is that we should have 
both. Representative samples are needed 
for statistical generalizations and to sup- 
plement and test the limitations of the- 
oretical generalizations, but smaller and 
less adequate samples can be used to test 
and generate theories and to study on- 
going processes. The ideal situation will 
exist when the two kinds of studies are 
carried out in coordination so that the 
limits of our theoretical generalizations 
can be checked statistically. 


Summary 


This is a chapter about measuring 
characteristics of the family as they are 
relevant to understanding the child’s 
development. We have given emphasis 
to the research objective of seeking 
causal connections between family vari- 
ables and some aspects of child behavior 
and development. By using a causal
sequence outline we have discussed
many of the concepts and mentioned 
some of the ways they have been used 
theoretically, some of the ways they 
have been measured, and some of the 
problems they pose in study design. We 
have also taken up a few of the more 
general methodological issues that seem 
to be of particular concern in family 
studies. We have only touched on each 
of these areas. Our bibliography is a 
long one, yet it leaves out a great deal 
of important and relevant research. The 
message we have hoped to communicate 
with this chapter is that a great deal of 
work has been done, and whether a
study represents a failure or a major
contribution there is often something
to be learned from it. By considering
the relationships that have been demonstrated
and the methods that have been
used, by planning a new study carefully
so that the errors and gains of the past
can be utilized, we can build a substantial
structure, with each new study
adding to our theoretical understanding
of the process by means of which family
life affects the child.
type of subject, 380
control of extraneous stimuli, 390 
control for verbal processes, 380 
CS-US interval, 383 
curve of, 382 
definition of, 377 
differential, 333, 396 
in simultaneous discrimination problems, 
397, 399 
eyelid, 384-385 
galvanic skin response, 157-158, 188, 292, 
296-298, 385 
instrumental, 377, 385-386 
avoidance, 393 
controlled-operant, 386, 390 
discrete-trial, 386, 388-389 
in early childhood, 176 
escape, 392 
Conditioning, instrumental, free-operant, 386, 390
higher order, 393 
of infants, 158-168, 391-392 
in later childhood, 188-189 
reward, 378, 388-389, 390ff. 
runway, 389 
types of, 386 
types of problems studied, 386 
methods of recording, 384-385 
methods of rewarding, 389, 390, 391, 393 
neonatal, 379 
operant, free, 168, 177 
(instrumental) and cooperative behavior, 
188 
restricted, 168, 188-189 
types of, 377-378 
verbal, 538-539 
Conditioning techniques for hearing tests,
297
Conduct tendency, 777 
Confidentiality of data (see also Ethics), 650- 
651, 653 
Configuration, 342 
Conflict, choice of acts, 693 
choice of objects, 693-694 
Conflict behavior, 192-193 
Conflicts in child created by research, 653 
Consent for research, 23ff. 
Constancy, 315, 344ff. 
in animals, 344 
of color, 344, 351 
of interspaces, 344, 350-351 
shape, 344, 351 
size, 326, 344, 345, 348 
Constant method, 314, 333-335 
Construction techniques, projective, 604, 621- 
627 
Construct validity, 773-778 
of tests, 474 
Contact sensitivity, 303-304 
Contamination of information, 53-54 
Content analysis, 537 
Continuous records, parallel, 742-743 
time sample of, 743ff. 
Continuum coverage in observational child 
study, 72, 73
Contrasted groups as criterion of test vali- 
dity, 474-475 
Contrasts, method of, 320 
Control, of experimental setting, 166 
in experimental technique, 140 
in groups, defined, 834 
Control groups, for effects of aging, 48 
maturation, 48 
for relevant comparison, 49 
Controls, selection of in longitudinal research, 
26 
Cooperation, organizational, 989 
and resistance to research, 988 
subject's, 650-651, 989 
Cooperative behavior, observed, 185 
Correlational approach, in studying children’s
groups, 826-827 
Correlational techniques, 140, 141 
Counterbalanced experiments, in children’s
groups, 827
Countersuggestion in interview, 565, 582
Cross-sectional study, 41, 51-52
advantages over repeated-measurements
method, 52
Crying, as a response, 163-164, 171
Cues, irrelevant, 321
recording of, 877
sensitivity to, 659-660
unconscious, 313
Cultural factors,
in social situation, 856-857
in test scores, 479-484
Cultural milieu and IQ changes, 476
Culture, concept in child rearing, 918-944
and customary behavior, 924-926
Culture-free tests, 482-483
Culture shock, 932
Customary behavior and culture, 924-926,
934
Custom complex, 921-924, 933 


Davis-Eells games, 483 

Day-at-home measure of parental functions, 
962-963, 995

Day-care centers, subject source, 172 

Deaf mutes, classification and language in, 


2 
Death of father, 956-957
Definitions (see also specific entries, such as 
Discrimination learning, Flexibility in 
groups, Group autonomy) 
equivalence of, 12 
inapplicability of, 13 
operational or empirical, 9, 11, 13, 17 
scientific, 9, 11 
Definition tests, in Binet scales, 438 
Delayed reactions, 425-426 
Delayed response, 167 
Democratic group, 830 
Dependent behavior, observation of, 898-899 
recording of, 907 
Dependent variable, 140-141 
Depression as family-setting variable, 953 
Deprivation, of food, 692-693 
with regression, 731-732 
of stimulation, animal experiments, 365 
blind-born, 365 
of toys, 693 
Deprivation-satiation cycle, 165, 181 
Depth perception, 335-336 
Derived measures, anatomic, 234 
Descriptive categories in observational child 
study, 125 
Descriptive research, 648 
Detour (intermediary) in problem solving, 
423, 425
Development, anatomic, 201 


SUBJECT INDEX 


Developmental change, age and environmental
changes, 47 
age in limited populations, 42 
age with 7 variables, 49-50 
analysis of, 36 
function of age, 39 
response variation with age and other vari- 
ables, 39-50 
theoretical principles in, 50 
Developmental characteristic, 36 
Developmental formula, inadequacy of, 36-37
Developmental level, and attitudes and values, 
656, 657 
and explanation of research purpose, 651 
influences on research, 671 
Developmental method of Piaget, 427-428 
Developmental stage as theoretical construct, 
60 


Developmental stages in perception, cognition, 
and motivation, 649 
Developmental status, 36 
Developmental transitions, 428 
Deviation IQ, 470 
Diary description, 73, 75, 79-83 
Diary records, 81, 82, 88 
of interaction, 901
Diary of speech development, 529-530, 542, 
543 
Dietary history and records, 273 
Difference limens, 287 
Differential conditioning, 333, 396 
Differential responses in infants, 318-321 
Differential sensitivity, 333 
Differential thresholds, 288 
Differentiation, 320 
of abilities, 477-478 
Directive versus nondirective interviews,
574-575, 577
Disaster as family-setting variable, 953-954 
Discrimination, 142, 143, 145, 156, 164, 169, 
170, 177, 178, 179 
in animals, 321 
color, 291-292, 319, 333 
of form, 169, 170, 321, 338 
pattern, 319 
precision of, 320 
size, 321 
social, 165 
and socioeconomic background, 189 
two- and three-dimensional, 337-338, 361- 
362 
Discrimination learning, 338, 395-402 
definition of, 395 
effects of language facility on, 402 
intermediate size problem, 401 
methods of rewarding, 398 
orienting responses, 401 
patterned, 395, 399 
relational, 400-401 
definition of, 400 
set, 401 
simultaneous, 301ff., 395, 397-398 
difficulty of, 398
instructions for, 398 
measures of, 397, 398 
spatial and nonspatial cues, 398 
spatial, 397 
and maze learning, 410 
stimulus pretraining in, 402 
successive, 395, 399-400 
theories of, 395, 399 
transfer of training in, 401 
transverse patterning, 401 
types of, 395, 400 
Discriminative processes (see also Discrimination),
156, 159-161, 177-180, 189, 294-295
Disguised approach of investigator, 659-660 
Disguised questions, 655 
Displacements, group of (Poincaré), 429 
Disposition, 771-772 
versus behavior, 734-735 
Dispositional construct, 771-773 
Dispositions, goal-oriented, 777 
Distance, perception of, 332, 334, 335 
Distinctive features (phonemic), 525-526,
528-529
Distortion, cognitive, 996
deliberate or conscious, 992, 993
motivation for, 995
nondeliberate or unconscious, 996
perceptual, 325-326, 991-992
radiographic, 168
Diurnal fluctuation, 204, 205-206
Divorce, 956-957
Doll play, and achievement-fantasy scores,
808-810
and achievement motivation, 807
as projective technique, 565, 584-587, 657,
667, 680, 729-730, 777, 788-790, 807-810
reliability of, 788-789 
structuring questions, 729-730 
validity as index of aggression, 789 
Doll-play analysis, achievement theme,
807-808
and behavioral observations, 807-808 
work routine theme, 807 
Doll-play interview, 565, 584, 587 
for appraisal of aggression, 788, 790 
Dominance in mothers, 970
Dominant and submissive behavior, observa- 
tion of, 896 
Domination, adult, 826 
Domination categories, 896-897 
Double-alternation problems, 425, 430 
Dough situation test, 607, 632 
Dramatic techniques in attitudes and values, 
667 
Draw-a-person test, 632-633 
Drawing and painting, methods, 606, 632-633
scoring criteria, 726-727 
Drawings, 338, 777 
Dreams, 605, 626-627 
Drives, see Motivation, Learning (see also
specific needs, such as Achievement, 
Aggression) 

Duplicity theory, 291 

Duration, 360 

auditory, 361 

Dvorine test, 292 


Echolocation, 366 
Ecological aim in observational study, 78, 81- 
82, 91, 108 
Ecological applications of observational study,
12 
Editing behavior, 886 
Education, importance of child study to, 3 
Egocentrism, 438 
Ego development, response sets in, 777 
Electrodermal response, 157, 158, 188, 296-298,
385
Electroencephalographic (EEG) threshold in
hearing, 296-297 
Electronic tracking of children’s movements, 
124 
Electroretinography, 288 
Elementary-school children, testing problems, 
467-468 
Eliciting stimulus, 158, 159, 166 
Embedded figures, 341, 360 
Emotional climate and IQ changes, 476 
Emotional factors, in biochemical studies, 262 
in later childhood, 186-187 
Emotional orientations, 779 
Emotional responses, conditioning of, 385 
Empiricism, 311 
Equilibrium, sense of, 302 
Encouragement as reinforcement, 170, 183 
Endocrine balances, 255 
Environment, enriched, 366-367 
external, 256 
homeostatic internal, 256 
internal, 254, 256 
and mental disturbance, 256 
past, 144, 146 
physical, 255 
present, 141, 142 
restricted, 365-366 
Environmental deprivation, 365ff. 
Equivalence, point of, 316 
Equivalent-form reliability, 470-471 
Error of the standard, 316-317, 348 
Errors, average, 313 
Constant, 313, 325-326, 337, 347, 352 
sources, in child performances, 24 
systematic, 321 
Escape training, 184, 190-191 
Ethics, in anthropological research, 919 
in attitude and value research, 650-654, 670 
in research, 28-30 
Ethnocentrism, 967, 969 
Ethnographer, 926ff. 
Ethnographic domain, 924, 931 
Ethnographic interview, 918, 926-928 
Ethnographic sample, world, 875
Ethnoscience, 919
Event sampling, 73, 75, 104-108 
Evoked potentials, 288 
Examiner reliability, 472 
Excretory activity, 256
Experimental extinction, 382, 392 
in differential conditioning, 396-397 
generalization of, 394 
Experimental manipulation, advantages and 
disadvantages of, 698-699 
Experimental method, child’s acquisition of, 
422 
Experimental technique, 140-145 
Experiments, meaning to subjects, 700-702 
natural, 706-710 
planning, 699-700 
Explanation, of experiment to child, 29 
post experimental, 30 
Explanations, children’s, 432-433 
Exploratory investigations, contributions of, 
702-704 
Exploratory survey, without explicit hypoth- 
eses, 51-52 
of raahon of child, 51 
Expression, avoidance of, 735-736 
directness of, 735-736 
Expressive acts, 885 
Expressiveness as response set, 776 
Expressive qualities, identification in human 
faces, 356-357 
identification in voice, 358-359 
Expressive techniques, 605, 630-635 
Extended family household, 935, 937-938, 940 
External criterion in test validation, 58 
Extinction (see also Experimental extinction), 
158, 162, 167, 177 
Eye reflex on the neck, 288 


Fables (see also Story techniques), 717 
Facial expression, identification, 358 
Factor analysis, in construct validation, 774
in differentiation of abilities, 477-478 
Failure, difficult puzzles, 694 
difficult tests, 694 
reactions to, 183 
timed tests, 694-695 
Family, see Chapters 21, 22 
child’s perception of, 982-985 
lack of comparability, 990 
large, and personality, 957 
as small group, 990 
status within, 990 
Family composition, 946, 954-959 
cross-cultural studies of, 918-944, 954 
data, accessibility of, 985-991 
decision making, 981 
disorganization and juvenile delinquency, 
965 
Family history, 991 
Family interaction, observation of, 42-46,
999-1003
spatial index for, 954
Family members, correlations between per- 
ceptions, 997 
Family play test, 607, 632 
Family privacy as barrier to obtaining sub- 
jects, 987-989 
Family relations questionnaire, 983 
Family setting, bureaucratic, 948, 950-951, 953 
entrepreneurial, 948, 950-951, 953 
variables in, 945-946, 949, 954 
Fantasy, adolescent, 722 
and need achievement, 802-803, 807-808 
and reality, distinctions between, with age, 
657 
relation to behavior, 614-615 
Fantasy aggression (see also Aggression), 785 
Guttman scale of, 786 
predicting overt aggression, 785-786 
Fantasy content, related to picture cues, 786
Fantasy materials and achievement, 805 
Fantasy measures, usefulness, 810-811 
Father, absence of, 956-957 
as disciplinarian, 991 
occupation of, 952 
Favorability, as response set, 776 
Fear, 691 
Fears, children’s, parental reports, 797-798 
Feeble-minded children, animism in thinking 
of, 435
Fels child behavior rating scales, 782 
Fels parent behavior rating scales, 116-117 
Fels scales, 116-117, 745-746, 782 
Field methods, 146 
Field strategy in observation, 879 
Field unit analysis, 73, 75, 108-113 
Figure, ambiguous, 329, 341 
asymmetrical, 341 
embedded, 341 
geometric, 341 
hidden, 329-330, 341 
overlapping, 341 
reversible, 329-330 
symmetrical, 341 
Figure-ground relationship, 340, 360 
Finger painting, 606, 633, 634 
Fixation preference, 319, 332, 363 
Flexibility in groups, defined, 834 
Focused interview, 561 
Follow-up studies, 475 
Forced choice method, 192 
Formants, speech, adult versus child, 498
Formboard, 169, 170, 338 
Form class (see also Syntax) 548-553, 554 
Form discrimination, 169, 170, 321, 338 
Form perception, 315, 337, 344, 361 
tactual, 337 
Form preference, 169 
in child speech, 553-554 
Forms, geometrical, 343 
incomplete, 343 
intersecting, 343 
nonsense, 315, 343 
Four picture test, 604, 624
Fractionation, method of (ratio judgments),
317
Frame of reference in groups, 836, 844-845 
Free association, 715 
Free play, and aggression, 728-729
and negative attitudes, 728-729 
Free variation among phones (see also
Phones), 523-527
Freudian concepts, influence on behavior 
theory, 6-7, 59 
Frustration, aggressive reactions to, 787 
in early childhood, 182, 183 
in infants, 163-164 
Frustrating situations, 692-698 
Functional capacity for growth, 256 




Galton bar, 313, 325 
Galvanic skin response, 157-158, 188, 296-298,
385
Genealogical method, 918, 930-933 
General factor, 477-478 
Generality, assumption of, 775 
Generalization, 320, 338
social, 165
statistical, 1003-1004
theoretical, 1003-1004
of verbal concepts, 188
Generalizing from results, 1003-1004 
General to specific controversy, 6 
Genetic endowment, 141, 146, 147 
Genus-species relationship, comprehension of,
437
Geographical area as subject source, 986 
Gestalt factors in form perception, 340, 344 
Gestalt psychologists, 322 
Gestalt theory, in child development, 59
in learning of language, 442
Global perception, 340 
Goodenough draw-a-man test, 466, 472, 483,
32
Gottschaldt figures, 329-330, 341
Grammar, child’s, transformations in, 549 
Grandmother in the home, 957 
Grapheme, 524 
Group, selection of, 828 
Group atmosphere, authoritarian, 830 
democratic, 830 
laissez-faire, 830 
Group autonomy, defined, 834 
Group behavior, observation, 835-836 
Group cohesiveness, 832 
Group control, defined, 834 
Group dynamics, 821, 834 
Group factors, 477 
Group flexibility, defined, 834 
Group hedonic tone, defined, 834 
Group homogeneity, defined, 834 
Group interview, 590-591 
Group intimacy, defined, 834-835 
Group participation, defined, 835 
Group permeability, defined, 835 
Group play techniques, 606, 631-632
Group polarization, defined, 835
Group potency, defined, 835 
Group pressures, 832-833 
Group research approaches, 825-827 
correlational approach, 826-827 
counterbalanced experiments, 827 
normative-descriptive study, 825-826
Groups, experimental creation of, 680ff.
matched, 149, 150
permeability in, 835
previously established, 700ff.
random, 149ff. 
stratified, 828, 835 
Group size, defined, 835 
Group stability, defined, 835 
Group structure, 833-834 
matrix of, 873 
Group tests, 458 
Group viscidity, defined, 835 
Growth, invisible, 252, 258 
living, 253, 255, 260 
physical (see also Anatomic ontogenesis), 
201 
protoplasmic, 255, 260 
rate of, 256 
skeletal, 255, 260 
visible, 252 
Growth curves, 5 
Growth requirements, food, 254, 255 
hormonal, 255 
nonhormonal, 255 
non-nutritive, 255 
nutritive, 254-256 
Growth studies (see also individual studies), 5 
Guess-who technique, 847 
with third-grade children, 784 
Guilt and socialization, 939 
Gusii family, 935, 941 
Gustatory sensitivity, 299-300
Guttman scale of fantasy aggression, 451, 786 


Habit strength, 375-376 
Halo effects, 778 
Handedness, 169 
Handwriting, 634, 635 
Happy family, value on, 992-993 
Hearing, 160, 295-298 
EEG threshold of, 296-297 
in newborn, 295-296 
Hearing tests, learning techniques in, 297-298 
meaningful sounds, 297 
validation of, 298 
Hedonic tone in groups, 834 
Heredity and environment controversy, 6 
Hidden figures (see also Figure), 329-330 
Historical variables, 144-146 
History, 141, 144
Holism, influence on projective techniques, 
607-608 
Homogeneity in groups, 834 
Hopi, animism in thinking, 435 
Hormonal activity, 254, 256
Horowitz faces test, 674
Hospitals as source of subjects, 986 
House and family setting, 954 
Household, 935-940 
communal, 935-936 
mother-child, 935-936 
polygynous, 935-936, 938 
House-tree-person test, 606, 632 
Hue discrimination (see also Color discrimi- 
nation), 291, 332 
Human versus animal research, 23
Husband-wife relationship, divisions of func- 
tions, 960-963 
power, 959-961 
Hypotheses, in body of theory, 55 
collateral, 10 
contingent and noncontingent, 8 
empirical, 9, 12, 15 
sources of, 51 
testing of, 9, 12, 54-55 
Hypothetical situations, inferences from, 655, 
665-666 


Identification, with parent, 960, 967, 975 
of stimuli, 318, 361 
Identification learning, 322 
Identification models, 936 
Idiographic aim in observational child study, 
79, 81-82, 103 
Idiographic applications of observational child 
study, 129 
Illumination, control of, 156 
Illusion, Müller-Lyer, 325, 363-364 
size-weight, 334, 363-364 
vertical-horizontal, 363-365 
Illusion experiments, 325-326, 331, 363-365 
Imposed leadership game, 607, 632 
Impoverishment (see also Deprivation, En- 
vironment), 328 
Incentives, concrete reward, 695 
controlling success and failure, 696 
invidious comparison, 696 
involvement in the task, 696 
praise and blame, 695 
temptation, 697 
Inclusion, 443-444 
Incomplete sentences (see also Sentence com- 
pletion), 605, 628, 655, 716-717 
Incomplete stories (see also Story comple- 
tion), 663, 667, 717-719 
Independence training, 937-938, 940 
Independent variable, 140, 141 
Indicator responses, 318-319 
Indices of aggressiveness (see also Aggression, 
Aggressiveness), 781 
Indirect measurement techniques, 660-661 
Individual tests (see also Tests), 458 
Individual variation, 37 
Inducements, 165, 174 
in later childhood, 186-187 
Infancy, 148, 151-172 
early, investigative procedures in, 154-165
Infancy, later, 165-172
investigative procedures, 165
Infant speech, recording of (see also Speech,
Speech and language development,
Speech sounds), 490-491
Infant tests, 46, 426, 457, 465, 473, 476
Infant vocalization, 488, 491-492
Inferential behavior, 423-424
Inflexion (see also Speech Inflexion), 540-547
Influence techniques, 968, 978
Informant, 926ff.
Informants, relatives or friends of family,
995-9 2
as sources of subjects, 986
Initiation ceremonies, 942 
Inkblots, Rorschach, see Rorschach test 
Innate releaser, 319 
Insight, 422, 423, 424, 429
gradual, 660-661
Insight test, 605, 629
Instigations, external, in interaction, 855-856
standardized, 871
rare, 872
sampling of, 867-870
Institutionalized children, 955-956
Institution, cultural, defined, 923
Institutions, 152, 154, 185-193
Instructions, 318, 340, 348-349, 354
children’s lack of understanding, 23
in later childhood, 186
neutral, 691-692
threatening, 692
verbal, 166, 174, 188, 189, 190
Instrumental acts, observation of, 886-887
Instrumental aids in observational child study,
122-125
Intellectual operations, 421, 448
genetic analysis of, 445-449
Intelligence, factorial conception of, 450
quantification of criteria, 450-451
Intelligence quotient (IQ), changes in, 476
constancy of, 476
ratio, 469
Intelligence tests, 461, 475
split-half reliability, 471
Interact analysis, 909
Interaction, 862
coding of, 890
continuous record of, 906
diads or triads, 903-904
external instigations in, 855-856
internal instigations in, 857
objective approach to, 857
personality differences in, 872-873
sampling of, 870-875
single acts in, 861
subjective approach to, 857
Interact recording, 862, 915
Interact sequences, recording of, 907
Interacts, in everyday settings, 911
Intermediary, use in problem solving, 423- 


gs 
International Kindergarten Union test, 492
International phonetic alphabet, 488-490
Interpersonal activities, 864 
Interpersonal behavior, see Chapter 20 
categories for recording, 891-892 
recording of, 875 
Interpersonal interaction, 863-864 
Interpersonal relationship, in interview, 568, 
573 
Interspaces, constancy of, 350-351 
Interval of uncertainty, 352 
Interview, with adolescents, 572 
advantages and limitations, 598-599 
anxiety in, 582-583 
in attitude and value measurement, 661- 
662, 668-670 
atypical children, with, 573 
content analysis, 595-598 
countersuggestion in, 565, 582 
defining situation, 570-572 
at different developmental levels, 563-568 
directive versus nondirective, 574-575, 577 
doll-play, 565, 584-587 
effects of preceding experiences, 570 
focused, 561, 672 
group, 590-591 
language limitations in, 562-565 
life space, 591-592
mechanical recording, 593-594 
middle childhood, 572 
motivating response in, 565-566, 568 
multiple-choice techniques, 586, 588 
negativism in, 562 
nonverbal communication, 592-593 
observation of behavior during, 592-593 
with parents, 751-752 
personality data source, 776 
Piaget’s méthode clinique (see also concen- 
tric clinical method), 433, 451, 583-584 
picture techniques, advantages and limi-
tations, 564-565, 587-590
play materials in, 566, 570-571
with preschool children, 563-565, 571-572 
probing, 582 
questions, degree of structure, 575 
direct, 576-578 
formulation of, 573-579 
indirect, 576-578 
open-ended, 579 
projective, 576-578
sequence of, 580-581 
structured versus unstructured, 578 
rapport in, 562, 565-566, 570-571 
research applications, 561 
research versus therapeutic, 569-570 
resistance at different ages, 565-567 
sentence-completion, 587
setting and interpersonal relationship, 568-
573
sociometric, 672 
standardized versus free, 574, 581 
story-completion, 587 
structured, with special informants, 750 
suggestibility in, 582
Interview, unstructured, advantages and limi-
tations, 574-578 
Interview data, coding, categories, 595-598 
coding, reliability, 597-598 
validity, 597 
variables, 596-598 
unit of analysis, 59-597 
Interviewer, authority role, 668-669 
bias, 431 
role relationships, 572 
sex, influence of, 566, 568 
symbolic value for child, 669 
Interview responses, amount of detail re- 
corded, 593-594 
effects of recording, 593-594 
mechanical recording, 593-594 
Interview schedules, construction of, 578-582 
length, 581 
pretesting, 578 
Interviews on motivation and feeling, 711ff. 
Interviews with peers, 753 
Introspective material, supplement to obser- 
vations, 999-1000 
Invariants, 427 
the permanent object, 427-428 
physical and spatial world, 428 
Inventories, 426, 711-712 
Inversion, recognition of, 339 
Investigator, effect on subjects, 65, 659-660 
IQ, changes in, 476 
constancy of, 476 
ratio, 469
Ishihara test, 292
Isolates, in groups, 843 
Isolation cabinet, 296 


Judgments, child’s, 436 
Juvenile delinquency, and family disorgani- 
zation, 965 
relationship between mother and son, 948 


Keystone telebinocular, 294
Kinesics, 113
Kinesiological research, 123
Kirkpatrick family interest scale, 964, 966
Kuhlman mental test, 505
Kwoma families, 928
Kymograph, 159, 162, 167


Laboratory, 140, 141, 142, 143 
child’s home, 153, 154 
hospital, 152, 154, 159 
institutional, 152, 154, 173 
mobile, 153, 172, 391
nursery schools, as source of subjects, 172-
17
Labyrinthine receptors, 302
Laissez-faire group, 830
Landolt rings, 340
Language (see also Linguistics, Speech and
subentries, and specific entries such as
Phoneme, Verbal learning, Words)
Language, and concrete associations, 440
as second signal system, 441 
Navaho, 533-534, 553-554 
Russian, 543-5 
Language acquisition, productive control, 521, 
527-528, 531 
receptive control, 521, 527-528, 531 
in twins, 441 
Language development (see also Speech and
language development, other entries
under speech, and specific entries), 487-
557
Language elements, allograph, 524 
allomorph, 540, 546 
allophones, 524, 530, 540 
grapheme, 524 
morpheme, 518-532, 539-548 
phone (see also Phones, Speech sounds), 
521-531 
phoneme, 518-534 
voiced-voiceless contrast, 519-525, 530 
written, 524 
Language functions, social behavior, 509 
egocentric speech, 509 
evocative, 510 
expressive, 509 
representative, 510 
Language limitations in young children, 657 
Language skills, interrelations, 510 
measures of, 494-495 
tests of, 492 
Language systems (see also Language ele- 
ments) 
morphology, 518-521, 539-548 
phonology, 518-534, 540 
syntax, 518, 519, 539 
Language and thought characteristics of chil- 
dren, 658 
Later childhood, defined, 148 
studies of, 185-193 
Laughter, 690-691 
Leadership, in children’s groups, 833 
supervisory, 831 
Leadership games, 606-607, 632, 634 
Learning, definition (see also specific types), 
37 
discrimination (see also Discrimination 
learning), 338, 395-402 
goals in study of, 374
versus insight, 422-423 
instrumental, in infants, 164 
meaning of, 375 
social, 164 
trial-and-error, 385-386 
Learning sets in early childhood, 177 
Learning techniques in hearing tests, 297-298 
Learning theory in child development, 59 
Leiter international performance scale, 483 
Length, perception of, 334 
Level of aspiration, 775, 978 
correlates of, 809 
in preschool children, 810
Lever pressing, 168
Life cycle and growth, 253-255
Life space interview, 591-592
Limen, two-point, 286-287, 314, 334, 337
Limits, method of, in detecting thresholds,
313-314, 316, 328, 334
Linguistic determinism, 555
Linguistic reference, 519, 520, 521, 527
and form class, 551-553
Linguistic relativity, 555
Linguistics in child speech (see also Language,
Speech and subheadings, and specific
entries, such as Grammar, Phoneme,
Verbal learning), 517, 555
Locomotion, 171
Locomotor sequences, 179
Logic, intuitionist, 450
mathematical, 450
Logical ability, tests of, 432
Logical errors, 449
Logical models, 449
Logical multiplication and matrix tests, 444
Logical principle, child’s discovery of, 422,
30
Logical quantifiers, 437
Logical thinking, 432 
Longitudinal principle, 81
Longitudinal research, change of interests in, 
26 
history of, 5, 26 
lack of environmental controls, 25 
original strategy, 25 
sample selection, 26 
study of early experiences, 26 
test of complex theories, 27 
Longitudinal study, 40, 41, 49, 50, 52, 53, 56, 
81, 426, 451 
attrition in, 62 
cases of lost sign, 44 
changes in attitudes, 952-953 
changes in procedure, 63-65 
cross-validation, 54 
cultural shifts in, 41-42 
data-analysis problems, 65-67 
as data-collection device, 64 
inefficiency of, 54 
intelligence tests in, 475 
of intrafamily comparisons, 959 
matched group design, 41 
optimal number cases, 63 
parental cooperation, 62-63 
parents’ backgrounds, 949 
post hoc analyses, 53 
practical problems, 61 
of rates of change, 47 
repeated measurements design, 49, 50 
replication in, 53-54 
search for relationships in, 64 
as source of hypotheses, 64 
as technique of confirmation, 64 
as variation study, 52-54 
Lorge-Thorndike intelligence test, 458 
Loss of support, 193
Lost sign, 44, 46
Luminosity, equated in color testing, 291-292 


Magic, 432, 434 
Make-a-picture story test, 604, 624 
Manipulation, experimental, 48, 698-699 
validity of methods, 757 
Manipulation of subjects, physical, 690 
psychological, 691 
Manipulative study, dangers of, 38 
isolation of manipulable antecedents, 47 
random assignment, in, 48 
Manipulatory sequences, 179 
Man-to-man defense in observation, 881 
Manus, animism in thinking, 435-36 
Marital adjustment scales, 966 
Marital adjustment and tension (see also
Parents’ relationships), 963-966
Massachusetts vision test, 294 
Matched groups, 149, 150 
Matching color, 333 
Matching form, 338 
Matching methods, 321, 333 
Material coverage in observational child study, 
72, 73
Maternal reports, retrospective, 44-45 
Maternal separation, 171, 693, 955-956 
Maternity hospitals and wards, 152, 159, 163
Matrix, of group structure, 873 
of nominations, 841 
Matrix multiplication, 843 
Matrix tests, 444 
Maturation, 156, 158 
motor, 320 
perceptual, 320 
as theoretical construct, 48 
Maturation and learning, controversy over, 
46 
Maze learning, 409 
Mazes in later childhood, 189, 192 
Measurement, anatomic, see Anatomic
measurement
Measurement, spontaneous, 446 
Measurement scales, anatomic, 228 
Measurement techniques, choice of, 649-654 
indirect, 660-661, 994 
Mental age, 469 
Mental development, scales of, 438 
Mental health, importance of child study to,
3ff.
Mentality, child’s, 434
primitive, characteristics of, 432 
Mental measurements yearbooks, 456 
Mental operations, 436 
Mental-testing movement, 5 
Merrill-Palmer scale, 469 
Metabolic balance, 257, 258, 260
method, 257, 258, 259, 261
Metabolic and biochemical growth,
absorption, 253, 255, 256 
digestion, 253, 255, 256 
utilization, 253, 255 
Metabolites, urinary, 258
Michigan picture test, 624, 996
Michotte disk, 323, 354-355
Milk as eliciting stimulus, 158, 159, 162, 166,
167, 168
MMPI in study of stutterers’ mothers, 968
Miniature life toys, 607, 631
Minnesota preschool scale, 457, 477
Minority, in groups, 831
Miscommunication as source of distortion,
995-998
Misinterpretation of child’s responses, 653-654 
Mixtecan families, 925, 939, 941 
Molar behavior, 9 
Mold and cast aie 219-222 
Molecular behavior, 93 
Montessori method, 366-367 
Mormon families, 925, 936-937 
Moro reflex, 164 
Morpheme, 518, 531, 539-548 
bound, 518, 539, 541 
defined, 518, 531-532, 539 
free, 518, 541 
Morphology (see also Language elements, 
Language systems) 518-521, 539-548 
Mother behavior, categories for describing, 


RATA: 

Mother-child household, 935-936 

Mother-child interaction, LE of, 904 

sequential aspects of, 906
Mothering behavior toward infants, 981
Mothers’ absence, 171, 693, 955-956
Mothers’ character structure and child ad-
justment, 971-972
Mothers’ personality and stuttering children,
968
Motility, 160
Motion, velocity, 356, 360
Motion pictures, of child behavior, 122, 124,
125
of fear, 182 
as projective device, 724-725 
of restraint, 163 
Motivation (see chapter 17), 156, 158, 161- 
163, 170, 321, 688-769 
for achievement, 802-812 
in early childhood, 181 
in infants, 161-163 
in later childhood, 190-192 
771-773 
test validation, 756-758 
Motor skills, 183
Movement, apparent (stroboscopic), percep- 
tion of, 353-354 
complex, perception of the causal effect, 
354 
perception of, 352-356 
lower threshold, 352 
velocity, 352-353 
Movements, observation of, 882-885 
Mouth region, early sensitivity to touch, 304 
Mouthing, 169 
Müller-Lyer illusion, 325, 363-364
Multiple-aptitude batteries, 464, 477
mals studies, ie 1, 193
Makita research designs, gi 
Muscle mass index, 266


Norms in group, 847 
Northwestern infant intelligence test, 457, 473
Peers, interviews, 753
sociometric choices, 753-754 
Per cent agreement, 86, 99, 838-839 
Percentile rank, 469 
Perception, of area, 335 
effects of early experience on, 311 
global, 340ff. 
in infants, 319-320 
of movement (see also Movement), 352- 
356 
and personality, 326, 331, 362-363 
of space, 320, 334 
syncretic, 342 
Perception theory and projective techniques, 
613-614 
Perceptual configurations and operational 
mechanisms, 444 
Perceptual distortion, 325-326, 991-992 
Perceptual learning, 366-367 
Perceptual organization, 320 
Permeability in groups, 835 
Personal documents, 677-678 
Personality and perception, 326, 331, 362- 
363 
Personality characteristics, defined, 771 
sources of data, 776-778 
Personality development, 146 
Personality scales, standard, 745-746 
Personality traits, theory of change, 22 
Personality variables, sources of error in 
study, 812-813 
Phenomenal attitude, 350 
Phenomenal qualities, 312 
Phenomenal world, 322-32 
Phenomenological Reena 322-324, 354-355 
validity of, 324 
Phoneme, 518-534 
in child speech, 526-531 
complementary distribution with phones,
523-524, 527
defined, TRH 
distinctive features, 525 
free variation, 524, 527-528 
speech sounds, 489 
Phones, relationships among, 523-527 
in child speech, 527 
complementary distribution, 523-524, 527 
defined, 521-522 
free variation, 523-524 
Phonology (see also Language elements, Lan-
guage systems), 518-534, 540
Phorias, tests for, 294 
Photographic technique, 217 
Photographs, rating scales, 216 
research usefulness, 215ff. 
Physical anthropology, 269 
Physical environment, 255 
Physical measurements (see also Anatomic
Measurement), 201, 229-237
Physical pathologies, 707 
Physiognomic qualities, 356, 359, 368 
Physiologic capacity and utilization of foods,
255
Physiologic functions, 254
Physiologic methods, and nutrition, 265-267 
of studying growth, 265-267 
Piaget’s méthode clinique (see also Concen- 
tric clinical method), 433, 451, 583-584 
Picture-arrangement test, 605, 635, 636 
Picture-choice technique in interviewing, 587- 
588, 590 
Picture cues and fantasy content, 786 
Picture frustration test, see Rosenzweig pic- 
ture frustration study 
Picture-interpretation in appraisal of aggres-
sion, 784
Picture-interview techniques, 587-590 
Pictures, cloud, 604, 620 
cultural factors in perception of, 361 
designed for specific motive, 786 
in intelligence tests, 361 
measuring anxiety in nursery-school chil-
dren, 800
Picture-story test for perception of authority,
978
Picture techniques with sociometric-type ques-
tions, 655, 662, 667
Picture test of anxiety, 725 
Picture tests (see also Projective techniques 
and specific tests), 604-606, 624-635 
Pinna reflex, 295 
Pintner general ability tests, verbal series, 
458 
Pintner nonlanguage test, 458-459 
Pitch, 333-334 
Plane figures, perception of, 335
Play, 171 
examiner’s influence, 740 
explaining research purpose, 651 
highly structured, 730-731 
in study of aggression, 788 
Play behavior, indicating change, 731-732 
Play symbols, 437 
Play techniques, 728-732 
group, 606, 630-632 
Pneumograph, 160, 193 
Point of subjective equality, 286 
Polarization in groups, 835 
Polygraph, 160 
Polygynous household, 935-936, 938 
Population change and IQ tests, 479 
Population-sampling techniques, 987 
Position habits, 170 
Positioning subjects, 205-207, 212-213, 219 
Position of objects, 315 
Posture scales, 216 
Potency in groups, 835 
Potential leadership game, 606, 632 
Power relations in family, 910, 1000-1001 
Power test, 461 
Practical problems, fathers as subjects, 704 
limitation of space and time, 704 
limitations in sampling, 706 
mothers as subjects, 705 
teacher anxieties, 704
Practice, and coaching, effect on intelligence
tests, 468 
effects of, 325, 327, 329 
Preadolescence, 148, 185 
Precategorial level in child’s thought, 442 
Precision, index of, 312 
of judgments, 347 
Preconcepts, 43 
Preconceptua iba 442 
Predictive validity of intelligence tests, 473-
474
Pre-experimental conditions, 165 
Preference, fixation, 319, 332, 363 
Preference method, 169, 192 
Prefixes and suffixes, derivational, 543 
Pregnancy, premarital, and effects on parents,
9
Prejudice in parents and children, 967, 971,
975
Premature birth, 986 
Preschool children, testing problems, 467 
Preschool tests, 457-458, 465, 473, 476 
Pressure, response to, 163 
Pressure sensitivity, 303-304 
Pretests, importance in longitudinal research, 
26, 27 
Pretraining, 321, 341, 367 
Preverbal period and problem solving, 426 
Primary mental abilities test, 478 
Privacy of families and clients, 987 
Privileged knowledge of child, 653 
Probability, objective, concept of, 447 
subjective, 447-448 
Problem solving, 180-182, 422, 426 
genetic approach (Piaget), 427 
in later childhood, 190-192 
Procedures, changes during longitudinal study, 
63-65 
Productive and receptive control, 521, 527- 
528, 531 
Projected size, 344-345, 349, 350 
Projective behavior, congruence with social, 
SE EEA 
Projective materials for measuring need
achievement, 803
Projective pictures (see also specific tests 
such as Blacky and TAT), 719-723 
Projective procedures for study of parental 
behavior, 979ff. 
Projective questions in interview, 576-578 
Projective responses, in appraisal of aggres- 
sion, 784-792 
in appraisal of anxiety, 799-800 
source of personality data, 777 
Projective situations, 606 
Projective stimuli, ambiguity in, 611 
Projective technique, implication of, 777 
validity of, 777 
Projective techniques, 603-644, 666-668 
categorization of, 603 
conceptual background of, 607 
influences on growth, 607-608
Projective techniques, nature and definition,
608-609
and perception theory, 613-614 
in personality study, 636 
and S-R theory, 613-614
in therapy, 606, 630 
verbal, 714-715 
Projective tests, ambiguity and goals, 736 
cartoons, 605, 629 
choice of stimuli, 737 
contents, 739-740 
interpretation of protocols, 737 
multiple-choice endings, 738 
in parent-child relationships, 976 
structure of, 736-741 
unambiguous, 737-738 
Protein synthesis, 255, 277 
Proto-theories, defined, 16ff. 
types of, 17-18 
values of, 18 
PSE (point of subjective equality), 286 
Pseudoconditioning, 168, 297-298 
controls for, 378, 384 
Psychoanalysis in child development theories,
6-7, 59
Psychoanalytic theory, influence upon pro- 
jective techniques, 607, 612-613 
predictions from, 27 
Psychodrama, 606 
Psycholinguistics, 517, 551, 554-557 
Psychological ecology, 22 
Psychological specificity in observational child 
study, 125 
Psychological theories and projective tech- 
niques, 612-615 
Psychometric functions, 314 
Psychophysical methods, 312-315 
absolute judgment, 317 
average error, 286 
bias in, 313, 318 
constant methods, 287 
constant stimuli, 304 
fractionation, 317 
limits, 286, 299 
ascending, 286 
descending, 286 
in detecting thresholds, 313-314, 316, 328,
334
ratio judgment, 317 
reproduction, method of adjustment, 313, 
348, 361 
single stimuli, 31 
substitutes for infants, 318 
Psychophysics, criteria of judgment, 289 
defined (see also Thresholds), 285 
experience of observer, 290 
Psychosexual development, Blacky Test of, 
723-724 
Punitiveness of parent, 984 
Pupillary reflex, 288 
Pursuit movements of eyes, 332 


Quantitative scales, 654
Quantity, conservation and principle of, 446
numerosity, perception of, 359-360
physical invariants of, 446
Questionnaire, ambiguities in, 996
on family, 992
measures of anxiety in children, 794
personality data source, 776
Questionnaire approach, conditions for use
of, 661, 669
Questionnaires, mail, of child-rearing prac-
tices, 977
usefulness and suitability, 669, 711
Questions, concreteness of, 672
disguised, 655
impersonal, assumptions in use of, 655
projective approaches, 666-668
indirect, 657
open-ended, 655, 998 
personal, 655, 666 
with props, 671 
structured, alternative-response, 655, 664 
techniques of asking, 670-671 
undisguised, 655 


Radiation control, 214-215 
Radiographic distortion, 214 
Radiographic technique, 211 
Rage, 163 
Rajput families, 924, 925, 936, 938 
Random groups, 149-150 
Rapport, introductory period, 652 
in testing preschool children, 467 
Rating, 711 
anatomic, 223, 224, 226, 227 
anxiety for, 797-799 
intelligence, 474-475 
interpolation, 226 
by professional personnel, 750-751 
Rating categories, anatomic, definition of, 
225-227 
Rating scales, aggressiveness, 783 
anatomic, 225-227 
from nursery-school teachers, 783 
photographic, 216, 226 
Ratio IQ, 469
Ratio judgments, method of, 317 
Rationalization, 922, 933 
Rational learning, 180-182 
Raven progressive matrices, 483 
Raw score in intelligence tests, 469 
Reaching, 164, 169 
Reaction time, 169, 190 
Reading, related to abstract forms, 361 
transfer of skills to, 362-363 
Real-life situations, 655 
Reasoning, 180-182 
Recall data of parents’ backgrounds, 949
Receptivity, somesthetic, 302-303 
vestibular, 302-303 
Receptor functions, 284-307 
Recognition, of form, 338 
method of, 315
Records, continuous, 742-743
county, to obtain subjects, 985-986 
parents’, 752-753 
of professional agencies, 749-751 
specimen, 73, 75, 83-92 
Recording, speech (see also Speech entries),
490-491
Recording of conversations, 988 
Recording interpersonal behavior, 875-876 
Reduplication, in child speech, 543 
Reference, linguistic, see Linguistic reference 
Reflex, neck, 288, 291 
tensor tympani, 296 
Reflex response to pain, 301 
Reflexes, auditory, 295-296 
Reinforcement, 320, 327 
withdrawing of, 182 
Reinforcement conditions, 190-191 
Reinforcers, 143, 159, 162, 169, 170, 176-179, 
181-184, 187-192 
classes of, 191-192 
Reinforcing stimulus, 162, 166, 169-171, 177 
Reiz Limen, 286 
Reliability (see also Specific tests and/or types 
of measurement) 
of coder, 837 
equivalent form, 470-471 
examiner, 472 
interobserver, 781, 906 
Kuder-Richardson formula, 471 
of physical measurements, 232-233 
retest, 470, 683, 774 
scores, 472 
split-half, 471 
Repeated measurements, 49-50 
absence of single observational technique,
47
susceptibility to post hoc analysis, 53 
Replication, as commitment to theory, 59 
in longitudinal study, 54 
Reports, of attitude, 776-777 
of behavior, 776 
maternal, of aggression in children, 784 
retrospective, 44-45, 977, 982 
verbal, 710-711 
evaluation of, 712-714 
family, objectivity in, 991 
sources of distortion, 991-998 
Reproof, 181 
Reproduction, method of (adjustment), 313, 
348, 361 
Research, child, effects on subjects, 652-653, 
669, 681 
public reaction to, 28, 33 
special problems, 22 
Research design, accuracy in physiological 
studies, 257, 261 
choice of, 39, 43 
general considerations, 38-39 
natural experiments, 48-49 
sampling procedures, 39-40 
theory, in selection of, 50, 59 
Researchers, as expert, 992-993, 905 
Research goals, exploration, 51-52
test of hypotheses, 54-55 
variation study, 52-54 
Research interview, 568-569 
Research purpose, concealment of, 651 
Research purposes, child’s understanding of, 
651-654 
explanation to child, 651-652 
Respiration, receptor function measure, 301 
temperature sense test, 304 
Respiratory changes, 304 
Respiratory response, 157 
Respondent, choice of, 995 
Response, electrodermal (see also Galvanic 
skin response) 157-158, 188 
nonreinforcement of, 163
prevention of, 163 
respiratory, 157 
sucking, 158-159, 162 
withdrawal, 157 
Response choices, prelisted in questionnaires, 


Response consistency and inference of values, 
647, 683-684 
Response generalization, 164 
in paired associate learning, 495-496 
to pinprick, 301 
Responses, precoded
993-994
unstructured (see also Interview), 662 
Response set in reports, 776-777 
Response tendencies, 771-772, 777 
Responsibility training, 938 
Restriction of environmental interaction, 164 
Retention, 167 
Retest reliability, 470 
in construct validation, 774 
Retinal image, 344-345 
Retrospective reporting of child-rearing prac- 
tices (see also Reports), 977, 982 
Revealed difference technique, in parental 
relationships, 961 
in parents’ interactions with older children, 
8i 
Reversibility in problem solving, 429
Reversible figures, 330 
Reversible operations in language learning, 
442 
Ringer (in groups), 824 
Roentgenograms, in body-fat assessments, 269 
research usefulness, 210-211 
Roger’s test of personality, 973-974 
Role-playing, 606 
attitude and value measurement, 680 
Rorschach test, 342-343, 603, 610, 612-619, 622 
examiner's influence, 740 
movement response, 614 
norms validation, 725-726
Rosenzweig picture frustration study, 723 
children’s form, 605, 629-630, 787 
Rote learning, 402-412 
definition of, 403 
motor, 403, 407 


in question wording, 
Rote learning, motor, complex skills, 410
paired-associate, 408-409 
serial, 410 
verbal processes in, 407 
verbal, 403ff. 
complete presentation, 404 
item presentation, 404 
list, 403-404 
materials for, 404 
paired-associate, anticipation method, 405 
complete presentation, 405 
definition, 405 
difficulty of, 407 
prereading subjects, 406 
prereading subjects, 404 
response measures, 406 
serial, anticipation method, 405 
complete presentation, 404 
definition, 404 
limitations, 405 
Rotter incomplete sentences blank (see also 
Sentence completion tests), 628 
Running time, 190 
Rural children and information tests, 479 
Rural groups and intelligence tests, 482 
Rural-urban location and family setting, 954 
Russian language, 543-544 


Sacks sentence completion test (see also Sen- 
tence completion tests), 628 
Salivation, 167 
Sample, nonrepresentative, 100
representative, generalizing from, 1003 
world ethnographic, 875 
Sample bias, 185-186, 987 
Sample selection, 154, 173 
in longitudinal research,
Sampling methods, of ins 
of observations, 781 
observations of groups, 836-837 
single acts, 866-867 
Sampling problems, 148, 173, 185-186 
Sampling procedures, matched group, 41 
natural experiments, 48-49 
random sample, 40, 48 
replication, 40 
stratified sample, 40 
unsampled subjects, 40 
Sampling unit, primary, 874-875 
Satiation, 693 
Satiation-deprivation, 187-188, 190-191 
Saturation in color experiments, 333 
Scalable items and problems, 451 
Scale, categories, 327-328 
conceptual, 317 
extent of, 315 
marital adjustment, 966 
measurement, anatomic, 228 
mental development, 426, 438 
personality, 745-746 
psychological, 317
quantitative, 654 
rating, anatomic, 226 


igations, 867-870 
Scale level in size perception, 345
Scale score, 677 
Scale steps, 314 
Schedule interaction, 188-189 
Schema (Piaget), definition of, 428-429 
Schemata (Piaget), 428-429 
Schizophrenic adults and parental authority, 
982-983 
Scholastic aptitude, 462 
Schooling, urban-rural differences, 482 
School research, administrative approval, 32 
description for administrators, 32 
interference with school activities, 33 
Schools, and family privacy, 987 
as subject source, 185, 986 
Science, philosophy of, 8ff. 
Scientific theory, confirmation and discon- 
firmation of, 9 
as a deductive system, 8 
terminology, 10 
Scorer reliability, 472 
Scotopic sensitivity, 288 
Seashore test for pitch and loudness discrimi- 
nation, 314 
Self-concept, 777 
Self-esteem, 809 
Self-evaluations of children, 975 
Self-ratings, 677-678 
Semantic differential in psycholinguistics, 519 
Sensitivity, auditory, 295-298 
to color (wavelength), 160 
differential, 333 
to pain, 301-302 
scotopic, 288 
to thermal stimulation, 161 
Sensitization, 158 
Sensori-motor intelligence, 426-429 
Sensori-motor processes, 421-429, 436, 445 
Sensory stimulation, 690-691 
Sensory toys, 632 
Sentence classification, 508-509 
Sentence completion in aggression study, 787 
techniques in interviews,
557
Sentence-completion tests, 605, 628, 655, 716-717
Sentence completion to assess mothers, 969
Sentence structure in child speech (see also 
Language development, Speech and 
language development), 494, 506-509 
definition and classifications, 506-509 
terminal status measure, 494 
Separation from mother, 693 
Sequential tests of educational progress
(STEP), 349
Serial learning techniques, 179 
in later childhood, 189-190 
Settings in observational child study, 76, 77 
Settlement patterns, 938, 941 
Sex differences, in expression of need achieve- 
ment, 806 
in sensitivity to discord, 964-966 
Sex-role reversal, 963 


progress 
Shape constancy, 351-352
Shock, as UCS, 158 
Sibling composition, 957-959 
Signal system, second in speech, 539 
Sign in projective responses, 777 
Single acts, in interaction, 861 
quantification of, 890 
Single stimuli, method of, 317 
Simultaneous choice, 321 
Simultaneous comparison, 345 
Siriono, 935 
Sitting, 164 
Situational influences in attitude and value 
assessments, 683-684 
Situations, contrived, 655 
contrived and anxious behavior, 800-801 
observations of, 76-77, 776 
real-life, 655 
representative, 872 
special, 708-709 
threatening, 692-698 
Situation tests, 607, 632, 634 
Size of angles, perception of, 317 
Size constancy, 345ff. 
representative design for, 350
Size perception, central tendency in, 345, 347 
Size of unit, 835, 836 
Skin senses, 334 
Sleeping arrangements, 925, 937 
Smiling, 162, 691 
Smiling response, 162, 168, 357-358 
Smith vocabulary test, 504-505 
Snellen acuity, 293 
Snellen chart, 339 
Snellen fraction, 292-293 
Social attitudes, 648, 675 
Social awareness in young children, 648 
Social behavior, 146 
congruence with projective, 732-736 
Social class, and family setting, 949-954 
measurement of, 951 
Social-class bias, 483 
Social climate in groups, 830 
Social desirability as response set, 776-777 
Social distance, 674 
Social-distance scales, 840 
Social-emotional roles in family, 1000-1001 
Social interaction, positive, 907 
Social matrix, cliques in, 843, 847 
Social orientations, 779 
Social perception, 313, 359 
animal experiments in, 359 
Social position, index of, 951 
Social positions and categories, 707-708 
Social preference, 843 
Social reaction interview, 978 
Social relations, Syracuse scales, 844 
Social stimuli, 165 
Society for Research in Child Development, 6 
Socioeconomic classes, 479 
Sociological activities, 864-865 
Sociological interactions, 862ff. 
Sociometric choices, 753-754 
Sociometric indices, 773
Sociometric method, 840-850, 930-931
pictorial materials, 672-675, 753-754 
Sociometric questionnaire, 841 
Solid objects, perception of, 335 
Somesthetic receptivity, 302-303 
Sorting tests for cerebral injury, 442 
Sound recording of child behavior, 124 
Sound spectrograph, 497-499 
Space perception, 320, 334 
Special aptitudes, 463-464 
Special-aptitude tests, 462 
Specific gravity as measure of body composi- 
tion, 270 
Specificity, assumption of, 775 
Specimen day record, 85 
Specimen description, 73, 75, 83-92 
analytical schemes in, 
Specimen records, 83-92, 743 
reliability of, 86-87 
Speech (see also entries under Language, 
Linguistics, and specific entries such as 
Phoneme, Rote learning, and Words),
487-557 
content analysis, 527-528 
egocentric, defined, 509-510 
gender, 544
grammar, transformations in, 549 
infant, 488, 490-492, 497 
inflexion, English, 545-547 
plural, 540, 545-546 
possessive, 518, 541, 546 
and language development, areas of, 488
bilingual child, 494, 529, 544, 553-554
comparative studies, 493-497 
cross-sectional studies, 491-492 
diary of, 529-530, 542-543 
longitudinal studies, 491-493 
mastery of sentence structure (see also 
sentence structure), 488, 492, 506-509 
phonetic, in children, 492 
phonetic, in infants, 490-491 
in primary aments, 500 
sentence classification, 508-509 
vocabulary test, 503-506 
and language development measures, dif-
ferential percentage index, 494
expression unit, 507 
grammatical form, 508-509 
length of utterance, 508 
parts of speech, 508-509 
terminal status, 494, 510
thought unit analysis, 488, 507 
vocabulary, 488, 492, 503-506
and language development research, evalua- 
tion of methodology, 511-512 
observer reliability, 490-491 
problems in classifying, 487-488 
parts of, 508-509, 548-553 
adjectives, 545 
comparatives, 547 
mass nouns, 552-553 
nouns, 549-551, 553 
Speech, parts of, past tense, 519-520, 546
prefixes and suffixes, 543 
verbs, 544, 546, 550-551 
perceptual factor in, 501 
socialized, defined, 509 
transcription of, observer reliability, 490-
491
phonemic, 521 
phonetic, 521-523, 526-527, 529 
Speech analysis instruments, heterodyne wave 
analyzer, 498 
Speech defects, articulation tests, 499-506 
articulatory errors, 499-506 
in grade-school children, 500 
in nursery-school children, 499-500 
in preschool children, 500 
scaling methods, 494 
types of, 499 
cerebral-palsied children, 512 
tests for, 500-501 
cerebral palsy, electroencephalography
(EEG) in, 495
comparative studies, 495 
incidence of, 487 
research needs in, 511-512 
stuttering, 500 
electroencephalography in, 495 
electromyography in, 495 
and personality of mother, 968 
test construction, 499-506 
tests for, 499-506 
Speech errors, developmental (see also speech 
defects), 488 
as measures of anxiety, 801 
Speech sound analysis, instruments, 497ff. 
oscillograph, 497 
phonophoneloscope, 497 
sound spectrograph, 497-498 
Speech sound analysis, measures, 497-498 
Speech sound discrimination, definition of, 501
terminal status measure, 494
tests for, 501-503
Speech sound stimulation, maternal, 495 
Speech sounds, behavioral unit, 493 
consonant blends, 489 
consonants, categories, 489-490 
definition and description, 489-490 
position in words, 502 
gisas ewo 
diphthongs, 4 
International Phonetic Alphabet, 488-490
phoneme frequency, 492-493 
phonemes (see also Phones and Phoneme), 
489 
phoneme types, 492-493
relative to difficulty of, 499-500 
voiced-voiceless contrast, 489, 519-525, 530 
vowels, 488-489
auditory identification of, 498-499 
resonance pattern, 498-499 
Speed test, 461 
reliability of, 472 
Speed-time-space concepts, 445-447 


electroencephalography 
Sphygmomanometer, 193

Splinter subgroups, 833 

Spontaneous recovery, 158, 167

Stabilimeter, 160, 163, 288, 318 

Stability in groups, 835 

Stages, conceptual and causality, 435 

Standard inventories, 711-712 

Standard personality scales, 745-746 

Standard scores, 470 

Standard words, definition, 503 

Stanford achievement test, 

Stanford-Binet intelligence test, 457, 465-466,
472, 475-476, 479

Stanine, 470 

Stars in groups, 843 

Startle response, 295 

Statistical analysis in observational research, 


448 
Statistical generalization, 986-987 
and sample population, 1003-1004 
Statistical models in problem solving, 450-452 
Status in the family, 990 
Status characteristics, index of, 951 
Step size between stimuli, 334 
Stereopsis, tests for, 294
Stereotypy as response set, 776 
Stimulus, comparison, 314 
dimensions of, 312, 317 
frustrating, 156, 163-164, 171, 182, 192-193 
negative, 169 
positive, 169 
standard, 33 
sudden and/or strong, 163-164, 171-172,
182, 192-193
variable, 313 
Stimulus characteristics, 771-773 
Stimulus conditions, optimal, 328 
reduced, 328 
Stimulus generalization, 388-389 
definition of, 394-395
as function of age, 1 
gradient of, 394-395 
measures of, 395 
methods of study, 3 
in paired associate learning, 495-496
Stimulus position, control of, 315 
Stimulus-response theory and projective tech-
niques, 613-614
Stimulus situations in attitude and value
measurement, 655
Stories, incomplete, 717-719 
reactions to, 716 
reflecting need achievement, 803-804
Story completion, 629, 777 
with guided questions, 718-719
in parent-child relationships, 967-968 
techniques in interviews, 587
Story elaboration test, 629
Story memory, 716 
Story techniques, 715-719 
Story telling, 629, 715-716 
Story tests, objective, 719 
Stratification in groups, 828, 835 
Street figures, 329

Structured interview, special informants, 750 
Stuttering (see also Speech defects), 190-191 
Subculture membership and family setting,
954
Subgroup formation, 249) 833 
Subjects, cooperation of in anatomical studies, 
203, 219 
deception of, 28
of observational studies, 75 
obtaining of, access to home, 31 
access to schools, 32 
early childhood, 172-173 
families, 985-986 
infants, 148, IBIS 
introductory letter, 31 
late childhood, 18 
schools (see also School research), 32 
positioning of, 205-207, 212-213, 219 
postexperimental reports to, 31 
responsibility for, in child research, 28
vulnerability to experimental procedures, 
28 
Subjects’ parents, consent of, 30 
deception of, 29 
Subjects selection, 173, 757-759 
age levels, 758-759 
availability and representativeness, 758 
homogeneity, 758 
ponios factors in, 262 
source of, day-care centers, 172 
institutions, 152-154, 185-193 
schools, 185, 986
Submission categories, 897 
Sucking response, 158-159, 162, 168 
measure of taste, 299 
Suffixes, in child's speech, 543 
Suggestibility, in child, 659 
Surrogate mothers, 359 
Swallowing response, 166-167
Symbolic activity, 437 
Symbolic forms, identification of, 361 
Symbolic functions, acquisition of, 436 
Sympathetic behavior, observation of, 901-903
Sympathetic stimuli, 901
Sympathy sarah responses to, peat 
Syncretic perception, 342 
Syntax, in child speech, 519, 539 
form class, 548-553 
immediate constituents, 548 
parts of speech, 548-553 
transformations in grammar, 549
Syracuse scales of social relations, 
Systematic aim in observational study, 78, 82, 
oR, 103-104
Systematic applications of observational study,
129
Szondi test, 605, 635-636 
Tachistoscopes, 328, 362 


Tactile-kinesthetic impressions of causality, 
355 
Tactual perception of form, 337

Tactual space, 337

Tarongans, 940

Task orientations, 779
Task-oriented behavior and motivation, 8o2~ 


Bo: 
Taste, FA 
miy 299 
and sucking responses, 2 
‘Taste rey method Mienie response, 


300 
preference method, 300 
Tautophone, 604
Teacher rating and observation, correlation
between, 783
Teachers and professional personnel as in-
formants, 749-750
Technique, as type of custom, groff. 
Technology, 919
Temperament, 772
Temperatures, 156, 161
Temperature sense, control of touch, 304 
Temperature sensitivity, 303-304
Temporal maze, 425 
Temporal relations, qs
Terman vocabulary list, 504-505
Termination of study, 193-154 
Terminology and definitions, 16 


Texan families, 925, 8 
Test sdminietration. wu inane of, 465-466, 
744 

examiner's influence on, 740-741 

problems of,
other ey kat Aeaee seid agli che 

k ae, 794 

in classroom,

correlated with ratings of, 798

sex differences in, 795 
Test-anxiety scale, 794 798 

correlated with audience-anxiety scale, 797 
Testing, elementary children, 467

preschool children, 46 
‘Testing procedure, Ry . 
Test length and reliability, 472-473 
Ter reabili 6s 

est reliability, 4) 73. OBS 
Ei laret and preschool, 473 

ests, . 

SRS Besson ality, relation between, 

as 
achievement, 464-465 
effects of Fam tan, on, 464 


458 
infant, 457, 465, 473, 476
infant and preschool, reliability of, 473 
intelligence, 461, 475 
internal consistency, 475 
motor, 463 
Tests, nonlanguage, 458-459


paper-and. sk 
personality and iiticy, relation between, 


foc Praca ytedidloapped, garde 


power, 461 
preschool, ei 465, 473, 475476 
sources of information on, 456-457 


1 
Pretty of, 472 
me, 456-484 

Test POONA, Cokan factors in, 470-484 
Test sophistication, 468
Test validity, 473-475, 682 
Thematic apperception test, Gog, 613-615, 621-
625, 719-723
and achievement motive, 721-722, Rog 
and child-rearing practices, 977 


N 86 
Sa tinae and child personalities, 957-958 


Theoretical controversies, effects on psycho- 
logical research, 6ff.
Theoretical integration in observational study, 


n 
Theoretical models, in family-life study, 947 
use of, 2 
Theories of child development, 9
in Spi i 47 
tests of hypotheses in, 54, 59
Theory, PE Carag development from child 
study, l. 


naive, 17-18 
building, 8 
antitheoretical position, 19 
Theory of change, 11 
Thermal stimulation, 161 
Thinking, 180-183, 421-422, 446 


aio 
Thorndike word list, 7 so 
Thought contents, 452-4 
Threatening eral runton, do- 
Three-dimensional di 


ó: 
Threshold, 2 mi 
sheata, 8.18 yea 
objective versus subjective responses,


critical flicker frequency (CFF), 390
differential, 288, 313-314


Aspens 
Thresholds, in older children, 289
Tilt, discrimination of, 337 
perception of, 363 
tilting chair, 363 
tilting room, 363 
Time, as variable in family research, 952-953 
Time perception, 360-361 
Time samples, uses and abuses of, 748 
Time sampling, 73, 92-99, 123, 746-749 
advantages and limitations, 99-100 
reliability of, 98 
Token-reward machine, 179-181
Tools in problem solving, 423-425, 429 
Topographical development, 145 
Toys, miniature, 607, 631 
sensory, 632 
Traditional family ideology, test of, 969
Trait rating, 73, 75, 113-117 
Transfer, 424 
and pretraining, 367 
Transposability, 338 
Transposition, 339-340, 401 
experiments in early childhood, 177, 181 
relation of language to, 338-339 
Traumatic events and neglect, 706-707 
Trial and error, 423-424, 429 
T score, 470 
Twins, fraternal, 146-147, 164 
identical, 146-147 
linguistic learning in, 441 
relationships of, 958 
Twin studies, 986 
rere discrimination, 337-338, 361- 
362 
Two-point limen, 314, 334, 337 
Two-team organization, 67


Unconditioned stimulus, 156-158, 168-169 
selection of, 379, 384 
shock, 158, 167 
Under-constancy, 348 
Unique events, 709 
Units of action (Heinicke), 112 
Unit time measure, 912 
Urban children and intelligence tests, 479 
Urban groups and intelligence tests, 482 
Urban-rural differences, 482 
Urinary metabolites, 258 
U. S. Office of Education, 821 


Validity, achievement tests, 473-474 
concurrent, 473-474 
construct, 474 
criterion, 332 
factorial, 474 
personality variables, 812 
predictive, 473-474 
tests of motives, 756-758 

Value, concepts of, 645-650 
as type of custom, groff. 

Value experiments, 326-327 

Value system, 920-922 
justification of, 921-922 
Variability, effect on reliability, 472
Variable, contamination, 992 
dependent, 140-141 
selection in longitudinal research, 27 
historical, 144-146 
independent, 140-141 
interaction of, 331-332 
internal, 327-328, 329, 331 
Variation, behavior, reduction of, 39 
of conditions, 48, 49 
individual, 37 
in longitudinal studies, 43 
sources, in special groups, 43 
subgroups, 42 
Variation study, 52
within age groups, 52 
longitudinal, 53 
Verb in child speech, 544-546, 553, 549-551
Verbal behavior revealing thought, 431-432 
Verbal conditioning, 538-539 
Verbal development, 165 
Verbal indices in attitude and value measure- 
ment, 654, 656-659 
Verbalization (see also Speech and language 
development), 506ff. 
Verbal learning (see also Rote learning, ver- 
bal) 403-405 
paired associate, 495-496 
Verbal problems, 431-432 
Verbal reports, 710-711 
evaluation of, 712-714 
Verbal sequences, 179-180 
Verbal story cues and construct validity, 804 
and need achievement, 803-805 
reliability of scores, 804 
Verbal summator, 328-329, 604, 620-621 
Vestibular nystagmus, 303 
Vestibular receptivity, 302-303 
Vestibular testing, training technique, 303 
Vicarious trial-and-error behavior, 192-193 
Victor of Aveyron, 534-535 
Viscidity, in groups, 835 
Vision, binocular, 335 
monocular, 335 
stereoscopic, 335-336 
Visual acuity, 292-295, 339-340 
definition, 292 
foveal, 29 
illiterate E test, 294
luminosity, 292 
minimum separable, 292 
minimum visible, 292 
picture test, 294 
Snellen versus optokinetic, 293 
variability of, 294 
Visual angle, perception of, 335 
Visual cliff, 336 
Visual depth discrimination, 336 
Visual fixation, 160, 169 
Visual functions, 287-295 
Visual sensitivity, photopic, 288, 291 
scotopic, 288, 291 
Visual size, perception of, 334-335 
Visual stimulation, 160ff.
Visual threshold, 287-295 
Vitamin balances, 255 
Vitamins, precursors of hormones, 255, 280 
Vocabulary, observations of, 437 

of recognition, 506 

standard word, defined, 503 

terminal status, measure, 494 

of use, 506 

word approximation, defined, 503 
Vocabulary measurement (see also Speech 

and language development), 502-506 

Vocabulary tests, 503-506 

Ammons picture, 505-506 

picture, multiple-choice, 505 

Seashore-Eckerson, 506 

Smith, 504-505 

standardization and validation, 503-504 
Vocalization, 162, 168-169, 171 


War as family-setting variable, 953-954 
Weaning, cultural variation in, 933-935, 937- 
938, 940 
Wechsler intelligence scale for children
(WISC), 458-459, 470, 479

Weight, perception of, 326, 334 

Wellman test, 492 

Whole-part, perception of, 340-342 

Wisconsin general test apparatus, 179ff. 

Withdrawal response, 157 

Word approximation, definition, 503 

Word association, 439, 549-550 

Word association test, 604, 619-620 

Words, definition of, 537-538 

Work output, 190 

World test, 607, 631 


Yale developmental examination, 457 
Yerkes alley, 170 


Z scores, 470 

Zone defense in observation, 881 
Zuni, animism in thinking, 435 
Zuni families, 935-936, 938-939 